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Topic Outline for AP® Statistics 
from the College Board’s AP® Statistics Course Description 


|. Exploring data: describing patterns and departures from patterns (20%-30%) 


A. Constructing and interpreting graphical displays of distributions of univariate data (dotplot, 
stemplot, histogram, cumulative frequency plot) 
1. Center and spread 
2. Clusters and gaps 
3. Outliers and unusual features 
4. Shape 
B. Summarizing distributions of univariate data 
1. Measuring center: median, mean 
2. Measuring spread: range, interquartile range, standard deviation 
3. Measuring position: quartiles, percentiles, standardized scores (z-scores) 
4. Using boxplots 
5. The effect of changing units on summary measures 
C. Comparing distributions of univariate data (dotplots, back-to-back stemplots, parallel boxplots) 


1. Comparing center and spread 

2. Comparing clusters and gaps 

3. Comparing outliers and unusual features 

4. Comparing shape 
D. Exploring bivariate data 

1. Analyzing patterns in scatterplots 

2. Correlation and linearity 

3. Least-squares regression line 

4. Residual plots, outliers, and influential points 

5. Transformations to achieve linearity: logarithmic and power transformations 
E. Exploring categorical data 

1. Frequency tables and bar charts 

2. Marginal and joint frequencies for two-way tables 

3. Conditional relative frequencies and association 

4. Comparing distributions using bar charts 


Il. Sampling and experimentation: planning and conducting a study (10%—-15%) 


The Practice of Statistics, 5th ed. 
Chapter and Section references 


Dotplot, stemplot, histogram 1.2; 
Cumulative frequency plot 2.1 
1.2 

1.2 

1.2 

1.2 

1.3 and 2.1 

1.3 

1.3 

Quartiles 1.3; percentiles and z-scores 2.1 
1.3 

2.1 

Dotplots and stemplots 1.2; 
boxplots 1.3 

1.2 and 1.3 

1.2 and 1.3 

1.2 and 1.3 

1.2 and 1.3 

Chapter 3 and Section 12.2 
3.1 

3.1 

3.2 

3.2 

12.2 

Sections 1.1, 5.2, 5.3 

1.1 (we call them bar graphs) 
Marginal 1.1; joint 5.2 

1.1 and 5.3 

11 


A. Overview of methods of data collection 
1. Census 
2. Sample survey 
3. Experiment 
4. Observational study 
B. Planning and conducting surveys 
1. Characteristics of a well-designed and well-conducted survey 
2. Populations, samples, and random selection 
3. Sources of bias in sampling and surveys 
4. Sampling methods, including simple random sampling, stratified random sampling, 
and cluster sampling 
C. Planning and conducting experiments 
1. Characteristics of a well-designed and well-conducted experiment 
2. Treatments, control groups, experimental units, random assignments, and replication 
3. Sources of bias and confounding, including placebo effect and blinding 
4. Completely randomized design 
5. Randomized block design, including matched pairs design 
D. Generalizability of results and types of conclusions that can be drawn from 
observational studies, experiments, and surveys 


Sections 4.1 and 4.2 
41 

41 

4.2 

4.2 

Section 4.1 

41 

41 

41 

41 


Section 4.2 
4.2 
4.2 
4.2 
4.2 
4.2 
Section 4.3 


Topic Outline for AP® Statistics The Practice of Statistics, 5th ed. 
from the College Board’s AP® Statistics Course Description Chapter and Section references 
lil. Anticipating patterns: exploring random phenomena using probability and simulation (20%-30%) 
A. Probability Chapters 5 and 6 
1. Interpreting probability, including long-run relative frequency interpretation 5.1 
2. “Law of large numbers” concept 5.1 
3. Addition rule, multiplication rule, conditional probability, and independence Addition rule 5.2; other three topics 5.3 
4. Discrete random variables and their probability distributions, including binomial and geometric Discrete 6.1; Binomial 
and geometric 6.3 
5. Simulation of random behavior and probability distributions 5.1 
6. Mean (expected value) and standard deviation of a random variable, and linear Mean and standard deviation 6.1; 
transformation of a random variable Linear transformation 6.2 
B. Combining independent random variables Section 6.2 
1. Notion of independence versus dependence 6.2 
2. Mean and standard deviation for sums and differences of independent random variables 6.2 
C. The Normal distribution Section 2.2 
1. Properties of the Normal distribution 2.2 
2. Using tables of the Normal distribution 2.2 
3. The Normal distribution as a model for measurements 2.2 
D. Sampling distributions Chapter 7; Sections 8.3, 
10.1, 10.2, 11.1 
1. Sampling distribution of a sample proportion 7.2 
2. Sampling distribution of a sample mean 7.3 
3. Central limit theorem 7.3 
4. Sampling distribution of a difference between two independent sample proportions 10.1 
5. Sampling distribution of a difference between two independent sample means 10.2 
6. Simulation of sampling distributions fA 
7. t distribution 8.3 
8. Chi-square distribution 11.1 
IV. Statistical inference: estimating population parameters and testing hypotheses (30%-40%) 
A. Estimation (point estimators and confidence intervals) Chapter 8 plus parts of Sections 9.3, 10.1, 
10.2, 12.1 
1. Estimating population parameters and margins of error 8.1 
2. Properties of point estimators, including unbiasedness and variability 8.1 
3. Logic of confidence intervals, meaning of confidence level and confidence intervals, 8.1 
and properties of confidence intervals 
4. Large-sample confidence interval for a proportion 8.2 
5. Large-sample confidence interval for a difference between two proportions 10.1 
6. Confidence interval for a mean 8.3 
7. Confidence interval for a difference between two means (unpaired and paired) Paired 9.3; unpaired 10.2 
8. Confidence interval for the slope of a least-squares regression line 12.1 
B. Tests of significance Chapters 9 and 11 plus parts of Sections 
10.1, 10.2, 12.1 
1. Logic of significance testing, null and alternative hypotheses; P-values; one-and two-sided tests; 9.1; power in 9.2 
concepts of Type | and Type II errors; concept of power 
2. Large-sample test for a proportion 9.2 
3. Large-sample test for a difference between two proportions 10.1 
4. Test for a mean 9.3 
5. Test for a difference between two means (unpaired and paired) Paired 9.3; unpaired 10.2 
6. Chi-square test for goodness of fit, homogeneity of proportions, and independence Chapter 11 
(one- and two-way tables) 
7. Test for the slope of a least-squares regression line 12.1 
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To the Student 


Statistical Thinking and You 


The purpose of this book is to give you a working knowledge of the big ideas of statistics and 
of the methods used in solving statistical problems. Because data always come from a real- 
world context, doing statistics means more than just manipulating data. The Practice of Statistics 
(TPS), Fifth Edition, is full of data. Each set of data has some brief background to help you 
understand what the data say. We deliberately chose contexts and data sets in the examples and 
exercises to pique your interest. 

TPS 5e is designed to be easy to read and easy to use. This book is written by current high 
school AP® Statistics teachers, for high school students. We aimed for clear, concise explana- 
tions and a conversational approach that would encourage you to read the book. We also tried 
to enhance both the visual appeal and the book’s clear organization in the layout of the pages. 

Be sure to take advantage of all that TPS 5e has to offer. You can learn a lot by reading the 
text, but you will develop deeper understanding by doing Activities and Data Explorations and 
answering the Check Your Understanding questions along the way. The walkthrough guide on 
pages xiv-xx gives you an inside look at the important features of the text. 

You learn statistics best by doing statistical problems. ‘This book offers many different types 
of problems for you to tackle. 


e Section Exercises include paired odd- and even-numbered problems that test the 
same skill or concept from that section. There are also some multiple-choice ques- 
tions to help prepare you for the AP® exam. Recycle and Review exercises at the 
end of each exercise set involve material you studied in previous sections. 

e Chapter Review Exercises consist of free-response questions aligned to specific 
learning objectives from the chapter. Go through the list of learning objectives 
summarized in the Chapter Review and be sure you can say “I can do that” to each 
item. Then prove it by solving some problems. 

e The AP® Statistics Practice Test at the end of each chapter will help you prepare 
for in-class exams. Each test has 10 to 12 multiple-choice questions and three free- 
response problems, very much in the style of the AP® exam. 

e Finally, the Cumulative AP® Practice Tests after Chapters 4, 7, 10, and 12 provide 
challenging, cumulative multiple-choice and free-response questions like ones you 
might find on a midterm, final, or the AP® Statistics exam. 


The main ideas of statistics, like the main ideas of any important subject, took a long time to 
discover and take some time to master. The basic principle of learning them is to be persistent. 
Once you put it all together, statistics will help you make informed decisions based on data in 
your daily life. 


TPS and AP® Statistics 


The Practice of Statistics (TPS) was the first book written specifically for the Advanced Place- 
ment (AP®) Statistics course. Like the previous four editions, TPS 5e is organized to closely fol- 
low the AP® Statistics Course Description. Every item on the College Board’s “Topic Outline” 
is covered thoroughly in the text. Look inside the front cover for a detailed alignment guide. The 
few topics in the book that go beyond the AP® syllabus are marked with an asterisk (*). 

Most importantly, TPS 5e is designed to prepare you for the AP® Statistics exam. The entire 
author team has been involved in the AP® Statistics program since its early days. We have more 
than 80 years’ combined experience teaching introductory statistics and more than 30 years’ 
combined experience grading the AP" exam! Two of us (Starnes and Tabor) have served as 
Question Leaders for several years, helping to write scoring rubrics for free-response questions. 
Including our Content Advisory Board and Supplements ‘Team (page vii), we have two former 
Test Development Committee members and 11 AP® exam Readers. 

TPS 5e will help you get ready for the AP® Statistics exam throughout the course by: 


e Using terms, notation, formulas, and tables consistent with those found on the 
AP® exam. Key terms are shown in bold in the text, and they are defined in the 
Glossary. Key terms also are cross-referenced in the Index. See page F-1 to find 


“Formulas for the AP® Statistics Exam” as well as Tables A, B, and C in the back of 
the book for reference. 


¢ Following accepted conventions from AP® exam rubrics when presenting model 
solutions. Over the years, the scoring guidelines for free-response questions have 
become fairly consistent. We kept these guidelines in mind when writing the 
solutions that appear throughout TPS 5e. For example, the four-step State-Plan- 
Do-Conclude process that we use to complete inference problems in Chapters 8 
through 12 closely matches the four-point AP® scoring rubrics. 

¢ Including AP® Exam Tips in the margin where appropriate. We place exam 
tips in the margins and in some Technology Corners as “on-the-spot” reminders 
of common mistakes and how to avoid them. These tips are collected and summa- 
rized in Appendix A. 

¢ Providing hundreds of AP®-style exercises throughout the book. We even added 
a new kind of problem just prior to each Chapter Review, called a FRAPPY (Free 
Response AP® Problem, Yay!). Each FRAPPY gives you the chance to solve an 
AP®-style free-response problem based on the material in the chapter. After you 
finish, you can view and critique two example solutions from the book’s Web site 
(www.whfreeman.com/tps5e). Then you can score your own response using a ru- 
bric provided by your teacher. 


‘Turn the page for a tour of the text. See how to use the book to realize success in the course and 
on the AP® exam. 
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READ THE TEXT and use the book’s features 
to help you grasp the big ideas. 


[ee Scatterplots and Correlation 


WHAT YOU WILL LEARN By the end of the section, you should be able to: 


Read the LEARNING 
OBJECTIVES at the 
beginning of each section. 


e — Identify explanatory and response variables in situations e Interpret the correlation. 
Focus on mastering these where one variable helps to explain or influences the other. © Understand the basic properties of correlation, 
e Make a scatterplot to display the relationship between including how the correlation is influenced by outliers. 


skills and concepts as you two quantitative variables. © Use technology to calculate correlation. 
work through the chapter. e Describe the direction, form, and strength of a 


relationship displayed in a scatterplot and identify outli- 
ers in a scatterplot. 


Explain why association does not imply causation. 


Scan the margins for Take note of the green 


the purple notes, which 
represent the “voice of 


Often, using the regression line 
to make a prediction for x = 0 is 
an extrapolation. That's why the y 
intercept isn’t always statistically 


DEFINITION: Extrapolation 


Extrapolation is the use of a regression line for prediction far outside the interval 
of values of the explanatory variable x used to obtain the line. Such predictions are 


DEFINITION boxes 
that explain important 


meaningful. often not accurate. 


the teacher” giving helpful 
hints for being successful 
in the course. 


vocabulary. Flip back to 
them to review key terms 
and their definitions. 


Don’t make predictions using values of x that are much larger or much 


Few relationships are linear for all values of the explanatory variable. @ 
smaller than those that actually appear in your data. 


Watch for CAUTION 
ICONS. They alert you 
to common mistakes 
that students make. 


Look for the boxes 
with the blue bands. 
Some explain how to ; 
make graphs or set 2 
up calculations while 3 
others recap important 

concepts. 


HOW TO MAKE A SCATTERPLOT 


Decide which variable should go on each axis. 
Label and scale your axes. 


Plot individual data values. 


. | What does correlation measure? The Fathom screen shots below pro- 
Make connections and cain a } vide more detail. At the left is a scatterplot of the SEC football data with two lines 


deepen your understanding / added—a vertical line at the group’s mean points per game and a horizontal line 


x at the mean number of wins of the group. Most of the points fall in the upper-right 


Read the AP® EXAM 
TIPS. They give advice on 
how to be successful on 
the AP® exam. 


xiv 


by reflecting on the 
questions asked in THINK 
ABOUT IT passages. 


AP® EXAM TIP The formula 
sheet for the AP® exam uses 
different notation for these 


: Sy 
equations: b; = fe and 
iy 


by = Y — b,X. That's because 
the least-squares line is written 
as 9 = bo + b,x. We prefer our 
simpler versions without the 
subscripts! 


or lower-left “quadrants” of the graph. ‘That is, teams with above-average points 
per game tend to have above-average numbers of wins, and teams with below- 
average points per game tend to have numbers of wins that are below average. 
This confirms the positive association between the variables. 

Below on the right is a scatterplot of the standardized scores. ‘To get this graph, 
we transformed both the x- and the y-values by subtracting their mean and divid- 
ing by their standard deviation. As we saw in Chapter 2, standardizing a data set 
converts the mean to 0 and the standard deviation to 1. That’s why the vertical and 
horizontal lines in the right-hand graph are both at 0. 


Wins 
© 
. 
rwins 
3 ° 
ey 
#6 


Ss 2 2 2 3 4 45 470-05 00 05 10 15. 
PointsPerGame PPG 
Notice that all the products of the standardized values will be positive—not 
surprising, considering the strong positive association between the variables. What 
if there was a negative association between two variables? Most of the points would 
be in the upper-left and lower-right “quadrants” and their z-score products would 
be negative, resulting in a negative correlation. 


ACTIVITY | I'ma Great Free-Throw Shooter! 


MATERIALS: A basketball player claims to make 80% of the free throws that he attempts. We 
AC TIV Y Reach ng Computer with Internet think he might be exaggerating. To test this claim, we'll ask him to shoot some free 
access and projection throws—virtually—using The Reasoning of a Statistical Test applet at the book’s 
capability Web site. 


1. Go to www.whfreeman.com/tpsSe and launch the applet. 


MATERIALS: Before class, your teacher will prepare a population o: 
200 colored chips, including having the same color (say, red). The parameter is tt 
1400 of the same color; large chips in the population: = 0.50. In this Activity, y 
bag or other container variability by taking repeated random samples of size 


1. After your teacher has mixed the chips thoroughly 
should take a sample of 20 chips and note the sampl 
When finished, the student should return all the chi 
and pass the bag to the next student. 

Note: If your class has fewer than 25 students, have s 
samples. 

2. Each student should record the f-value in a chart 
value on a class dotplot. Label the graph scale from ( 
spaced 0.05 units apart. 

3. Describe what you see: shape, center, spread, and 
usual features. 


2. Set the applet to take 25 shots. Click “Shoot.” How many of the 25 shots did 
the player make? Do you have enough data to decide whether the player's claim 
is valid? 

3. Click “Shoot” again for 25 more shots. Keep doing this until you are 
convinced either that the player makes less than 80% of his shots or that the 
player’s claim is true. How large a sample of shots did you need to make your 
decision? 


4. Click “Show true probability” to reveal the truth. Was your conclusion 
correct? 

5. If time permits, choose a new shooter and repeat Steps 2 through 4. Is it 
easier to tell that the player is exaggerating when his actual proportion of free 
throws made is closer to 0.8 or farther from 0.8? 


DATA EXPLORATION The SAT essay: Is longer better? 


Following the debut of the new SAT Writing test in March 2005, Dr. Les Perelman 
from the Massachusetts Institute of Technology stirred controversy by reporting, 
“It appeared to me that regardless of what a student wrote, the longer the essay, the 
higher the score.” He went on to say, “I have never found a quantifiable predictor 
in 25 years of grading that was anywhere as strong as this one. If you just graded 
them based on length without ever reading them, you'd be right over 90 percent 
of the time.”* The table below shows the data that Dr. Perelman used to draw his 
conclusions.* 


Words: 460 422 402 365 357 278 236 201 168 156 133 


Score: 6 6 5 5 6 5 4 4 4 3 2 
Words: 114 108 100 403 401 388 320 258 236 189 128 
Score: 2 1 1 5 6 6 5 4 4 3 2 
Words: 67 697 387 355 337 325 272 150 135 
Score: 1 6 6 5 5 4 4 2 3 


Does this mean that if students write a lot, they are guaranteed high scores? 
Carry out your own analysis of the data. How would you respond to each of 
Dr. Perelman’s claims? 


CHECK YOUR UNDERSTANDING 


Identify the explanatory and response variables in each setting. 


1. How does drinking beer affect the level of alcohol in people’s blood? The legal limit 
for driving in all states is 0.08%. In a study, adult volunteers drank different numbers of 
cans of beer. Thirty minutes later, a police officer measured their blood alcohol levels. 

2. The National Student Loan Survey provides data on the amount of debt for recent 
college graduates, their current income, and how stressed they feel about college debt. A 
sociologist looks at the data with the goal of using amount of debt and income to explain 
the stress caused by college debt. 
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144 CHAPTER 3 


You will often see explanatory variables 
called independent variables and 
response variables called dependent 
variables, Because the words 
“independent” and “dependent” have 
other meanings in statistics, we won't 
use them here. 


DESCRIBING RELATIONSHIPS 


It is easiest to identify explanatory and response variables when we actually 
specify values of one variable to see how it affects another variable. For instance, 
to study the effect of alcohol on body temperature, researchers gave several dif- 
ferent amounts of alcohol to mice. Then they measured the change in each 
mouse’s body temperature 15 minutes later. In this 
case, amount of alcohol is the explanatory variable, 
and change in body temperature is the response 
variable. When we don’t specify the values of either 
variable but just observe both variables, there may 
or may not be explanatory and response vari- 
ables. Whether there are depends on how 
you plan to use the data. a 


Linking SAT Math and Critical 


Reading Scores 


Explanatory or response? 


Julie asks, “Can I predict a state’s mean SAT Math score if I know its mean SAT 
Critical Reading score?” Jim wants to know how the mean SAT Math and Critical 
Reading scores this year in the 50 states are related to each other. 


PROBLEM: For each student, identify the explanatory variable and the response variable if possible. 


SOLUTION: Julie is treating the mean SAT Critical Reading score as the explanatory variable and 
the mean SAT Math score as the response variable. Jim is simply interested in exploring the relation- 
ship between the two variables. For him, there is no clear explanatory or response variable. 


For Practice Try Exercise 


Julio asks, "Can | predict a state's mean SAT Math score i Eknow ita 
mann SAT Critical Reading score?” 


pee diame B hclbene [tng ended rl pd taal at 
expinatory and the mean SAT Math score as the 
response varie 


1. Coral reefs How sensitive to changes in water 
temperature are coral reefs? To find out, measure 

the growth of corals in aquariums where the water 
temperature is controlled at different levels. Growth is 
measured by weighing the coral before and after the 
experiment. What are the explanatory and response 
variables? Are they categorical or quantitative? 


Jim wants to know how the mean SAT Math and Cntical Reading 
Scores this year in the 50 states are related to each ther. 


Gesell Scores 
Putting it all together 


Does the age at which a child begins to talk predict a later score on a test of mental 
ability? A study of the development of young children recorded the age in months 
at which each of 2] children spoke their first word and their Gesell Adaptive Score, 
the result of an aptitude test taken much later.'° The data appear in the table be- 
low, along with a scatterplot, residual plot, and computer output. Should we use a 
linear model to predict a child’s Gesell score from his or her age at first word? If so, 
how accurate will our predictions be? 


CHILD AGE SCORE CHILD AGE SCORE CHILD AGE SCORE 
1 15 95 8 an 100 15 11 102 
2 26 71 9 8 104 16 10 100 
3 10 83 10 20 94 17 12 105 
4 9 91 1 7 113 18 42 57 
5 15 102 12 9 96 19 17 121 
6 20 87 13 10 83 20 11 86 
7 18 93 14 1 84 21 10 100 
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Summary 


e — Aregression line isa straight line that describes how a response variable y changes 
as an explanatory variable x changes. You can use a regression line to predict the 
value of y for any value of x by substituting this x into the equation of the line. 


¢ The slope b of a regression line § = a + bx is the rate at which the predicted 
response changes along the line as the explanatory variable x changes. Spe- 
cifically, b is the predicted change in y when x increases by | unit. 


e The y intercept a of a regression line $ = a + bx is the predicted response ¥ 
when the explanatory variable x equals 0. This prediction is of no statistical 
use unless x can actually take values near 0. 


Exercises 


What’s my line? You use the same bar of soap to 
shower each morning. The bar weighs 80 grams when 
it is new. Its weight goes down by 6 grams per day on 
average. What is the equation of the regression line 
for predicting weight from days of use? 


What’s my line? An eccentric professor believes that 
a child with IQ 100 should have a reading test score 
of 50 and predicts that reading score should increase 
by 1 point for every additional point of IQ. What 

is the equation of the professor's regression line for 
predicting reading score from IQ? 


Gas mileage We expect a car’s highway gas mileage 
to be related to its city gas mileage. Data for all 1198 
vehicles in the government's recent Fuel Economy 
Guide give the regression line: predicted highway 
mpg = 4.62 + 1.109 (city mpg). 


hat's the slope of this line? Interpret this value in context. 


What's the y intercept? Explain why the value of the 
intercept is not statistically meaningful. 
Find the predicted highway mileage for a car that gets 
16 miles per gallon in the city. 


. IQ and reading scores Data on the IQ test scores 


and reading test scores for a group of fifth-grade 

children give the following regression line: predicted 
reading score = —33.4 + 0.882(IO sco @ 
What's the slope of this line? Interpret thi 
context. 
What's the y intercept? Explain why the 
intercept is not statistically meaningful. 


Find the predicted reading score for a cl 
1Q score of 90. 


Acid rain Researchers studying acid rai 
the acidity of precipitation in a Coloradc 
area for 150 consecutive weeks. Acidity i 
y phLawer pH values show higher ac 
researchers oBS linear pattern ove 
They reported that the regfeysian lige pl 
0.0053(weeks) fit the data well.!” 


Exercise: Chapter 3, Exercise #39 
(a) Identity the slope of the line and explain what it means in this 


Solution: pi = §.43€0.0053pvee 


. In my Chevrolet (2.2) The Chevrolet Malibu with 
a four-cylinder engine has a combined gas mileage of 
25 mpg. What percent of all vehicles have worse gas 
mileage than the Malibu? 


67. Beavers and beetles Do beavers benefit beetles? 


4 Researchers laid out 23 circular plots, each + meters 


in diameter, in an area where beavers were cutting 
down cottonwood trees. In each plot, they counted the 
number of stumps from trees cut by beavers and the 
number of clusters of beetle larvae. Ecologists think 
that the new sprouts from stumps are more tender than 
other cottonwood growth, so that beetles prefer them. 


in Joan’s midwestern home. The figure below shows 


the original scatterplot with the least-squares line 
added. The equation of the least-squares line is 
fy = 1425 — 19.87x. 


Gas consumed (cubic foot) 


BoE gs ee gs 


so 35) 4045S 
‘Temperature (degrees Fahrenheit) 


(a) Identify the slope of the line and explain what it 
means in this setting. 

(b) Identify the y intercept of the line. Explain why it’s 
risky to use this value as a prediction. 


(c) Use the regression line to predict the amount of 


natural gas Joan will use in a month with an average 


temperature of 30°F. 


41. Acid rain Refer to Exercise 39. Would it be appropri- 
ate to use the regression line to predict pH after 1000 


months? Justify your answer. 


) 


identification: The stope is — 0.0053. 


Interpretation: For every additional week during the study, 
the pH Is predicted to decrease by an average of — 0.0053, 


Section 3.1: Scatterplots and Correlation 
In this section, you learned how to explore the relationship 
between two quantitative variables. As with distributions of 
a single variable, the first step is always to make a grap! 


scatterplot is the appropriate type o 
sociations between two quantitative 


variables. To descril 


h.A 


graph to investigate as- 


ea 


scatterplot, be sure to discuss four characteristics: direction, 
form, strength, and outliers. The direction of an associa- 


tion might be positive, negative, 01 
an association can be linear or non! 


strong if it closely follows a specific form. Finally, out 
are any points that clearly fall outside the pattern of the 


of the data. 


r neither. The form of 
inear. An association is 


iers 
test 


The correlation risa numerical summary thatdescribes 


the direction and strength of a lin 
r > 0, the association is positive, 


ear association. WI 
and when r < 0, 


association is negative. ‘The corre 


hen 


the 


ation will always take 


possible to determine the form of an association from 
only the correlation. Strong nonlinear relationships can 
have a correlation close to 1 or a correlation close to 0, 
depending on the association. You also learned that out- 
liers can greatly affect the value of the correlation and 
that correlation does not imply causation. That is, we 
can’t assume that changes in one variable cause changes 
in the other variable, just because they have a correla- 
tion close to 1 or-1. 


Section 3.2: Least-Squares Regression 


In this section, you learned how to use least-squares re- 
gression lines as models for relationships between vari- 
ables that have a linear association. It is important to 
understand the difference between the actual data and 
the model used to describe the data. For example, when 
you are interpreting the slope of a least-squares regression 


values between —1 and 1, withr = —1 andr = 1 indic 
ing a perfectly linear relationship. Strong linear assoc 
tions have correlations near 1 or —1, while weak lin: 
relationships have correlations near 0). However, it is 


What Did You Learn? 


Chapter 3 Chapter Review Exercises 


These exercises are designed to help you review the important 


ideas and methods of the chapter. 


R3.1 Born to be old? Is there a relationship between the 
gestational period (time from conception to birth) 
of an animal and its average life span? The figure 
shows a scatterplot of the gestational period and av- 


erage life span for 43 species of 


animals,” 
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(a) Describe the association shown in the scatterplot. 
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Learning Objective Section Related Example Relevant Chapter 
on Page(s) Review Exercise(s) 

Identify explanatory and response variables in situations where one 

variable helps to explain or influences the other. 3.1 144 R3.4 

Make a scatterplot to display the relationship between two 

quantitative variables. 3.41 145, 148 R3.4 

Describe the direction, form, and strength of a relationship 

displayed in a scatterplot and recognize outliers in a scatterplot. 3.4 147, 148 R3.1 

Interpret the correlation. 3.1 62 R3.3, R3.4 

Understand the basic properties of correlation, including how the 

correlation is influenced by outliers. 3.1 152, 156, 157 R3.1, R3.2 

Use technology to calculate correlation. 3.1 Activity on 152, 171 R3.4 

Explain why association does not imply causation. 3.4 Discussion on 156, 190 R3.6 

Interpret the slope and y intercept of a least-squares regression line. 3.2 166 R3.2, R3.4 

Use the least-squares regression line to predict y for a given x. 167, Discussion on 168 

Explain the dangers of extrapolation. 3.2 (for extrapolation) R3.2, R3.4, R3.5 

Calculate and interpret residuals. 3.2 69 R3.3, R3.4 

Explain the concept of least squares. 2 Discussion on 169 R3.5 

Determine the equation of a least-squares regression line using Technology Corner on 

technology or computer output. 3.2 171, 181 R3.3, R3.4 

Construct and interpret residual plots to assess whether a linear 

model is appropriate. 3.2 Discussion on 175, 180 R3.3, R3.4 

Interpret the standard deviation of the residuals and r? and use 

these values to assess how well the least-squares regression line 

models the relationship between two variables. 3.2 80 R3.3, R3.5 
\. Discussion on 188 R3.1 
183 R3.5 
= 


R3.3 Stats teachers’ cars A random sample of AP® Sta- 
tistics teachers was asked to report the age (in years) 
and mileage of their primary vehicles. A scatterplot 
of the data, a least-squares regression printout, and 
a residual plot are provided below. 

Predictor coef SE Coef 7 P 
Constant 3704 8268 0.45 
Age 12188 1ag2 8.17 


S = 20870.5 R-Sq = 83.78 R-Sq(adj) = 82.4% 
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Section |: Multiple Choice Select the best answer for each question. 


13.1 A school guidance counselor examines the number alcoholic beverages for each of 11] regions in Great 

of extracurricular activities that students do and their Britain was recorded. A scatterplot of spending on 

grade point average. The guidance counselor says, alcohol versus spending on tobacco is shown below. 

“The evidence indicates that the correlation between Which of the following statements is true? 

the number of extracurricular activities a student par- 

ticipates in and his or her grade point average is close 

to zero.” A correct interpretation of this statement 6.0 4 
would be that 

(a) active students tend to be students with poor grades, 
and vice versa. 

(b) students with good grades tend to be students who 
are not involved in many extracurricular activities, 
and vice versa. 


(c) students involved in many extracurricular activities are 30 a5 40 us 
just as likely to get good grades as bad grades; the same is Tobacco 


true for students involved in few extracurricular activities. : B & i 
PMA cite er ee aL Sane eae AEN (a) The observation (4.5, 6.0) is an outlier. 


th) There is clear evidence of a negative association be- 
‘ending on alcohol and tobacco. 

iation of the least-squares line for this plot 
2 approximately f = 10 — 2x. 

‘elation for these data is r = 0.99. 

ervation in the lower-right corner of the plot is 
lal for the least-squares line. 


(d) there is no linear relationship between number of activ- 


Cumulative AP® Practice Test 1 


Section |: Multiple Choice Choose the best answer for Questions AP1.1 to AP1.14. 


AP1.1 You look at real estate ads for houses in Sarasota, 
Florida. Many houses range from $200,000 to 
$400,000 in price. The few houses on the water, 
however, have prices up to $15 million. Which of 
the following statements best describes the distribu- 
tion of home prices in Sarasota? 

(a) The distribution is most likely skewed to the left, 
and the mean is greater than the median. 


AP1.4 Fora certain experiment, the available experimen- 
tal units are eight rats, of which four are female 
(F1, F2, F3, F4) and four are male (M1, M2, M3, | 
M4). There are to be four treatment groups, A, B, 
C, and D. Ifa randomized block design is used, 
with the experimental units blocked by gender, 
which of the following assignments of treatments 
is impossible? 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded 
on the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


AP1.15 The manufacturer of exercise machines for fitness 
centers has designed two new elliptical machines 
that are meant to increase cardiovascular fit- 


the two machines. Note that higher scores indicate 
larger gains in fitness. 


ness. The two machines are being tested on 30 Machine Mecnne 8 
volunteers at a fitness center near the company’s 9 2 
headquarters. The volunteers are randomly as- 54 1 0 

signed to one of the machines and use it daily for 876320 2 159 

two months. A measure of cardiovascular fitness 97411 3 2489 

is administered at the start of the experiment and ey & Pee 


| FRAPPY! | Free Response AP*® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


and observed how many hours each flower continued to 
look fresh. A scatterplot of the data is shown below. 
240 
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(a) Briefly describe the association shown in the 
scatterplot. 

(b) ‘The equation of the least-squares regression line 
for these data is ¥ = 180.8 + 15.8x. Interpret the 
slope of the line in the context of the study. 


‘Iwo statistics students went to a flower shop and ran- 
domly selected 12 carnations. When they got home, the 
students prepared 12 identical vases with exactly the same 
amount of water in each vase. They put one tablespoon of 
sugar in 3 vases, two tablespoons of sugar in 3 vases, and 
three tablespoons of sugar in 3 vases. In the remaining 
3 vases, they put no sugar. After the vases were prepared, 
the students randomly assigned | carnation to each vase 


(c) Calculate and interpret the residual for the flower 
that had 2 tablespoons of sugar and looked fresh for 
204 hours. 

(d) Suppose that another group of students conducted 
a similar experiment using 12 flowers, but in- 
cluded different varieties in addition to carnations. 
Would you expect the value of r for the second 
group’s data to be greater than, less than, or about 
the same as the value of for the first group’s data? 
Explain. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 


XIX 


TECHNOLOGY coaTTERPLOTS ON THE CALCULATOR 


Tl-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site 


Making scatterplots with technology is much easier than constructing them by hand. We'll use the SEC football data 
from page 146 to show how to construct a scatterplot on a ‘TI-83/84 or TI-89. 


e Enter the data values into your lists. Put the points per game in LI /list] and the number of wins in L2/ist2. 


¢ Define a scatterplot in the statistics plot menu (press [F2] on the TI-89). Specify the settings shown below. 
TT ty 


¢ Use ZoomStat (Zoom Data on the TI-89) to obtain a graph. The calculator will set the window dimensions automatically 


by looking at the values in LAist] and L2/ist2. 
[etilettelreccanadreerfonferou ran 
o 


‘, i 8 


SOURS a 


Notice that there are no scales on the axes and that the axes are not labeled. If you copy a scatterplot from your calculator onto 
your paper, make sure that you scale and label the axes. 


AP® EXAM TIP If you are asked to make a scatterplot on a free-response question, be sure to 
label and scale both axes. Don’tjust copy an unlabeled calculator graph directly onto your paper. 


3.2) TECHNOLOGY 
CORNERS 


Tl-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site 


8. Least-squares regression lines on the calculator 
9. Residual plots on the calculator 


59. Merlins breeding Exercise 13 (page 160) gives data on 
pO] the number of breeding pairs of merlins in an isolated 
©) area in each of seven years and the percent of males who 
returned the next year. The data show that the percent 
returning is lower after successful breeding seasons and 
that the relationship is roughly linear. The figure below 
shows Minitab regression output for these data. 


Cm =Tere 


| Regression Analysis: Percent return versus Breeding pairs” 


(a) What is the equation of the least-squares regression 
line for predicting the percent of males that retum 
from the number of breeding pairs? Use the equa- 
tion to predict the percent of returning males after a 
season with 30 breeding pairs. 


(b) What percent of the year-to-year variation in percent 
of returning males is accounted for by the straight- 
line relationship with number of breeding pairs the 
previous year? 


XX 


gee 
Overview What ls Statistics? 


Does listening to music while studying help or hinder learning? If an athlete fails a drug test, how sure 
can we be that she took a banned substance? Does having a pet help people live longer? How well do SAT 
scores predict college success? Do most people recycle? Which of two diets will help obese children lose 
more weight and keep it off? Should a poker player go “all in” with pocket aces? Can a new drug help 
people quit smoking? How strong is the evidence for global warming? 

‘These are just a few of the questions that statistics can help answer. But what is statistics? And why 
should you study it? 


Data are usually numbers, but they are not “just numbers.” Data are numbers with a context. ‘The number 
10.5, for example, carries no information by itself. But if we hear that a family friend’s new baby weighed 10.5 
pounds at birth, we congratulate her on the healthy size of the child. The context engages our knowledge 
about the world and allows us to make judgments. We know that a 
baby weighing 10.5 pounds is quite large, and that a human baby is 
unlikely to weigh 10.5 ounces or 10.5 kilograms. ‘The context makes 
the number meaningful. 

In your lifetime, you will be bombarded with data and sta- 
tistical information. Poll results, television ratings, music sales, 
gas prices, unemployment rates, medical study outcomes, and 
standardized test scores are discussed daily in the media. Using 
data effectively is a large and growing part of most professions. A 
solid understanding of statistics will enable you to make sound, 
data-based decisions in your career and everyday life. 


It is tempting to base conclusions on your own experiences or the experiences of those you know. But our 
experiences may not be typical. In fact, the incidents that stick in our memory are often the unusual ones. 


Do cell phones cause brain cancer? 


Italian businessman Innocente Marcolini developed a brain tumor at age 60. He also talked on a cellular 
phone up to 6 hours per day for 12 years as part of his job. Mr. Marcolini’s physician suggested that the 
brain tumor may have been caused by cell-phone use. So Mr. Marcolini decided to file suit in 
the Italian court system. A court ruled in his favor in October 2012. 

Several statistical studies have investigated the link between cell-phone use and brain 
cancer. One of the largest was conducted by the Danish Cancer Society. Over 350,000 resi- 
dents of Denmark were included in the study. Researchers compared the brain-cancer rate for 
the cell-phone users with the rate in the general population. The result: no statistical differ- 
ence in brain-cancer rates.! In fact, most studies have produced similar conclusions. In spite 
of the evidence, many people (like Mr. Marcolini) are still convinced that cell phones can 
cause brain cancer. 

In the public’s mind, the compelling story wins every time. A statistically literate person 
knows better. Data are more reliable than personal experiences because they systematically 
describe an overall picture rather than focus on a few incidents. 


Xxi 


xxii 


Are you kidding me? 


The famous advice columnist Ann Landers once asked her readers, “If you had it to do over 
again, would you have children?” A few weeks later, her column was headlined “70% OF 
PARENT'S SAY KIDS NOT WORTH IT.” Indeed, 70% of the nearly 10,000 parents who 
wrote in said they would not have children if they could make the choice again. Do you 
believe that 70% of all parents regret having children? 

You shouldn't. The people who took the trouble to write Ann Landers are not represen- 
tative of all parents. Their letters showed that many of them were angry with their children. 
All we know from these data is that there are some unhappy parents out there. A statistically 
designed poll, unlike Ann Landers’s appeal, targets specific people chosen in a way that gives 
all parents the same chance to be asked. Such a poll showed that 91% of parents would have 
children again. 

Where data come from matters a lot. If you are careless about how you get your data, you 
may announce 70% “No” when the truth is close to 90% “Yes.” 


Who talks more—women or men? 


According to Louann Brizendine, author of The Female Brain, women say nearly three times as many 
words per day as men. Skeptical researchers devised a study to test this claim. They used electronic devices 
to record the talking patterns of 396 university students from ‘Texas, Arizona, and Mexico. The device was 
programmed to record 30 seconds of sound every 12.5 minutes without the carrier’s knowledge. What 
were the results? 

According to a published report of the study in Scientific American, “Men showed a slightly wider 
variability in words uttered. . . . But in the end, the sexes came out just about even in the daily averages: 
women at 16,215 words and men at 15,669.”* When asked where she got her figures, Brizendine admitted 
that she used unreliable sources.’ 

The most important information about any statistical study is how the data were produced. Only 
carefully designed studies produce results that can be trusted. 


Yogi Berra, a famous New York Yankees baseball player known for his unusual quotes, had this to say: 
“You can observe a lot just by watching.” ‘That’s a motto for learning from data. A carefully chosen graph 
is often more instructive than a bunch of numbers. 


Do people live longer in wealthier countries? 


‘The Gapminder Web site, www.gapminder.org, provides loads of data on the health and well-being of the 
world’s inhabitants. The graph on the next pages displays some data from Gapminder.* The individual 
points represent all the world’s nations for which data are available. Each point shows the income per 
person and life expectancy in years for one country. 

We expect people in richer countries to live longer. The overall pattern of the graph does show 
this, but the relationship has an interesting shape. Life expectancy rises very quickly as personal income 
increases and then levels off. People in very rich countries like the United States live no longer than 
people in poorer but not extremely poor nations. In some less wealthy countries, people live longer 
than in the United States. Several other nations stand out in the graph. What’s special about each of 
these countries? 
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Income per person in 2012 


Individuals vary. Repeated measurements on the same individual vary. Chance outcomes—like spins of 
a roulette wheel or tosses of a coin—vary. Almost everything varies over time. Statistics provides tools for 
understanding variation. 


Have most students cheated on a test? 


Researchers from the Josephson Institute were determined to find out. So they surveyed about 23,000 
students from 100 randomly selected schools (both public and private) nationwide. The question they 
asked was “How many times have you cheated during a test at school in the past year?” Fifty-one percent 
said they had cheated at least once.’ 

If the researchers had asked the same question of all high school students, would exactly 51% have 
answered “Yes”? Probably not. If the Josephson Institute had selected a different sample of about 23,000 
students to respond to the survey, they would probably have gotten a different estimate. Variation is every- 
where! 

Fortunately, statistics provides a description of how the sample results will vary in relation to the ac- 
tual population percent. Based on the sampling method that this study used, we can say that the estimate 
of 51% is very likely to be within 1% of the true population value. That is, we can be quite confident that 
between 50% and 52% of all high school students would say that they have cheated on a test. 

Because variation is everywhere, conclusions are uncertain. Statistics gives us a language for talk- 
ing about uncertainty that is understood by statistically literate people everywhere. 


Graph of the life 
expectancy of people 

in many nations against 
each nation’s income 
per person in 2012. 
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Chapter 1 AP® Statistics 
Practice Test 


Do Pets or Friends Help Reduce Stress? 


If you are a dog lover, having your dog with you may reduce your stress level. Does having a friend with 
you reduce stress? To examine the effect of pets and friends in stressful situations, researchers recruited 45 
women who said they were dog lovers. Fifteen women were assigned at random to each of three groups: 
to do a stressful task alone, with a good friend present, or with their dogs present. The stressful task was 
to count backward by 13s or 17s. The woman’s average heart rate during the task was one measure of the 
effect of stress. The table below shows the data.! 


RATE GROUP RATE GROUP RATE GROUP RATE 

Fy 69.169 P 68.862 C 84.738 C 75.477 
F 99.692 C 87.231 C 84.877 C 62.646 
iP 70.169 P 64.169 Fi 58.692 P 70.077 
C 80.369 C 91.754 P 79.662 le 88.015 
C 87.446 C 87.785 P 69.231 F 81.600 
P 75.985 F 91.354 C 73.277 F 86.985 
F 83.400 F 100.877 C 84.523 F 92.492 
F 102.154 C 77.800 C 70.877 P 72.262 
P 86.446 P 97.538 F 89.815 P 65.446 
F 80.277 P 85.000 R 98.200 

C 90.015 F 101.062 F 76.908 

C 99.046 F 97.046 P 69.538 


2 CHAPTER 1 EXPLORING DATA 


tinrineoum Lata Analysis: Making Sense 


WHAT YOU WILL LEARN 


of Data 


By the end of the section, you should be able to: 


e — |dentify the individuals and variables in a set of data. e Classify variables as categorical or quantitative. 


Statistics is the science of data. The volume of data available to us is overwhelm- 
ing. For example, the Census Bureau’s American Community Survey collects 
data from 3,000,000 housing units each year. Astronomers work with data on tens 
of millions of galaxies. The checkout scanners at Walmart’s 10,000 stores in 27 
countries record hundreds of millions of transactions every week. 

In all these cases, the data are trying to tell us a story—about U.S. households, 
objects in space, or Walmart shoppers. To hear what the data are saying, we need 
to help them speak by organizing, displaying, summarizing, and asking questions. 
That’s data analysis. 


Individuals and Variables 


Any set of data contains information about some group of individuals. The char- 
acteristics we measure on each individual are called variables. 


DEFINITION: Individuals and variables 


Individuals are the objects described by a set of data. Individuals may be people, 
animals, or things. 


A variable is any characteristic of an individual. A variable can take different values 
for different individuals. 


A high school’s student data base, for example, includes data about every cur- 
rently enrolled student. The students are the individuals described by the data 
set. For each individual, the data contain the values of variables such as age, 
gender, grade point average, homeroom, and grade level. In practice, any set of 
data is accompanied by background information that helps us understand the 
data. When you first meet a new data set, ask yourself the following questions: 


1. Whoare the individuals described by the data? How many individuals are there? 


2. What are the variables? In what units are the variables recorded? Weights, for ex- 
ample, might be recorded in grams, pounds, thousands of pounds, or kilograms. 


We could follow a newspaper reporter’s lead and extend our list of questions 
to include Why, When, Where, and How were the data produced? For now, we'll 
focus on the first two questions. 

Some variables, like gender and grade level, assign labels to individuals that place 
them into categories. Others, like age and grade point average (GPA), take numeri- 
cal values for which we can do arithmetic. It makes sense to give an average GPA for 
a group of students, but it doesn’t make sense to give an “average” gender. 
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DEFINITION: Categorical variable and quantitative variable 
A categorical variable places an individual into one of several groups or categories. 


A quantitative variable takes numerical values for which it makes sense to find an 
average. 


AP® EXAM TIP If you learn Not every variable that takes number values is quantitative. Zip code is 
one example. Although zip codes are numbers, it doesn’t make sense to 
talk about the average zip code. In fact, zip codes place individuals (peo- 
ple or dwellings) into categories based on location. Some variables—such as gen- 
der, race, and occupation—are categorical by nature. Other categorical variables 
are created by grouping values of a quantitative variable into classes. For instance, 
we could classify people in a data set by age: 0-9, 10-19, 20-29, and so on. 

The proper method of analysis for a variable depends on whether it is categori- 
cal or quantitative. As a result, it is important to be able to distinguish these two 
types of variables. The type of data determines what kinds of graphs and which 


numerical summaries are appropriate. 


to distinguish categorical from 
quantitative variables now, it 
will pay big rewards later. You 


will be expected to analyze 
categorical and quantitative 
variables correctly on the AP® 
exam. 


Census at School 
Data, individuals, and variables 


CensusAtSchool is an international project that collects data about primary and 
secondary school students using surveys. Hundreds of thousands of students from 
Australia, Canada, New Zealand, South Africa, and the United Kingdom have 
taken part in the project since 2000. Data from the surveys are available at the 
project’s Web site (www.censusatschool.com). We used the site’s “Random Data 
Selector” to choose 10 Canadian students who completed the survey in a recent 
year. The table below displays the data. 


CensusAtSchool 
= International 
(A) 
[Fe=-=] 
Language Height Wrist Preferred 
Province Gender spoken Handed (cm) circum. (mm) communication 
Saskatchewan Male 1 Right 175 180 In person 
Ontario Female 1 Right 162.5 160 In person 
Alberta Male 1 Right 178 174 Facebook 
Ontario Male 2 Right 169 160 Cell phone 
Ontario Female 2 Right 166 65 In person 
; * Nunavut Male 1 Right 168.5 160 Text messaging 
There is at least one suspicious . : 
value in the data table. We doubt Ontario Female 1 Right 166 165 Cell phone 
has a wrist circumference of 65mm = ntarig Female 2 Right 150.5 187 Text Messaging 


(about 2.6 inches). Always look to be ; : F 
sure the values make sense! Ontario Female 1 Right 171 180 Text Messaging 
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PROBLEM: 
(a) Who are the individuals in this data set? 


(b) What variables were measured? Identify each as categorical or quantitative. 
(c) Describe the individual in the highlighted row. 


We'll see in Chapter 4 why choosing SOLUTION: 
at random, as we did in this 


(a) The individuals are the 10 randomly selected Canadian students who participated in the 
example, is a good idea. 


CensusAtSchool survey. 


(b) The seven variables measured are the province where the student lives (categorical), gender 
(categorical), number of languages spoken (quantitative), dominant hand (categorical), height (quan- 
titative), wrist circumference (quantitative), and preferred communication method (categorical). 
(c) This student lives in Ontario, is male, speaks four languages, is left-handed, is 157.5 cm tall 
(about 62 inches), has a wrist circumference of 14:7 mm (about 5.8 inches), and prefers to com- 
municate via text messaging. 


For Practice Try Exercise 


Most data tables follow the format shown in the example—each row is an indi- 
vidual, and each column is a variable. Sometimes the individuals are called cases. 

To make life simpler, we sometimes A variable generally takes values that vary (hence the name “variable”!). 

refer to “categorical data” or Categorical variables sometimes have similar counts in each category and some- 

“Quantitative data” instead of times don’t. For instance, we might have expected similar numbers of males 

identifying the variable as categorical ; > : 

o# Guuarfitative: and females in the CensusAtSchool data set. But we aren’t surprised to see that 
most students are right-handed. Quantitative variables may take values that are 
very close together or values that are quite spread out. We call the pattern of vari- 
ation of a variable its distribution. 


TT 


DEFINITION: Distribution 


The distribution of a variable tells us what values the variable takes and how often 
it takes these values. 


Section 1.1 begins by looking at how to describe the distribution of a single cat- 
egorical variable and then examines relationships between categorical variables. 
Sections 1.2 and 1.3 and all of Chapter 2 focus on describing the distribution of 
a quantitative variable. Chapter 3 investigates relationships between two quantita- 
tive variables. In each case, we begin with graphical displays, then add numerical 
summaries for a more complete description. 


HOW TO EXPLORE DATA 


e Begin by examining each variable by itself. Then move on to study rela- 
tionships among the variables. 


e Start with a graph or graphs. Then add numerical summaries. 


CHECK YOUR UNDERSTANDING 


Jake is a car buff who wants to find out more about the vehicles that students at his 
school drive. He gets permission to go to the student parking lot and record some data. 
Later, he does some research about each model of car on the Internet. Finally, Jake 


ACTIVITY 


MATERIALS: 


Bag with 25 beads (15 of 
one color and 10 of another) 
or 25 identical slips of paper 
(15 labeled “M” and 10 
labeled “F”) for each student 
or pair of students 
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makes a spreadsheet that includes each car’s model, year, color, number of cylinders, 
gas mileage, weight, and whether it has a navigation system. 


1. Who are the individuals in Jake’s study? 


2. What variables did Jake measure? Identify each as categorical or quantitative. 


From Data Analysis to Inference 


Sometimes, we're interested in drawing conclusions that go beyond the data at 
hand. That’s the idea of inference. In the CensusAtSchool example, 9 of the 10 
randomly selected Canadian students are right-handed. That’s 90% of the sample. 
Can we conclude that 90% of the population of Canadian students who partici- 
pated in CensusAtSchool are right-handed? No. 

If another random sample of 10 students was selected, the percent who are 
right-handed might not be exactly 90%. Can we at least say that the actual popula- 
tion value is “close” to 90%? That depends on what we mean by “close.” 

The following Activity gives you an idea of how statistical inference works. 


Hiring discrimination—it just won’t fly! 


An airline has just finished training 25 pilots— 15 male and 10 female —to become 
captains. Unfortunately, only eight captain positions are available right now. Air- 
line managers announce that they will use a lottery to determine which pilots will 
fill the available positions. The names of all 25 pilots will be written on identical 
slips of paper. The slips will be placed in a hat, mixed thoroughly, and drawn out 
one at a time until all eight captains have been identified. 

A day later, managers announce the results of the lottery. Of the 8 captains 
chosen, 5 are female and 3 are male. Some of the male pilots who weren't selected 
suspect that the lottery was not carried out fairly. One of these pilots asks your 
statistics class for advice about whether to file a grievance with the pilots’ union. 

The key question in this possible discrimination case seems to 
be: Is it plausible (believable) that these results happened just by 
chance? To find out, you and your classmates will simulate the 
lottery process that airline managers said they used. 


1. Mix the beads/slips thoroughly. Without looking, remove 
8 beads/slips from the bag. Count the number of female pilots 
selected. Then return the beads/slips to the bag. 
2. Your teacher will draw and label a number line for a class dot- 
plot. On the graph, plot the number of females you got in Step 1. 
3. Repeat Steps | and 2 if needed to get a total of at least 40 
simulated lottery results for your class. 
4. Discuss the results with your classmates. Does it seem believable that airline 
managers carried out a fair lottery? What advice would you give the male pilot 
who contacted you? 
5. Would your advice change if the lottery had chosen 6 female (and 2 male) 
pilots? What about 7 female pilots? Explain. 
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CHAPTER 1 EXPLORING DATA 


Our ability to do inference is determined by how the data are produced. 
Chapter 4 discusses the two main methods of data production—sampling and 
experiments—and the types of conclusions that can be drawn from each. As the 
Activity illustrates, the logic of inference rests on asking, “What are the chances?” 
Probability, the study of chance behavior, is the topic of Chapters 5 through 7. 
We'll introduce the most common inference techniques in Chapters 8 through 12. 


Summary 


e A data set contains information about a number of individuals. Individuals 
may be people, animals, or things. For each individual, the data give values 
for one or more variables. A variable describes some characteristic of an in- 
dividual, such as a person’s height, gender, or salary. 


e¢ Some variables are categorical and others are quantitative. A categorical vari- 
able assigns a label that places each individual into one of several groups, such as 
male or female. A quantitative variable has numerical values that measure some 
characteristic of each individual, such as height in centimeters or salary in dollars. 


e The distribution of a variable describes what values the variable takes and 
how often it takes them. 


Protecting wood How can we help wood surfaces 
resist weathering, especially when restoring historic 
wooden buildings? In a study of this question, re- 
searchers prepared wooden panels and then exposed 
them to the weather. Here are some of the variables 
recorded: type of wood (yellow poplar, pine, cedar); 
type of water repellent (solvent-based, water-based); 
paint thickness (millimeters); paint color (white, gray, 
light blue); weathering time (months). Identify each 
variable as categorical or quantitative. 


Medical study variables Data from a medical study 
contain values of many variables for each of the people 
who were the subjects of the study. Here are some of 
the variables recorded: gender (female or male); age 
(years); race (Asian, black, white, or other); smoker 
(yes or no); systolic blood pressure (millimeters of mer- 
cury); level of calcium in the blood (micrograms per 
milliliter). Identify each as categorical or quantitative. 
A class survey Here is a small part of the data set that 
describes the students in an AP® Statistics class. The 
data come from anonymous responses to a question- 
naire filled out on the first day of class. 


Exercises 


The solutions to all exercises numbered in red are 
found in the Solutions Appendix, starting on page S-1. 


Pocket 
Height Homework Favorite change 
Gender Hand (in.) time (min) music (cents) 
F L 65 200 Hip-hop 50 
M L 72 30 Country ob) 
M R 62 95 Rock 35 
IF L 64 120 Alternative 0 
M R 63 220 Hip-hop 0 
FR 5B 6D ternative 78 
F R 67 150 Rock 215 
(a) What individuals does this data set describe? 


(b) 


(c) 
4. 


What variables were measured? Identify each as cat- 
egorical or quantitative. 


Describe the individual in the highlighted row. 


Coaster craze Many people like to ride roller coast- 
ers. Amusement parks try to increase attendance by 
building exciting new coasters. ‘The following table 
displays data on several roller coasters that were 
opened in a recent year.” 
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Height Speed Duration Travel Income 
Roller coaster Type (ft) Design (mph) (s) Weight Age to work last 
WildMouse Steel 49.3 Sitdown ~—-28 70 (Ib) (ye) (min) School Gender year ($) 
Terminator Wood 95 Sitdown 50.1 180 ie 66 B Minie etd: eet 
Manta Steel 140 Flying 56 155 ie Pee are solic alae : 
Diamondback Steel 230 Sitdown —80 180 339 a a ] 6000 
91 Dif 10 Some college 2 30,000 
(a) What individuals does this data set describe? 155 18 na High school grad 2 0 
(b) What variables were measured? Identify each as cat- 913 38 15 Master’s degree 2 125,000 
egorical or quantitative. 194 40 0 Highschool grad 1 800 
(c) Describe the individual in the highlighted TOW. 994 18 20 High school grad { 2500 
5. Ranking colleges Popular magazines rank colleges 193 11 n/a Fifth grade 1 0 
and universities on their “academic quality” in serv- 
ing undergraduate students. Describe two categorical 7. ‘The individuals in this data set are 
variables and two quantitative variables that you (ay honceholds 
might record for each institution. 
(b) people. 
6. Students and TV You are preparing to study the (adults 
television-viewing habits of high school students. De- 2 
scribe two categorical variables and two quantitative (d) 120 variables. 
variables that you might record for each student. (e) columns. 


Multiple choice: Select the best answer. 

Exercises 7 and 8 refer to the following setting. At the 

Census Bureau Web site www.census.gov, you can view ( 

detailed data collected by the American Community ( 

Survey. The following table includes data for 10 people ( 
( 
( 


8. ‘This data set contains 
a) 7 variables, 2 of which are categorical. 
b) 7 variables, 1 of which is categorical. 


ey c) 6 variables, 2 of which are categorical. 
chosen at random from the more than | million people 
in households contacted by the survey. “School” gives the 


highest level of education completed. 


d) 6 variables, | of which is categorical. 
e) None of these. 


Analyzing Categorical Data 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Display categorical data with a bar graph. Decide if it e Calculate and display the conditional distribution of a 
would be appropriate to make a pie chart. categorical variable for a particular value of the other 


Identify what makes some graphs of categorical data categorical variable in a two-way table. 
deceptive. Describe the association between two categori- 


Calculate and display the marginal distribution of a cal variables by comparing appropriate conditional 
categorical variable from a two-way table. distributions. 


The values of a categorical variable are labels for the categories, such as “male” 
and “female.” The distribution of a categorical variable lists the categories and 
gives either the count or the percent of individuals who fall within each category. 
Here’s an example. 
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Radio Station Formats 
Distribution of a categorical variable 


The radio audience rating service Arbitron places U.S radio stations into categories 
that describe the kinds of programs they broadcast. Here are two different tables 
showing the distribution of station formats in a recent year:’ 


Format Count of stations Format Percent of stations 
Adult contemporary 1556 Adult contemporary 11.2 
Adult standards 1196 Adult standards 8.6 
Contemporary hit 569 Contemporary hit 4.1 
Country 2066 Country 14.9 
News/Talk/Information 2179 News/Talk/Information 15.7 
Oldies 1060 Oldies 7.7 
Religious 2014 Religious 14.6 
Rock 869 Rock 6.3 
Spanish language 750 Spanish language 5.4 
Other formats Other formats 

Total Total 


In this case, the individuals are the radio stations and the variable being measured 
is the kind of programming that each station broadcasts. The table on the left, 
which we call a frequency table, displays the counts (frequencies) of stations in 
each format category. On the right, we see a relative frequency table of the data 
that shows the percents (relative frequencies) of stations in each format category. 


It’s a good idea to check data for consistency. The counts should add to 13,838, 
the total number of stations. They do. The percents should add to 100%. In 
fact, they add to 99.9%. What happened? Each percent is rounded to the near- 
est tenth. The exact percents would add to 100, but the rounded percents only 
come close. This is roundoff error. Roundoff errors don’t point to mistakes in 
our work, just to the effect of rounding off results. 


Bar Graphs and Pie Charts 


Columns of numbers take time to read. You can use a pie chart or a bar graph to 
display the distribution of a categorical variable more vividly. Figure 1.1 illustrates 
both displays for the distribution of radio stations by format. 

Pie charts show the distribution of a categorical variable as a “pie” whose slices 
are sized by the counts or percents for the categories. A pie chart must include 
all the categories that make up a whole. In the radio station example, we needed 
the “Other formats” category to complete the whole (all radio stations) and allow 
us to make a pie chart. Use a pie chart only when you want to emphasize each 


?) 
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This bar has height 14.9% 
because 14.9% of the 
radio stations have a 
“Country” format. 


This slice occupies 14.9% of the pie 
because 14.9% of the radio stations 
have a “Country” format. 


Contemporary hit 


Percent of stations 
loo} 
| 


(b) Radio station format 
FIGURE 1.1 (a) Pie chart and (b) bar graph of U.S. radio stations by format. 


category’s relation to the whole. Pie charts are awkward to make by hand, but 
technology will do the job for you. 


I DIDN‘T HAVE 
ANYTHING USEFUL 
TO SAY SO I MADE 

THIS PIE CHART. 


I PLEDGE 
MY LIFE 
AND MY 
FORTUNE 


TO THE PIE! 


3-709 + ©2009Scott Adams, Inc./Dist. by UFS, Inc. 


www.dilbert.com — scottadams@aol.com 


Bar graphs are also called bar charts. Bar graphs represent each category as a bar. The bar heights show the category 
counts or percents. Bar graphs are easier to make than pie charts and are also 
easier to read. To convince yourself, try to use the pie chart in Figure 1.1 to esti- 
mate the percent of radio stations that have an “Oldies” format. Now look at the 
bar graph —it’s easy to see that the answer is about 8%. 

Bar graphs are also more flexible than pie charts. Both graphs can display the 
distribution of a categorical variable, but a bar graph can also compare any set of 
quantities that are measured in the same units. 


Who Owns an MP3 Player? 


Choosing the best graph to display the data 


Portable MP3 music players, such as the Apple iPod, are popular—but not equally 
popular with people of all ages. Here are the percents of people in various age 
groups who own a portable MP3 player, according to an Arbitron survey of 1112 
randomly selected people.* 
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Age group (years) Percent owning an MP3 player 


12 to 17 54 
18 to 24 30 
25 to 34 30 
35 to 54 13 
55 and older 5 
PROBLEM: 
_ (a) Make a well-labeled bar graph to display the data. Describe what you see. 
s so (b) Would it be appropriate to make a pie chart for these data? Explain. 
g 40 SOLUTION: 
= 30 (a) We start by labeling the axes: age group goes on the horizontal axis, and percent 
2 who own an MP3 player goes on the vertical axis. For the vertical scale, which is 
ae measured in percents, we'll start at O and go up to 60, with tick marks for every 10. 
ue Then for each age category, we draw a bar with height corresponding to the percent of 
survey respondents who said they have an MP3 player. Figure 1.2 shows the com- 
ee pleted bar graph. It appears that MP3 players are more popular among young people 
ica and that their popularity generally decreases as the age category increases. 
FIGURE 1.2 Bar graph comparing the per- (b) Making a pie chart to display these data is not appropriate because each percent 
cents of several age groups who own portable —in the table refers toa different age group, not to parts of a single whole. 
MP3 players. 


For Practice Try Exercise 


Graphs: Good and Bad 


Bar graphs compare several quantities by comparing the heights of bars that rep- 
resent the quantities. Our eyes, however, react to the area of the bars as well as to 
their height. When all bars have the same width, the area (width X height) varies 
in proportion to the height, and our eyes receive the right impression. When you 
draw a bar graph, make the bars equally wide. 

Artistically speaking, bar graphs are a bit dull. It is tempting to replace the bars 
with pictures for greater eye appeal. Don’t do it! The following example shows why. 


Who Buys iMacs? 


Beware the pictograph! 


When Apple, Inc., introduced the iMac, the company wanted to know whether 
this new computer was expanding Apple’s market share. Was the iMac mainly be- 
ing bought by previous Macintosh owners, or was it being purchased by first-time 
computer buyers and by previous PC users who were switching over? To find out, 
Apple hired a firm to conduct a survey of 500 iMac customers. Each customer was 
categorized as a new computer purchaser, a previous PC owner, or a previous Mac- 
intosh owner. The table summarizes the survey results.’ 
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400 7 


Previous ownership Count Percent (%) 
None 85 17.0 

al PC 60 12.0 

2 Macintosh 355 71.0 

S 200 4 

5 Total 500 100.0 

2 

; ~ 

Z 100- PROBLEM: 

(i) o , = = (a) Here’s aclever graph of the data that uses pictures instead of the more 
= = traditional bars. How is this graph misleading? 


T 
None Windows Macintosh 


(b) Two possible bar graphs of the data are shown below. Which one could be 
considered deceptive? Why? 


Previous computer 


80 7 80 - 
60 4 60 — 
50 4 
& ge 50- 
v vo 
2 40-4 Z 
< a 40+ 
205 
30 7 
20 - 
: None PC Macintosh None PC Macintosh 
Previous computer Previous computer 


SOLUTION: 

(a) Although the heights of the pictures are accurate, our eyes respond to the area of the pictures. The 
pictograph makes it seem like the percent of iMac buyers who are former Mac owners is at least ten times 
higher than either of the other two categories, which isn't the case. 

(b) The bar graph on the right is misleading. By starting the vertical scale at 10 instead of O, it looks 
like the percent of iMac buyers who previously owned a PCis less than half the percent who are first-time 
computer buyers. We get a distorted impression of the relative percents in the three categories. 


For Practice Try Exercise 


There are two important lessons to be learned from this example: 
(1) beware the pictograph, and (2) watch those scales. 


Two-Way Tables and Marginal Distributions 


We have learned some techniques for analyzing the distribution of a single cate- 
gorical variable. What do we do when a data set involves two categorical variables? 
We begin by examining the counts or percents in various categories for one of the 
variables. Here’s an example to show what we mean. 
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I’m Gonna Be Rich! 


Two-way tables 


A survey of 4826 randomly selected young adults (aged 19 to 25) asked, “What do 
you think the chances are you will have much more than a middle-class income 
at age 30?” The table below shows the responses.° 


Gender 
Opinion Female Male Total 
Almost no chance 96 98 194 
Some chance but probably not 426 286 712 
A 50-50 chance 696 720 1416 
A good chance 663 758 1421 
Almost certain 486 597 1083 
Total 2367 2459 4826 


This is a two-way table because it describes two categorical variables, gender and 
opinion about becoming rich. Opinion is the row variable because each row in the 
table describes young adults who held one of the five opinions about their chances. 
Because the opinions have a natural order from “Almost no chance” to “Almost 
certain,” the rows are also in this order. Gender is the column variable. ‘Vhe entries 
in the table are the counts of individuals in each opinion-by-gender class. 


How can we best grasp the information contained in the two-way table above? 
First, look at the distribution of each variable separately. The distribution of a cat- 
egorical variable says how often each outcome occurred. The “Total” column 
at the right of the table contains the totals for each of the rows. These row totals 
give the distribution of opinions about becoming rich in the entire group of 4826 
young adults: 194 thought that they had almost no chance, 712 thought they had 
just some chance, and so on. (If the row and column totals are missing, the first 
thing to do in studying a two-way table is to calculate them.) The distributions of 
opinion alone and gender alone are called marginal distributions because they 
appear at the right and bottom margins of the two-way table. 


DEFINITION: Marginal distribution 


The marginal distribution of one of the categorical variables in a two-way table of 
counts is the distribution of values of that variable among all individuals described by 
the table. 
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Percents are often more informative than counts, especially when we are com- 
paring groups of different sizes. We can display the marginal distribution of opin- 
ions in percents by dividing each row total by the table total and converting to a 
percent. For instance, the percent of these young adults who think they are almost 
certain to be rich by age 30 is 


almost certain total 1083 
table total 4826 


0.224 = 22.4% 


I’m Gonna Be Rich! 


Examining a marginal distribution 


PROBLEM: 

(a) Use the data in the two-way table to calculate the marginal distribution (in percents) of 
opinions. 

(b) Makea graph to display the marginal distribution. Describe what you see. 
SOLUTION: 


(a) Wecando four more calculations like the one shown above to obtain the marginal distribution of 
opinions in percents. Here is the complete distribution. 


Response Percent 


Chance of being wealthy by age 30 Almost no chance 194 = 4.0% 
4826 


Some chance M12 = 14.8% 


4826 


A50-50 chance 1416 _.9 38 
ta05 = 223% 


A good chance Ca 29.49 
4s26 


Almost certain 1083 _ 29.4% 
Almost Some s0-s0 Good Almost 4826 
none chance chance chance certain 


Survey response 


(b) Figure 1.3 is a bar graph of the distribution of opinion among these young adults. 
FIGURE 1.3 Bar graph showing the marginal It seems that many young adults are optimistic about their future income. Over 50% 
distribution of opinion about chance of being of those who responded to the survey felt that they had “a good chance” or were 

rich by age 30. “almost certain” to be rich by age 30. 


For Practice Try Exercise 


Each marginal distribution from a two-way table is a distribution for a single 
categorical variable. As we saw earlier, we can use a bar graph or a pie chart to 
display such a distribution. 
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CHECK YOUR UNDERSTANDING 
A random sample of 415 children aged 9 to 17 from the United Kingdom and the United 
States who completed a CensusAtSchool survey in a recent year was selected. Each student's 


Country country of origin was recorded along with which superpower they would most like to have: 
Superpower UK. U.S. the ability to fly, ability to freeze time, invisibility, superstrength, or telepathy (ability to read 
Fly 54 45 minds). The data are summarized in the table.’ 
Freeze time 52 44 1. Use the two-way table to calculate the marginal distribution (in percents) of 
Invisibility 30 37 superpower preferences. 
Superstrength 20 93 2. Make a graph to display the marginal distribution. Describe what you see. 
Telepathy 44 66 


Relationships between Categorical 
Variables: Conditional Distributions 


The two-way table contains much more information than the two marginal distri- 
butions of opinion alone and gender alone. Marginal distributions tell us nothing 
about the relationship between two variables. To describe a relationship between 
two categorical variables, we must calculate some well-chosen percents from the 
counts given in the body of the table. 


Gender 
Opinion Female Male Total 
Almost no chance 96 98 194 
Some chance but probably not 426 286 712 
A 50-50 chance 696 720 1416 
A good chance 663 758 1421 
Almost certain 486 597 1083 
Total 2367 2459 4826 


We can study the opinions of women alone by looking only at the 


Response Percent “Female” column in the two-way table. To find the percent of young 
96 women who think they are almost certain to be rich by age 30, divide 
Almost no chance 2367 1% the count of such women by the total number of women, the column 
total: 
Some chance Oey 18.0% . 
2367 women who are almost certain 486 0.205 = 20.5% 
50-50 chance ~ — 99.4% column total 2367 
hes Doing this for all five entries in the “Female” column gives the con- 
A good chance 2367 7 28.0% ditional distribution of opinion among women. See the table in 
the margin. We use the term “conditional” because this distribution 
Almost certain ABB 50.5% describes only young adults who satisfy the condition that they are 
2367 female. 
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DEFINITION: Conditional distribution 


A conditional distribution of a variable describes the values of that variable among 
individuals who have a specific value of another variable. There is a separate condi- 
tional distribution for each value of the other variable. 


Now let’s examine the men’s opinions. 


I’m Gonna Be Rich! 


Calculating a conditional distribution 

PROBLEM: Calculate the conditional distribution of opinion among the young men. 

SOLUTION: To find the percent of young menwho think they are almost certain to be rich by age 

30, divide the count of such men by the total number of men, the column total: 
menwhoarealmostcertain 597 


= = 24.3% 
column total 2459 


If we do this for all five entries in the “Male” column, we get the conditional distribution shown in the 


table. 


Response Percent 
98 
2459 


286 
2459 


Almost no chance = 4.0% 


Some chance = 11.6% 


A 50-50 chance 720 = 9 
9459 29.3% 

A good chance 758 = 9 
3459 30.8% 


Almost certain 597 = 24.3% 


2459 


For Practice Try Exercise 


There are two sets of conditional distributions for any two-way table: one for the 
column variable and one for the row variable. So far, we have looked at the condi- 
tional distributions of opinion for the two genders. We could also examine the five 
conditional distributions of gender, one for each of the five opinions, by looking 
separately at the rows in the original two-way table. For instance, the conditional 
distribution of gender among those who responded “Almost certain” is 

Female Male 
486 597 


7083. 8?” 1083 7” 
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0- That is, of the young adults who said they were almost cer- 
tain to be rich by age 30, 44.9% were female and 55.1% were 


604 
male. 


7 Because the variable “gender” has only two categories, com- 

satel paring the five conditional distributions amounts to comparing 

30 4 the percents of women among young adults who hold each 

20 4 opinion. Figure 1.4 makes this comparison in a bar graph. The 

ma bar heights do not add to 100%, because each bar represents a 
different group of people. 


Almost Some A 50-50 A good Almost 
none chance chance chance certain 


Percent of women 
in the opinion group 


Opinion 
FIGURE 1.4 Bar graph comparing the percents of 
females among those who hold each opinion about 
their chance of being rich by age 30. 


Which conditional distributions should we compare? Our goal 
THINK all along has been to analyze the relationship between gender and opinion about 
ABOUT IT chances of becoming rich for these young adults. We started by examining the 
conditional distributions of opinion for males and females. Then we looked at the 
conditional distributions of gender for each of the five opinion categories. Which 
of these two gives us the information we want? Here’s a hint: think about whether 
changes in one variable might help explain changes in the other. In this case, it 
seems reasonable to think that gender might influence young adults’ opinions 
about their chances of getting rich. To see whether the data support this idea, we 
should compare the conditional distributions of opinion for women and men. 


OR 


Software will calculate conditional distributions for you. Most programs allow 
you to choose which conditional distributions you want to compute. 


TECHNOLOGY AW AV7ING TWO-WAY TABLES 


Figure 1.5 presents the two conditional distributions of [Session ss 

opinion, for women and for men, and also the marginal Yaante Male 

distribution of opinion for all of the young adults. The A: Almost no chance 96 OR 
a a a a o D 4.06 3.99 

distributions agree (up to rounding) with the results in the 

] ] B: Some chance but probably not 286 
ast two examples. 32.63 


Cc: A 50-50 chance 720 
29.28 


D: A good chance 758 
30.83 


E: Almost certain 597 


FIGURE 1.5 Minitab output for the two-way table of young adults Si ae 

by gender and chance of being rich, along with each entry as a ai 3459 
percent of its column total. The “Female” and “Male” columns give 100.00 100.00 
the conditional distributions of opinion for women and men, and the Cell Contents: eee ee 

“All” column shows the marginal distribution of opinion for all these 

young adults. 
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Putting It All Together: Relationships 
Between Categorical Variables 


Now it’s time to complete our analysis of the relationship between gender and 
‘ opinion about chances of becoming rich later in life. 


Women’s and Men’s Opinions 


Conditional distributions and relationships 
PROBLEM: Based on the survey data, can we conclude that young men and women differ in their 
opinions about the likelihood of future wealth? Give appropriate evidence to support your answer. 


SOLUTION: We suspect that gender might influence a young adult’s opinion about the chance of get- 
ting rich. So we'll compare the conditional distributions of response for men alone and for women alone. 


Percent of Females Percent of Males 


8 = 4.1% 8 4.0% 


Response 


Almost no chance 


Chance of being wealthy by age 30 


Some chance 


A 50-50 chance 


A good chance 


Almost certain 


2367 
426 
3367 = 180% 
696 
2367 
663 
2367 
486 
2367 


= 29.4% 


= 28.0% 


= 20.5% 


2459 
286 
—— = 11.69 
2459 au 
720 
——— = 99.39 
pag | 22-3% 
758 


2459 


597 
2459 


= 30.8% 


= 24.3% 


We'll make a side-by-side bar graph to compare the opinions of males and 
females. Figure 1.6 displays the completed graph. 


Based on the sample data, men seem somewhat more optimistic about 
their future income than women. Men were less likely to say that they have 
“some chance but probably not” than women (11.6% vs. 18.0%). Men were 
more likely to say that they have “a good chance” (30.8% vs. 28.0%) or are 
“almost certain” (24.3% vs. 20.5%) to have much more than a middle-class 
income by age 30 than women were. 


Almost Some 50-50 Good Almost 
none chance chance chance certain 


Opinion 


FIGURE 1.6 Side-by-side bar graph comparing the 
opinions of males and females. 


For Practice Try Exercise 


me) BB Almost certain We could have used a segmented bar graph to compare 

. 1 I A cood chance the distributions of male and female responses in the previous 

aa [Bl A 50-50 chance example. Figure 1.7 shows the completed graph. Each bar has 
= 0-4 I Some chance five segments — one for each of the opinion categories. It s fairly 
8 5 - [By Aimost none difficult to compare the percents of males and females in each 
are category because the “middle” segments in the two bars start 

304 at different locations on the vertical axis. The side-by-side bar 

20 5 graph in Figure 1.6 makes comparison easier. 

105 

0 


Female Male 
Opinion 


FIGURE 1.7 Segmented bar graph com- 
paring the opinions of males and females. 
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Both graphs provide evidence of an association between gender and opinion 
about future wealth in this sample of young adults. Men more often rated their 
chances of becoming rich in the two highest categories; women said “some chance 
but probably not” much more frequently. 


TT 


DEFINITION: Association 


We say that there is an association between two variables if knowing the value of 
one variable helps predict the value of the other. If knowing the value of one variable 
does not help you predict the value of the other, then there is no association between 
the variables. 


Can we say that there is an association between gender and opinion in the popu- 
lation of young adults? Making this determination requires formal inference, which 
will have to wait a few chapters. 


THINK What does “no association” mean? Figure 1.6 (page 17) suggests 
an association between gender and opinion about future wealth for young 
ABOUT IT adults. Knowing that a young adult is male helps us predict his opinion: he is 
more likely than a female to say “a good chance” or “almost certain.” What 
would the graph look like if there was no association between the two vari- 
ables? In that case, knowing that a young adult is male would not help us 
predict his opinion. He would be no more or less likely than a female to say “a 
good chance” or “almost certain” or any of the other possible responses. That 
is, the conditional distributions of opinion about becoming rich would be the 
same for males and females. The segmented bar graphs for the two genders 
would look the same, too. 


o_o _ 


CHECK YOUR UNDERSTANDING 


Let’s complete our analysis of the data on superpower preferences from the previ- 


ous Check Your Understanding (page 14). Here is the two-way table of counts once 


Country ; 
feiss scot again. 
Superpower U.K. U.S. ; - a 
FI 54S 1. Find the conditional distributions of superpower preference among students from 
the United Kingdom and the United States. 
prereile = 2. Make an appropriate graph to compare the conditional distributions. 
Invisibility 30037 _— sich ; 
3. Is there an association between country of origin and superpower preference? Give 
Superstrength 200 23 appropriate evidence to support your answer. 
Telepathy 44 66 


There’s one caution that we need to offer: even a strong association be- 6 
tween two categorical variables can be influenced by other variables lurking 
in the background. The Data Exploration that follows gives you a chance to 
explore this idea using a famous (or infamous) data set. 
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DATA EXPLORATION A Titanic disaster 


In 1912 the luxury liner Titanic, on its first voyage across the Atlantic, 
struck an iceberg and sank. Some passengers got off the ship in lifeboats, 
but many died. The two-way table below gives information about adult 
passengers who lived and who died, by class of travel. 


Class of Travel 


Survival status First class Second class _ Third class 
Lived 197 94 151 
Died 122 167 476 


Here’s another table that displays data on survival status by gender and 
class of travel. 


Class of Travel 


First class Second class Third class 
Survival status Female Male Female Male Female Male 
Lived 140 57 80 14 76 75 
Died 4 118 13 154 89 387 


The movie Titanic, starring Leonardo DiCaprio and Kate Winslet, suggested 
the following: 

e First-class passengers received special treatment in boarding the lifeboats, 
while some other passengers were prevented from doing so (especially third- 
class passengers). 

¢ Women and children boarded the lifeboats first, followed by the men. 


1. What do the data tell us about these two suggestions? Give appropriate 
graphical and numerical evidence to support your answer. 


2. How does gender affect the relationship between class of travel and survival 
status? Explain. 


Summary 


e The distribution of a categorical variable lists the categories and gives the 
count (frequency) or percent (relative frequency) of individuals that fall 
within each category. 


e Pie charts and bar graphs display the distribution of a categorical variable. 
Bar graphs can also compare any set of quantities measured in the same 
units. When examining any graph, ask yourself, “What do I see?” 

e A two-way table of counts organizes data about two categorical variables mea- 


sured for the same set of individuals. Two-way tables are often used to sum- 
marize large amounts of information by grouping outcomes into categories. 
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EXPLORING DATA 


TECHNOLOGY 


CORNER 


The row totals and column totals in a two-way table give the marginal dis- 
tributions of the two individual variables. It is clearer to present these distri- 
butions as percents of the table total. Marginal distributions tell us nothing 
about the relationship between the variables. 


There are two sets of conditional distributions for a two-way table: the dis- 
tributions of the row variable for each value of the column variable, and the 
distributions of the column variable for each value of the row variable. You 
may want to use a side-by-side bar graph (or possibly a segmented bar graph) 
to display conditional distributions. 


There is an association between two variables if knowing the value of one vari- 
able helps predict the value of the other. To see whether there is an association 
between two categorical variables, compare an appropriate set of conditional 
distributions. Remember that even a strong association between two categori- 
cal variables can be influenced by other variables. 


1. Analyzing two-way tables 


Exercises 


9. Cool car colors The most popular colors for cars (c) Would it be appropriate to make a pie chart of these 
and light trucks change over time. Silver passed data? Explain. 
green in 2000 to become the most popular color 10. Spam Email spam is the curse of the Internet. Here 
worldwide, then gave way to shades of white in 2007. is a compilation of the most common types of spam:? 


Here is the distribution of colors for vehicles sold in 


North America in 2011.° Type of spam Percent 
Color Percent of vehicles ee 7 ‘ 
White 23 Hone : 
Black 18 ie : 
Silver 16 ee : 
eisure 
Gray 13 Ae op 
Red 10 roducts 
Scams 9) 
Blue 9 
: Other ae 
Brown/beige 5 
Yellow/gold 3 (a) What percent of spam would fall in the “Other” 
Green 2 category? 
(a) What percent of vehicles had colors other than those (b) Display these data in a bar graph. Be sure to label 
listed? your axes. 
(b) Display these data in a bar graph. Be sure to label (c) Would it be appropriate to make a pie chart of these 


your axes. 


data? Explain. 


Ile 


Birth days Births are not evenly distributed across the 
days of the week. Here are the average numbers of ba- 
bies born on each day of the week in the United States 
in a recent year:!° 


Day Births 
Sunday 1374 
Monday 11,704 
Tuesday 13,169 
Wednesday 13,038 
Thursday 13,013 
Friday 12,664 
Saturday 8459 


Present these data in a well-labeled bar graph. Would 
it also be correct to make a pie chart? 


Suggest some possible reasons why there are fewer 
births on weekends. 


. Deaths among young people Among persons aged 


15 to 24 years in the United States, the leading causes 
of death and number of deaths in a recent year were 
as follows: accidents, 12,015; homicide, 4651; suicide, 
4559; cancer, 1594; heart disease, 984; congenital 
defects, 401.!! 


Make a bar graph to display these data. 


To make a pie chart, you need one additional piece of 
information. What is it? 


. Hispanic origins Below is a pie chart prepared by the 


Census Bureau to show the origin of the more than 
50 million Hispanics in the United States in 2010. 
About what percent of Hispanics are Mexican? Puerto 
Rican? 

Percent Distribution of Hispanics by Type: 2010 


Comment: You see that it is hard to determine num- 
bers from a pie chart. Bar graphs are much easier to 
use. (‘The Census Bureau did include the percents in 
its pie chart.) 
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Which major? About 1.6 million first-year students 
enroll in colleges and universities each year. What do 
they plan to study? The pie chart displays data on the 
percents of first-year students who plan to major in sev- 
eral discipline areas.!* About what percent of first-year 
students plan to major in business? In social science? 


Arts/ 


Technical humanities 


Professional 


Business 


Physical sciences 


Engineering 


Buying music online Young people are more likely 
than older folk to buy music online. Here are the 
percents of people in several age groups who bought 
music online in a recent year:!* 


Age group Bought music online 
12 to 17 years 24% 
18 to 24 years 21% 
25 to 34 years 20% 
35 to 44 years 16% 
45 to 54 years 10% 
55 to 64 years 3% 
65 years and over 1% 


Explain why it is not correct to use a pie chart to 
display these data. 


Make a bar graph of the data. Be sure to label your axes. 


. The audience for movies Here are data on the 


percent of people in several age groups who attended 
a movie in the past 12 months:!° 


Age group Movie attendance 
18 to 24 years 83% 
25 to 34 years 13% 
35 to 44 years 68% 
45 to 54 years 60% 
55 to 64 years 47% 
65 to 74 years 32% 
75 years and over 20% 


Display these data in a bar graph. Describe what 
you see. 
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(b) Would it be correct to make a pie chart of these data? 
Why or why not? 

(c) A movie studio wants to know what percent of the to- 
tal audience for movies is 18 to 24 years old. Explain 
why these data do not answer this question. 


. Going to school Students in a high school statistics 
class were given data about the main method of trans- 
portation to school for a group of 30 students. ‘They 
produced the pictograph shown. 


& = 1 Cycler 
@2~ = 2 Cars 


&==2, = 7 Bus takers 
& =2 Walkers 


Mode of transport 


(a) How is this graph misleading? 
(b) Make a new graph that isn’t misleading. 


18. Oatmeal and cholesterol Does eating oatmeal 
reduce cholesterol? An advertisement included the 
following graph as evidence that the answer is “Yes.” 


Representative cholesterol point drop 
210 5 
208 + 
206 - 


204 5 

202 5 

200 - 

198 5 | | 
196 


Week 1 Week2 Week3 Week 4 


Cholesterol 


(a) How is this graph misleading? 


(b) Make a new graph that isn’t misleading. What do 
you conclude about the relationship between eating 
oatmeal and cholesterol reduction? 


19. Attitudes toward recycled products Recycling is sup- 
me] 13 posed to save resources. Some people think recycled 
products are lower in quality than other products, a 
fact that makes recycling less practical. People who 
use a recycled product may have different opinions 
from those who don’t use it. Here are data on attitudes 
toward coffee filters made of recycled paper from a 
sample of people who do and don’t buy these filters:'° 


Buy recycled filters? 


Think quality is Yes No 
Higher 20 29 
The same i 25 
Lower ) 43 


(a) 


(b) 


20. 


How many people does this table describe? How many 
of these were buyers of coffee filters made of recycled 
paper? 

Give the marginal distribution (in percents) of opinion 
about the quality of recycled filters. What percent 

of the people in the sample think the quality of the 
recycled product is the same or higher than the quality 
of other filters? 


Smoking by students and parents Here are data 
from a survey conducted at eight high schools on 
smoking among students and their parents:!7 


Neither One Both 
parent parent parents 
smokes smokes smoke 
Student does not smoke 1168 1823 1380 
Student smokes 188 416 400 
(a) How many students are described in the two-way 
table? What percent of these students smoke? 
(b) Give the marginal distribution (in percents) of parents’ 


ihe 


23. 


smoking behavior, both in counts and in percents. 


Attitudes toward recycled products Exercise 19 
gives data on the opinions of people who have and 
have not bought coffee filters made from recycled 
paper. ‘To see the relationship between opinion and 
experience with the product, find the conditional 
distributions of opinion (the response variable) for 
buyers and nonbuyers. What do you conclude? 


Smoking by students and parents Refer to Exercise 
20. Calculate three conditional distributions of 
students’ smoking behavior: one for each of the three 
parental smoking categories. Describe the relation- 
ship between the smoking behaviors of students and 
their parents in a few sentences. 


Popular colors—here and there Favorite vehicle 
colors may differ among countries. The side-by-side bar 
graph shows data on the most popular colors of cars in 
a recent year for the United States and Europe. Write a 
few sentences comparing the two distributions. 


3075 


Bus. 


255 i Europe 


Percent 
= R ie) 
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24. Comparing car colors Favorite vehicle colors may 
differ among types of vehicle. Here are data on the 
most popular colors in a recent year for luxury cars 
and for SUVs, trucks, and vans. 


Color Luxury cars (%) SUVs, trucks, vans (%) 
Black 22 13 
Silver 16 16 
White pearl 14 1 
Gray 12 ie 
White 11 25 
Blue U 10 
Red 7 ati 
Yellow/gold 6 1 
Green 3 4 
Beige/brown 2 6 


(a) Make a graph to compare colors by vehicle type. 


(b) Write a few sentences describing what you see. 


25. Snowmobiles in the park Yellowstone National Park 
a] 17 surveyed a random sample of 1526 winter visitors 
& to the park. They asked each person whether they 
owned, rented, or had never used a snowmobile. Re- 
spondents were also asked whether they belonged to 
an environmental organization (like the Sierra Club). 
The two-way table summarizes the survey responses. 


Environmental Club 


No Yes Total 
Never used 445 212 657 
Snowmobile renter 497 Te 574 
Snowmobile owner 279 16 295 
Total 1221 305 1526 


Do these data suggest that there is an association 
between environmental club membership and 
snowmobile use among visitors to Yellowstone Na- 
tional Park? Give appropriate evidence to support 
your answer. 


26. Angry people and heart disease People who get an- 
gry easily tend to have more heart disease. ‘That's the 
conclusion of a study that followed a random sample 
of 12,986 people from three locations for about four 
years. All subjects were free of heart disease at the 
beginning of the study. The subjects took the Spiel- 
berger ‘Trait Anger Scale test, which measures how 
prone a person is to sudden anger. Here are data for 
the 8474 people in the sample who had normal blood 
pressure. CHD stands for “coronary heart disease.” 
This includes people who had heart attacks and those 
who needed medical treatment for heart disease. 
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Low anger Moderate anger Highanger Total 
CHD 53 110 27 190 
No CHD 3057 4621 606 8284 
Total 3110 4731 633 8474 


Do these data support the study’s conclusion about 
the relationship between anger and heart disease? 
Give appropriate evidence to support your answer. 


Multiple choice: Select the best answer for Exercises 27 
to 34. 

Exercises 27 to 30 refer to the following setting. The 
National Survey of Adolescent Health interviewed several 
thousand teens (grades 7 to 12). One question asked was 
“What do you think are the chances you will be married 
in the next ten years?” Here is a two-way table of the 
responses by gender:!® 


Female Male 


Almost no chance 119 103 
Some chance, but probably not 150 Wl 
A 50-50 chance 447 512 
A good chance Us) 710 
Almost certain 1174 756 


27. The percent of females among the respondents was 
(a) 2625. (c) about 46%. —(e) None of these. 
(byRtSie7 (d) about 54%. 


28. Your percent from the previous exercise is part of 

a) the marginal distribution of females. 

b) the marginal distribution of gender. 

c) the marginal distribution of opinion about marriage. 


d) the conditional distribution of gender among adoles- 
cents with a given opinion. 


(e) the conditional distribution of opinion among adoles- 
cents of a given gender. 

29. What percent of females thought that they were 
almost certain to be married in the next ten years? 

(a) About 16% = (c) About 40% (e) About 61% 

(b) About 24%  (d) About 45% 


30. Your percent from the previous exercise is part of 

) the marginal distribution of gender. 

(b) the marginal distribution of opinion about marriage. 

) the conditional distribution of gender among adoles- 

cents with a given opinion. 

(d) the conditional distribution of opinion among adoles- 
cents of a given gender. 

(e) the conditional distribution of “Almost certain” 
among females. 
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31. For which of the following would it be inappropriate 
to display the data with a single pie chart? 


(a) The distribution of car colors for vehicles purchased 
in the last month. 

(b) The distribution of unemployment percentages for 
each of the 50 states. 

(c) The distribution of favorite sport for a sample of 30 
middle school students. 

(d) The distribution of shoe type worn by shoppers at a 
local mall. 

(e) The distribution of presidential candidate preference 
for voters in a state. 


32. The following bar graph shows the distribution of 
favorite subject for a sample of 1000 students. What is 
the most serious problem with the graph? 

280 + 

260 4 
£240 4 
3 220 4 
= 
2 200 4 
= 180 4 
4 160 4 


5 140 + 
120 4 [ 
100 —_ 


Favorite subject 


(a) ‘The subjects are not listed in the correct order. 

(b) This distribution should be displayed with a pie chart. 

(c) The vertical axis should show the percent of students. 

(d) ‘The vertical axis should start at 0 rather than 100. 

(e) The foreign language bar should be broken up by 
language. 


33. Inthe 2010-2011 season, the Dallas Mavericks won 
the NBA championship. The two-way table below 
displays the relationship between the outcome of 
each game in the regular season and whether the 
Mavericks scored at least 100 points. 


100 or more points Fewerthan100 points _‘ Total 


Win 43 14 57 
Loss 4 21 25 
Total 47 35 82 


Which of the following is the best evidence that there 
is an association between the outcome of a game and 
whether or not the Mavericks scored at least 100 points? 


(a) The Mavericks won 57 games and lost only 25 games. 

(b) ‘The Mavericks scored at least 100 points in +7 games 
and fewer than 100 points in only 35 games. 

(c) ‘The Mavericks won 43 games when scoring at least 


100 points and only 14 games when scoring fewer 
than 100 points. 


(d) ‘The Mavericks won a higher proportion of games 
when scoring at least 100 points (43/47) than when 
they scored fewer than 100 points (14/35). 

(e) The combination of scoring 100 or more points and 
winning the game occurred more often (43 times) 
than any other combination of outcomes. 


34. The following partially complete two-way table shows 
the marginal distributions of gender and handedness 
for a sample of 100 high school students. 


Male Female Total 


Right X 90 
Left 10 
Total 40 60 100 


If there is no association between gender and handed- 
ness for the members of the sample, which of the 
following is the correct value of x? 


(a) 20. 

(b) 30. 

(c) 36. 

(a) 45. 

(e) Impossible to determine without more information. 
35. Marginal distributions aren’t the whole story Here 


are the row and column totals for a two-way table with 
two rows and two columns: 


a b 50 
G d 50 
60 40 | 100 


Find two different sets of counts a, b, c, and d for the 
body of the table that give these same totals. This 
shows that the relationship between two variables can- 
not be obtained from the two individual distributions 
of the variables. 


36. Fuel economy (Introduction) Here is a small part 
> ofa data set that describes the fuel economy (in miles 
€ per gallon) of model year 2012 motor vehicles: 


Make and Transmission Number of City Highway 
model Vehicle type type cylinders mpg mpg 
Aston Martin Two-seater Manual 8 14 20 
Vantage 

Honda Civic Subcompact Automatic 4 44 44 
Hybrid 

Toyota Prius Midsize Automatic 4 51 48 
Chevrolet Large Automatic 6 18 30 


Impala 


(a) What are the individuals in this data set? 


(b) What variables were measured? Identify each as cat- 
egorical or quantitative. 
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Le Displaying Quantitative Data 


with Graphs 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Make and interpret dotplots and stemplots of quantita- e Identify the shape of a distribution from a graph 
tive data. as roughly symmetric or skewed. 


Describe the overall pattern (shape, center, and spread) of Make and interpret histograms of quantitative data. 
a distribution and identify any major departures from the Compare distributions of quantitative data using cot- 
pattern (outliers). plots, stemplots, or histograms. 


To display the distribution of a categorical variable, use a bar graph or a pie chart. 
How can we picture the distribution of a quantitative variable? In this section, we 
present several types of graphs that can be used to display quantitative data. 


Dotplots 


One of the simplest graphs to construct and interpret is a dotplot. Each data value 
is shown as a dot above its location on a number line. We'll show how to make a 
dotplot using some sports data. 


Gooooaaaaalllllll! 


How to make a dotplot 


How good was the 2012 U.S. women’s soccer team? With players like Abby Wambach, 
Megan Rapinoe, and Hope Solo, the team put on an impressive showing en route to 
winning the gold medal at the 2012 Olympics in London. Here are data on the num- 
ber of goals scored by the team in the 12 months prior to the 2012 Olympics.”” 


IS A 2 Zs 
5 ee eo ee te ee el, Recent a eae 


Here are the steps in making a dotplot: 


e Draw a horizontal axis (a number line) and label it with the variable name. In 
this case, the variable is number of goals scored. 


¢ Scale the axis. Start by looking at the minimum and maximum values of the 
variable. For these data, the minimum number 

of goals scored was 0, and the maximum was 
es 14. So we mark our scale from 0 to 14, with tick 


6 8 10 marks at every whole-number value. 
Number of goals scored 


f Jeeeccee 


e Mark a dot above the location on the hori- 


FIGURE 1.8 A dotplot of goals scored by the U.S. women’s soccer zontal axis corresponding to each data value. 
team in 2012. Figure 1.8 displays a completed dotplot for the 
soccer data. 
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Making a graph is not an end in itself. The purpose of a graph is to help us 
understand the data. After you make a graph, always ask, “What do I see?” Here is 
a general strategy for interpreting graphs of quantitative data. 


HOW TO EXAMINE THE DISTRIBUTION OF A QUANTITATIVE VARIABLE 


In any graph, look for the overall pattern and for striking departures from 
that pattern. 


e You can describe the overall pattern of a distribution by its shape, 
center, and spread. 


e An important kind of departure is an outlier, an individual value that 
falls outside the overall pattern. 


You'll learn more formal ways of describing shape, center, and spread and iden- 
tifying outliers soon. For now, let’s use our informal understanding of these ideas 
to examine the graph of the U.S. women’s soccer team data. 


Shape: The dotplot has a peak at 4, a single main cluster of dots between 0 and 5, 
and a large gap between 5 and 13. The main cluster has a longer tail to the left of 
the peak than to the right. What does the shape tell us? The U.S. women’s soccer 
team scored between 0 and 5 goals in most of its games, with + being the most 
common value (known as the mode). 


Center: The “midpoint” of the 25 values shown in the graph is the 13th value if 
we count in from either end. You can confirm that the midpoint is at 3. What does 
this number tell us? In a typical game during the 2012 season, the U.S. women’s 
soccer team scored about 3 goals. 


When describing a distribution of Spread: The data vary from 0 goals scored to 14 goals scored. 
quantitative data, don’t forget your F ; . ; 
SOCS (shape, outliers, center, spread)) Outliers: ‘The games in which the women’s team scored 13 goals and 14 goals 


clearly stand out from the overall pattern of the distribution. So we label them 
as possible outliers. (In Section 1.3, we'll establish a procedure for determining 
whether a particular value is an outlier.) 


Are You Driving a Gas Guzzler? 


Interpreting a dotplot 


The Environmental Protection Agency (EPA) is in charge of determining 
and reporting fuel economy ratings for cars (think of those large window 
stickers on a new car). For years, consumers complained that their actual 
gas mileages were noticeably lower than the values reported by the EPA. It 
seems that the EPA’s tests—all of which are done on computerized devices 
to ensure consistency—did not consider things like outdoor temperature, 
use of the air conditioner, or realistic acceleration and braking by driv- 
ers. In 2008 the EPA changed the method for measuring a vehicle’s fuel 


economy to try to give more accurate estimates. 


The following table displays the EPA estimates of highway gas mileage in 


miles per gallon (mpg) for a sample of 24 model year 2012 midsize cars.”” 
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Model mpg Model mpg Model mpg 
Acura RL 24 Dodge Avenger 30 Mercedes-Benz E350 30 
Audi A8 28 Ford Fusion 25 Mitsubishi Galant 30 
Bentley Mulsanne 18 Hyundai Elantra 40 Nissan Maxima 26 
BMW 5501 23 Jaguar XF 23 Saab 9-5 Sedan 28 
Buick Lacrosse 27 Kia Optima 34 Subaru Legacy 31 
Cadillac CTS 27 Lexus ES 350 28 Toyota Prius 48 
Chevrolet Malibu 33 Lincoln MKZ 27 Volkswagen Passat 31 
Chrysler 200 30 Mazda 6 31 Volvo S80 26 


Figure 1.9 shows a dotplot of the data: 


FIGURE 1.9 Dotplot displaying _ 

EPA estimates of highway gas 7 ee ee ae ee ‘ ;: 
mileage for model year 2012 20 25 30 35 40 45 
midsize cars. HwyMPG 


PROBLEM: Describe the shape, center, and spread of the distribution. Are there any outliers? 
SOLUTION: 


Shape: The dotplot has a peak at 30 mpg and a main cluster of values from 23 to 34 mpg. There are 
large gaps between 18 and 23, 34 and 40, 40 and 46 mpg. 

Center: The midpoint of the 24 values shown in the graph is 28. So a typical model year 2012 
midsize car in the sample got about 28 miles per gallon on the highway. 


The 2012 Nissan Leaf, an electric 
Car, got an EPA estimated 92 miles 
per gallon on the highway. With 
the U.S. government's plan to raise © Spread: The datavary from 18 mpg to 46 mpg. 


the fuel economy standard to an ea ee F : ; eae ; 
average of 54.5 mpg by 2025, even Outliers: We see two midsize cars with unusually high gas mileage ratings: the Hyundai Elantra 


more alternative-fuel vehicles like (40 mpg) and the Toyota Prius (48 mpg). The Bentley Mulsanne stands out for its low gas mileage 
the Leaf will have to be developed. rating (1 mpg). All three of these values seem like clear outliers. 


For Practice Try Exercise 


Describing Shape 


When you describe a distribution’s shape, concentrate on the main features. Look 
for major peaks, not for minor ups and downs in the graph. Look for clusters of 
values and obvious gaps. Look for potential outliers, not just for the smallest and 
largest observations. Look for rough symmetry or clear skewness. 


DEFINITION: Symmetric and skewed distributions 


A distribution is roughly symmetric if the right and left sides of the graph are ap- 
proximately mirror images of each other. 


to the 


eeace 


A distribution is skewed to the right if the right side of the graph (containing the half 
of the observations with larger values) is much longer than the left side. It is skewed 


Rot leew satel While Way to the left if the left side of the graph is much longer than the right side. 
should Mr. Starnes go “skewing”? 


o 
a 
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For brevity, we sometimes say “left-skewed” instead of “skewed to the left” and 
“right-skewed” instead of “skewed to the right.” We could also describe a distribu- 
tion with a long tail to the left as “skewed toward negative values” or “negatively 
skewed” and a distribution with a long right tail as “positively skewed.” 

The direction of skewness is the direction of the long tail, not the direc- 
tion where most observations are clustered. See the drawing in the margin @ 
on page 27 for a cute but corny way to help you keep this straight. 


Die Rolls and Quiz Scores 
Describing shape 


Figure 1.10 displays dotplots for two different sets of quantitative data. Let’s practice 
describing the shapes of these distributions. Figure 1.10(a) shows the results of roll- 
ing a pair of fair, six-sided dice and finding the sum of the up-faces 100 times. This 
distribution is roughly symmetric. The dotplot in Figure 1.10(b) shows the scores 
on an AP® Statistics class’s first quiz. This distribution is skewed to the left. 


f | eocccce 
DD | eecccccccccccccocce 
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Dice rolls 


FIGURE 1.10 Dotplots displaying different shapes: (a) roughly symmetric; (6) skewed to the left. 


Although the dotplots in the previous example have different shapes, they do 
have something in common. Both are unimodal, that is, they have a single peak: 
the graph of dice rolls at 7 and the graph of quiz scores at 90. (We don’t count 
minor ups and downs in a graph, like the “bumps” at 9 and 11 in the dice rolls 
dotplot, as “peaks.”) Figure 1.11 is a dotplot of the duration (in minutes) of 220 


FIGURE 1.11 Dotplot displaying 
duration (in minutes) of Old Faithful 
eruptions. This graph has a bimodal 
shape. 
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eruptions of the Old Faithful geyser. We would describe this distribution’s shape 
as roughly symmetric and bimodal because it has two clear peaks: one near 2 
minutes and the other near 4.5 minutes. (Although we could continue the pattern 
with “trimodal” for three peaks and so on, it’s more common to refer to distribu- 
tions with more than two clear peaks as multimodal.) 


What shape will the graph have? Some variables have distributions 
THINK ; ; ee ae 
with predictable shapes. Many biological measurements on individuals from the 
ABOUT IT same species and gender—lengths of bird bills, heights of young women—have 
roughly symmetric distributions. Salaries and home prices, on the other hand, 
usually have right-skewed distributions. There are many moderately priced hous- 
es, for example, but the few very expensive mansions give the distribution of house 
prices a strong right skew. Many distributions have irregular shapes that are nei- 
ther symmetric nor skewed. Some data show other patterns, such as the two peaks 
in Figure 1.11. Use your eyes, describe the pattern you see, and then try to explain 
the pattern. 


oe 


CHECK YOUR UNDERSTANDING 


The Fathom dotplot displays data on the number of siblings reported by each student in 
a statistics class. 


TCNJ survey { Dot Plot i } 
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Siblings 


Describe the shape of the distribution. 
Describe the center of the distribution. 
Describe the spread of the distribution. 


hW NN — 


Identify any potential outliers. 


Comparing Distributions 


Some of the most interesting statistics questions involve comparing two or more 
groups. Which of two popular diets leads to greater long-term weight loss? Who 
texts more—males or females? Does the number of people living in a household 
differ among countries? As the following example suggests, you should always 
discuss shape, center, spread, and possible outliers whenever you compare distri- 
butions of a quantitative variable. 
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Household Size: U.K. versus 
South Africa 


Comparing distributions 


How do the numbers of people living in households in the United Kingdom (U.K.) 
and South Africa compare? To help answer this question, we used CensusAtSchool’s 
“Random Data Selector” to choose 50 students from each country. Figure 1.12 is a 
dotplot of the household sizes reported by the survey respondents. 


PROBLEM: Compare the distributions of household size for these two countries. 
SOLUTION: 


Shape: The distribution of household size for the U.K. sample is roughly symmetric and unimodal, 
while the distribution for the South Africa sample is skewed to the right and unimodal. 


Center: Household sizes for the South African students tended to be larger than for the U.K. 
students. The midpoints of the household sizes for the two groups are 6 people and 4 people, 
respectively. 


Spread: The household sizes for the South African students vary more (from 3 to 26 people) than 
for the U.K. students (from 2 to 6 people). 


Outliers: There don't appear 
to be any outliers in the U.K. 
distribution. The South African 
distribution seems to have two 
outliers in the right tail of the 
distribution—students who 
reported living in households 
with 15 and 26 people. 


South Africa 


AP® EXAM TIP When Household size 
comparing distributions of 

quantitative data, it’s not 

enough just to list values 

for the center and spread of 

each distribution. You have 

to explicitly compare these FIGURE 1.12 Dotplots of 
values, using words like household size for random 
“greater than,” “less than,” samples of 50 students from 
the United Kingdom and 
South Africa. 
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or “about the same as.” Household size 


For Practice Try Exercise 


Notice that we discussed the distributions of household size only for the two 
samples of 50 students in the previous example. We might be interested in whether 
the sample data give us convincing evidence of a difference in the population dis- 
tributions of household size for South Africa and the United Kingdom. We’ll have 
to wait a few chapters to decide whether we can reach such a conclusion, but our 
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ability to make such an inference later will be helped by the fact that the students 
in our samples were chosen at random. 


Stemplots 


Another simple graphical display for fairly small data sets is a stemplot (also called 
a stem-and-leaf plot). Stemplots give us a quick picture of the shape of a distribu- 
tion while including the actual numerical values in the graph. Here’s an example 
that shows how to make a stemplot. 


How Many Shoes? 
Making a stemplot 


How many pairs of shoes does a typical teenager have? To find out, a group of 
AP® Statistics students conducted a survey. They selected a random sample of 20 
female students from their school. Then they recorded the num- 
ber of pairs of shoes that each respondent reported having. Here 
are the data: 


S026) 205 317 Oe 2 ees 385 
ev o0 Nee 342s 30 4O le ps Sl 


Here are the steps in making a stemplot. Figure 1.13 displays the 
process. 


¢ Separate each observation into a stem, consisting of all but 
the final digit, and a leaf, the final digit. Write the stems in a ver- 
tical column with the smallest at the top, and draw a vertical line 
at the right of this column. Do not skip any stems, even if there is 
no data value for a particular stem. For these data, the 


tens digits are the stems, and the ones digits are the 
: ‘ ee : a. Key: 4[9 represents leaves. The stems run from | to 5. 
3 3 | 1840 3 | 0148 een ae ¢ Write each leaf in the row to the right of its stem. 
4) |4]9 419 40 satet shook For example, the female student with 50 pairs of shoes 
5 > | 0701 | WOE would have stem 5 and leaf 0, while the student with 
CE ||, ees Ordon eae vcneniad 31 pairs of shoes would have stem 3 and leaf 1. 


Arrange the leaves in increasing order out from the 


FIGURE 1.13 Making a stemplot of the shoe data. (1) Write es 


the stems. (2) Go through the data and write each leaf on the aa 
proper stem. (3) Arrange the leaves on each stem in order out ° Provide a key that explains in context what the stems 
from the stem. (4) Add a key. and leaves represent. 


The AP® Statistics students in the previous example also collected data from a 
random sample of 20 male students at their school. Here are the numbers of pairs 
of shoes reported by each male in the sample: 


Ie 7 6 > Iz 38 8 FF 10 10 
tO ot 4 5 22- 7 9 WO 35: 7 
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What would happen if we tried the same approach as before: using the first digits 
as stems and the last digits as leaves? The completed stemplot is shown in Figure 
1.14(a). What shape does this distribution have? It is difficult to tell with so few 
stems. We can get a better picture of male shoe ownership by splitting stems. 

In Figure 1.14(a), the values from 0 to 9 are placed on the “0” stem. Figure 1.14(b) 
shows another stemplot of the same data. This time, values having leaves 0 through 
4 are placed on one stem, while values ending in 5 through 9 are placed on another 
stem. Now we can see the single peak, the cluster of values between 4 and 14, and 
the large gap between 22 and 35 more clearly. 


What if we want to compare the number of pairs of shoes that males and females 
have? That calls for a back-to-back stemplot with common stems. The leaves on 
each side are ordered out from the common stem. Figure 1.15 is a back-to-back 
stemplot for the male and female shoe data. Note that we have used the split stems 
from Figure 1.14(b) as the common stems. The values on the right are the male data 
from Figure 1.14(b). The values on the left are the female data, ordered out from 
the stem from right to left. We'll ask you to compare these two distributions shortly. 


0 | 4555677778 Oo] 4 Females Males 
1 0000124 0 | 555677778 0); 4 
2) 2 1 0000124 O | 555677778 
3 | 58 1 ——S= 333 | 1 0000124 
-——— 2 | 2 9511 
2 4332 | 2] 2 Key: 2\2 represents 
3 66 | 2 a male student with 
3 | 58 410 | 3 22 pairs of shoes. 
8} 3] 58 
(a) (b) ; 
Key: 2\2 represents Notice that we include 914 
a male student with this stem even though 100 5 
22 pairs of shoes. it contains no data. 7 5 
FIGURE 1.14 Two stemplots showing the male shoe FIGURE 1.15 Back-to-back stemplot 
data. Figure 1.14(b) improves on the stemplot of comparing numbers of pairs of shoes for 
Figure 1.14(a) by splitting stems. male and female students at a school. 


Instead of rounding, you can also 
truncate (remove one or more digits) 
when data have too many digits. The 
teacher's salary of $42,549 would 
truncate to $42,000. 


Here are a few tips to consider before making a stemplot: 


e Stemplots do not work well for large data sets, where each stem must hold a 
large number of leaves. 


e There is no magic number of stems to use, but five is a good minimum. Too 
few or too many stems will make it difficult to see the distribution’s shape. 


e Ifyou split stems, be sure that each stem is assigned an equal number of pos- 
sible leaf digits (two stems, each with five possible leaves; or five stems, each 
with two possible leaves). 


e You can get more flexibility by rounding the data so that the final digit after 
rounding is suitable as a leaf. Do this when the data have too many digits. 
For example, in reporting teachers’ salaries, using all five digits (for example, 
$42,549) would be unreasonable. It would be better to round to the nearest 
thousand and use 4 as a stem and 3 as a leaf. 


CHECK YOUR UNDERSTANDING 


1. Use the back-to-back stemplot in Figure 1.15 to write a few sentences comparing the 
number of pairs of shoes owned by males and females. Be sure to address shape, center, 
spread, and outliers. 


6] 8 

7 

8] 8 
9179 

10 | 08 

11 | 15566 


12 | 012223444457888999 
13 | 01233333444899 


14 | 02666 
15 | 23 
16 | 8 


Key: 8s represents a state 
in which 8.8% of residents 
are 65 and older. 
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Multiple choice: Select the best answer for Questions 2 through +. 


Here is a stemplot of the percents of residents aged 65 and older in the 50 states and the 
District of Columbia. The stems are whole percents and the leaves are tenths of a percent. 


2. ‘The low outlier is Alaska. What percent of Alaska residents are 65 or older? 
(a) 0.68 (b) 6.8 (c) 8.8 (d) 16.8 (e) 68 
3. Ignoring the outlier, the shape of the distribution is 


(a) skewed to the right. (c) skewed to the middle. — (e) roughly symmetric. 
(b) skewed to the left. (d) bimodal. 


4. The center of the distribution is close to 


(a) 11.6%. (b) 12.0%. (c) 12.8%. (d) 13.3%. (e) 6.8% to 16.8%. 


Histograms 


Quantitative variables often take many values. A graph of the distribution is clearer 
if nearby values are grouped together. One very common graph of the distribution 
of a quantitative variable is a histogram. Let’s look at how to make a histogram 
using data on foreign-born residents in the United States. 


Foreign-Born Residents 
Making a histogram 


What percent of your home state’s residents were born outside the United States? 
A few years ago, the country as a whole had 12.5% foreign-born residents, but the 
states varied from 1.2% in West Virginia to 27.2% in California. The following table 
presents the data for all 50 states.?! The individuals in this data set are the states. The 
variable is the percent of a state’s residents who are foreign-born. It’s much easier 
to see from a graph than from the table how your state compared with other states. 


State Percent State Percent State Percent 
Alabama 2.8 Louisiana 2.9 Ohio 3.6 
Alaska 7.0 Maine 3.2 Oklahoma 49 
Arizona 15.1 Maryland 12.2 Oregon 9.7 
Arkansas 3.8 Massachusetts 14.1 Pennsylvania 5.1 
California 27.2 Michigan 5.9 Rhode Island 12.6 
Colorado 10.3 Minnesota 6.6 South Carolina 41 
Connecticut 12.9 Mississippi 1.8 South Dakota 2.2 
Delaware 8.1 Missouri 3.3 Tennessee 3.9 
Florida 18.9 Montana 1.9 Texas 15.9 
Georgia 9.2 Nebraska 5.6 Utah 8.3 
Hawaii 16.3 Nevada 19.1 Vermont 3.9 
Idaho 5.6 New Hampshire 5.4 Virginia 10.1 
Illinois 13.8 New Jersey 20.1 Washington 12.4 
Indiana 4.2 New Mexico 10.1 West Virginia 1.2 
lowa 3.8 New York 21.6 Wisconsin 44 
Kansas 6.3 North Carolina 6.9 Wyoming 2.7 
Kentucky 2.7 North Dakota 2.1 
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Notice that the frequencies add 
to 50, the number of individuals 
(states) in the data, and that the 
relative frequencies add to 100%. 


FIGURE 1.16 (a) Frequency his- 
togram and (b) relative frequency 
histogram of the distribution 

of the percent of foreign-born 
residents in the 50 states. 


EXPLORING DATA 


Here are the steps in making a histogram: 
Divide the data into classes of equal width. The data in the table vary from 1.2 
to 27.2, so we might choose to use classes of width 5, beginning at 0: 


0-5 5-10 10-15 15-20 20-25 25-30 


But we need to specify the classes so that each individual falls into exactly one class. 
For instance, what if exactly 5.0% of the residents in a state were born outside the 
United States? Because a value of 0.0% would go in the 0-5 class, we'll agree to place 
a value of 5.0% in the 5-10 class, a value of 10.0% in the 10-15 class, and so on. In 
reality, then, our classes for the percent of foreign-born residents in the states are 


Mo<> Siw<il0 IMim<I5 Stmo=a20 Aine Ato <3 


Find the count (frequency) or percent (relative frequency) of individuals in each 
class. Here are a frequency table and a relative frequency table for these data: 


Class Count Class Percent 
Oto <5 20 Oto <5 40 
5 to <10 13 5 to <10 26 
10 to <15 9 10 to <15 18 
15 to <20 5 15 to <20 10 
20 to <25 2 20 to <25 4 
25 to <30 1 25 to <30 2 
Total 50 Total 100 


¢ Label and scale your axes and draw the histogram. Label the horizontal axis 


with the variable whose distribution you are displaying. That’s the percent of 

a state’s residents who are foreign-born. The scale on the horizontal axis runs 
from 0 to 30 because that is the span of the classes we chose. The vertical axis 
contains the scale of counts or percents. Each bar represents a class. The base 
of the bar covers the class, and the bar height is the class frequency or relative 
frequency. Draw the bars with no horizontal space between them unless a class 
is empty, so that its bar has height zero. 


Figure 1.16(a) shows a completed frequency histogram; Figure 1.16(b) shows a 
completed relative frequency histogram. The two graphs look identical except for 
the vertical scales. 


(a) 


20 + 


Number of states 


5 10 


(b) 


40 


This bar has height 13 
because 13 states have 
between 5.0% and 9.9% 
foreign-born residents. 


This bar has height 26 
because 26% of states 
have between 5.0% and 
9.9% foreign-born 
residents. 


30 


20 


Percent of states 


10 15 20 


25 


Percent of foreign-born residents 


30 


25 


Percent of foreign-born residents 


15 20 30 5 


To find the center, remember that we’re 
looking for the value having 25 states 
with smaller percents foreign-born and 


25 with larger. 


Number of states 


0 5 
Percent of foreign-born residents Percent of foreign-born residents 


THINK 
ABOUT IT 
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What do the histograms in Figure 1.16 tell us about the percent of foreign-born 
residents in the states? ‘To find out, we follow our familiar routine: describe the 
pattern and look for any departures from the pattern. 


Shape: The distribution is skewed to the right and unimodal. Most states have few- 
er than 10% foreign-born residents, but several states have much higher percents. 


Center: From the graph, we see that the midpoint would fall somewhere in the 
5.0% to 9.9% class. (Arranging the values in the table in order of size shows that 
the midpoint is 6.1%.) 


Spread: The percent of foreign-born residents in the states varies from less than 


5% to over 25%. 
Outliers: We don’t see any observations outside the overall pattern of the distribution. 


Figure 1.17 shows (a) a frequency histogram and (b) a relative frequency histogram 
of the same distribution, with classes half as wide. The new classes are 0—2.4, 2.54.9, 
and so on. Now California, at 27.2%, stands out as a potential outlier in the right tail. 
The choice of classes in a histogram can influence the appearance of a distribution. 
Histograms with more classes show more detail but may have a less clear pattern. 


(b) 


Percent of states 


10 15 20 25 30 0 5 10 15 20 25 30 


FIGURE 1.17 (a) Frequency histogram and (b) relative frequency histogram of the distribution of 
the percent of foreign-born residents in the 50 states, with classes half as wide as in Figure 1.16. 


Here are some important things to consider when you are constructing a 
histogram: 
¢ Our eyes respond to the area of the bars in a histogram, so be sure to choose 
classes that are all the same width. Then area is determined by height, and all 
classes are fairly represented. 


e There is no one right choice of the classes in a histogram. Too few classes 
will give a “skyscraper” graph, with all values in a few classes with tall bars. 
Too many will produce a “pancake” graph, with most classes having one or 
no observations. Neither choice will give a good picture of the shape of the 
distribution. Five classes is a good minimum. 


What are we actually doing when we make a histogram? 
The dotplot on the left below shows the foreign-born resident data. We grouped 
the data values into classes of width 5, beginning with 0 to <5, as indicated by the 
dashed lines. Then we tallied the number of values in each class. The dotplot on 
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2. 
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the right shows the results of that process. Finally, we drew bars of the appropriate 
height for each class to get the completed histogram shown. 
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Statistical software and graphing calculators will choose the classes for you. The 
default choice is a good starting point, but you should adjust the classes to suit 


your needs. To see what we’re talking about, launch the One-Variable Statistical 
Calculator applet at the book’s Web site, www.whfreeman.com/tps5e. Select the 
“Percent of foreign-born residents” data set, and then click on the “Histogram” 


tab. You can change the number of classes by dragging the horizontal axis with 
your mouse or by entering different values in the boxes above the graph. By doing 
so, it’s easy to see how the choice of classes affects the histogram. Bottom line: Use 
your judgment in choosing classes to display the shape. 


TECHNOLOGY 


HISTOGRAMS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


TI-83/84 


TI-89 


1. Enter the data for the percent of state residents born outside the United States in your Statistics/List Editor. 


Press |STAT]and choose Edit... e Press /APPS 


and select Stats/List Editor. 


‘Type the values into list L1. e ‘Type the values into list. 


Lato=2,8 Hal 


listi[11=2.8 
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Section 1.2. Displaying Quantitative Data with Graphs 


2. Set up a histogram in the Statistics Plots menu. 


e Press| 2nd || y= | (STAT PLOT). e Press[F2]and choose Plot Setup... 
e Press to go into Plotl. e¢ With Plotl highlighted, press to define. 


MORMAL FLOAT AUTO REAL RADIAN CL f Set Hist. fa Define Piet £ 4 


ALEKS] Plot2 Plot3 Bucket Flot Type Histodram > 
bite 


Width to 5. 


x 


Frea:t 


Color: =a 


e Adjust the settings as shown. e Adjust the settings as shown. 


3. Use ZoomStat (ZoomData on the TI-89) to let the calculator choose classes and make a histogram. 


e Press {ZOOM | and choose ZoomStat . e Press | F5 | (ZoomData). 


e Press | TRACE | and ||| to examine the classes. ¢ Press (Trace) and | 4 to examine the classes. 


Note the calculator’s 
unusual choice of 
classes. 


IMORMAL FLOAT AUTO REAL RADIAN CL fl 


Ploti:La 


Mint -1.4 
min=t.2 maxi3.6 neil. 
MOx<h.9A42057 —-n=20 USE ¢3t4 DF TYPE + (ESCIsCANCEL 


4. Adjust the classes to match those in Figure 1.16, and then graph the histogram. 


Press [WINDOW | and enter the values Press [ @ ][F2] (WINDOW) and enter the values shown 
shown below. below. 


Press [GRAPH |, e Press [#][F3 | (GRAPH). 


Press | TRACE to examine the classes. @ Press| F3](‘Ttace) and to examine the classes. 


MORMAL FLOAT AUTO REAL RADIAN CL fl MORMAL FLOAT AUTO REAL RADIAN CL fl 


xres=1 
oexX=.15151515151515 
TraceStep=. 3030303030303 


5. See if you can match the histogram in Figure 1.17. 


AP® EXAM TIP If you’re asked to make a graph on a free-response question, be sure 
to label and scale your axes. Unless your calculator shows labels and scaling, don’t just 
transfer a calculator screen shot to your paper. 
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CHECK YOUR UNDERSTANDING 


Many people believe that the distribution of IQ scores follows a “bell curve,” like the one 
shown in the margin. But is this really how such scores are distributed? ‘The IQ scores of 
60 fifth-grade students chosen at random from one school are shown below.” 


145 139 126 122 125 130 96 110 118 118 
101 142 134 124 112 109 134 113 81 113 
123, 94 100 136 109 131 117 110 127 124 
106 124 115 133 116 102 127 117 109 137 
117. 90 103) 114 139 101 122 105 97 89 
102, 108 110 128 114 112 114 102 82 101 


1. Construct a histogram that displays the distribution of IQ scores effectively. 
2. Describe what you see. Is the distribution bell-shaped? 


Using Histograms Wisely 


We offer several cautions based on common mistakes students make when using 
histograms. 


1. Don’t confuse histograms and bar graphs. Although histograms re- 
semble bar graphs, their details and uses are different. A histogram 
displays the distribution of a quantitative variable. The horizontal 
axis of a histogram is marked in the units of measurement for the variable. 
A bar graph is used to display the distribution of a categorical variable or to 
compare the sizes of different quantities. The horizontal axis of a bar graph 
identifies the categories or quantities being compared. Draw bar graphs with 
blank space between the bars to separate the items being compared. Draw 
histograms with no space, to show the equal-width classes. For comparison, 
here is one of each type of graph from previous examples. 


Histogram Bar graph 


nm 
| 
2 
8 é 10 5 
Ss 
a = 3- 
St - 
Fl 
| gs 67 
2 é 
oa | 
2 | 
SS SS SS 2 ie os 
0 5 1 8615 = 20st eo eS ss eo = oe 
< 
Percent of foreign-born residents - ® a we Y 9 


Radio station format 


2. Use percents instead of counts on the vertical axis when comparing 
distributions with different numbers of observations. Mary was inter- 
ested in comparing the reading levels of a medical journal and an 
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airline magazine. She counted the number of letters in the first 400 words 
of an article in the medical journal and of the first 100 words of an article in 
the airline magazine. Mary then used Minitab statistical software to produce 
the histograms shown in Figure 1.18(a). This figure is misleading—it com- 
pares frequencies, but the two samples were of very different sizes (100 and 
400). Using the same data, Mary’s teacher produced the histograms in Figure 
1.18(b). By using relative frequencies, this figure provides an accurate com- 
parison of word lengths in the two samples. 


(b) 
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FIGURE 1.18 Two sets of histograms comparing word lengths in articles from a journal and from an airline magazine. In (a), the 
vertical scale uses frequencies. The graph in (b) fixes this problem by using percents on the vertical scale. 
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3. Just because a graph looks nice doesn’t make it a meaningful display of & 


data. The students in a small statistics class recorded the number of 

letters in their first names. One student entered the data into an Excel 
spreadsheet and then used Excel’s “chart maker” to produce the graph shown 
below left. What kind of graph is this? It’s a bar graph that compares the raw data 
values. But firstname length is a quantitative variable, so a bar graph is not an ap- 
propriate way to display its distribution. The dotplot on the right is a much better 
choice. 
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Student First name length 


CHECK YOUR UNDERSTANDING 


About 1.6 million first-year students enroll in colleges and universities each year. What 
do they plan to study? The graph on the next page displays data on the percents of first- 
year students who plan to major in several discipline areas.”* 
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EXPLORING DATA 


1. 


20 7 


Percent of students who plan to major 


Field of study 


Is this a bar graph or a histogram? Explain. 


2. Would it be correct to describe this distribution as right-skewed? Why or why not? 


Summary 


You can use a dotplot, stemplot, or histogram to show the distribution of a 
quantitative variable. A dotplot displays individual values on a number line. 
Stemplots separate each observation into a stem and a one-digit leaf. His- 
tograms plot the counts (frequencies) or percents (relative frequencies) of 
values in equal-width classes. 


When examining any graph, look for an overall pattern and for notable 
departures from that pattern. Shape, center, and spread describe the overall 
pattern of the distribution of a quantitative variable. Outliers are observations 
that lie outside the overall pattern of a distribution. Always look for outliers 
and try to explain them. Don’t forget your SOCS! 


Some distributions have simple shapes, such as symmetric, skewed to the 
left, or skewed to the right. The number of modes (major peaks) is another 
aspect of overall shape. So are distinct clusters and gaps. Not all distributions 
have a simple overall shape, especially when there are few observations. 
When comparing distributions of quantitative data, be sure to compare 
shape, center, spread, and possible outliers. 

Remember: histograms are for quantitative data; bar graphs are for categori- 


cal data. Also, be sure to use relative frequency histograms when comparing 
data sets of different sizes. 
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TECHNOLOGY 
CORNER 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


2. Histograms on the calculator 


Exercises 


37. Feeling sleepy? Students in a college statistics class 


responded to a survey designed by their teacher. One 
of the survey questions was “How much sleep did you 
get last night?” Here are the data (in hours): 


oo 
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(a) Make a dotplot to display the data. 


) Describe the overall pattern of the distribution and 
any departures from that pattern. 


38. Olympic gold! The following table displays the total 


number of gold medals won by a sample of countries 
in the 2012 Summer Olympic Games in London. 


Gold Gold 
Country medals Country medals 
Sri Lanka 0 Thailand 0 
China 38 Kuwait 0 
Vietnam 0 Bahamas 1 
Great Britain 29 Kenya 2 
Norway 2 Trinidad and Tobago 1 
Romania 2 Greece 0 
Switzerland 2 Mozambique 0 
Armenia 0 Kazakhstan 7 
Netherlands 6 Denmark 2 
India 0 Latvia 1 
Georgia 1 Czech Republic 4 
Kyrgyzstan 0 Hungary 8 
Costa Rica 0 Sweden 1 
Brazil 3 Uruguay 0 
Uzbekistan 1 United States 46 


(a) Make a dotplot to display these data. Describe the 


overall pattern of the distribution and any departures 
from that pattern. 


(b) 


(a) 
(b) 


40. 


Overall, 205 countries participated in the 2012 Sum- 
mer Olympics, of which 54 won at least one gold 
medal. Do you believe that the sample of countries 
listed in the table is representative of this larger popu- 
lation? Why or why not? 


. U.S. women’s soccer— 2012 Earlier, we examined 


data on the number of goals scored by the U.S. wom- 
en’s soccer team in games played in the 12 months 
prior to the 2012 Olympics. The dotplot below 
displays the goal differential for those same games, 
computed as U.S. score minus opponent's score. 


Soccer 201 
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Explain what the dot above —1 represents. 


What does the graph tell us about how well the team 
did in 2012? Be specific. 


Fuel efficiency In an earlier example, we examined 
data on highway gas mileages of model year 2012 
midsize cars. The following dotplot shows the differ- 
ence (highway — city) in EPA mileage ratings for each 
of the 24 car models from the earlier example. 


(a) 
(b) 


2 0 2 4 6 8 10 


Difference (highway — city) 
Explain what the dot above 12 represents. 


What does the graph tell us about fuel economy in the 
city versus on the highway for these car models? Be 
specific. 
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Dates on coins Suppose that you and your friends 
emptied your pockets of coins and recorded the year 
marked on each coin. The distribution of dates would 
be skewed to the left. Explain why. 


Phone numbers ‘The dotplot below displays the last 
digit of 100 phone numbers chosen at random from 
a phone book. Describe the shape of the distribution. 
Does this shape make sense to you? Explain. 


O  eeeccece 

Ny + eeeccccoce 

O\ + eececee 

CO + eeccccccce 
— eeccccce 


ds 


Last digit 


Creative writing Do external rewards—things like 
money, praise, fame, and grades—promote creativity? 
Researcher Teresa Amabile designed an experiment 
to find out. She recruited 47 experienced creative 
writers who were college students and divided them 
into two groups using a chance process (like draw- 
ing names from a hat). The students in one group 
were given a list of statements about external reasons 
(E) for writing, such as public recognition, making 
money, or pleasing their parents. Students in the oth- 
er group were given a list of statements about internal 
reasons (1) for writing, such as expressing yourself and 
enjoying playing with words. Both groups were then 
instructed to write a poem about laughter. Each stu- 
dent’s poem was rated separately by 12 different poets 
using a creativity scale.”* These ratings were averaged 
to obtain an overall creativity score for each poem. 

Dotplots of the two groups’ creativity scores are 
shown below. Compare the two distributions. What 
do you conclude about whether external rewards 
promote creativity? 


Average rating 


Reward 


I cee eovecceccece © e 


0 5 10 15 20 2 30 
Average rating 


Healthy cereal? Researchers collected data on 77 
brands of cereal at a local supermarket.” For each 
brand, the sugar content (grams per serving) and the 
shelf in the store on which the cereal was located (1 = 
bottom, 2 = middle, 3 = top) were recorded. A dotplot 


“1 


of the data is shown below. Compare the three distribu- 
tions. Critics claim that supermarkets tend to put sugary 
kids’ cereals on lower shelves, where the kids can see 
them. Do the data from this study support this claim? 


o 
N 
> 
n 


8 10 12 14 16 
sugars 


Where do the young live? Below is a stemplot of 

the percent of residents aged 25 to 34 in each of the 
50) states. As in the stemplot for older residents (page 
33), the stems are whole percents, and the leaves are 
tenths of a percent. This time, each stem has been 
split in two, with values having leaves 0 through 4 
placed on one stem, and values ending in 5 through 9 
placed on another stem. 


11 | 44 
11 | 66778 
12 | 0134 


12 | 666778888 
13 | 0000001111444 
13 | 7788999 


14 | 0044 

14 | 567 

“Sy || “Wal 

415) 

1610 
Why did we split stems? 


Give an appropriate key for this stemplot. 


Describe the shape, center, and spread of the distribu- 
tion. Are there any outliers? 


. Watch that caffeine! The U.S. Food and Drug 


Administration (USFDA) limits the amount of caffeine 
in a 12-ounce can of carbonated beverage to 72 milli- 
grams. [hat translates to a maximum of 48 milligrams 
of caffeine per 8-ounce serving. Data on the caffeine 
content of popular soft drinks (in milligrams per 
8-ounce serving) are displayed in the stemplot below. 


556 

033344 
55667778888899 
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Why did we split stems? 
Give an appropriate key for this graph. 


Describe the shape, center, and spread of the distribu- 
tion. Are there any outliers? 


ate 
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El Nifio and the monsoon It appears that E11 Nifio, 
the periodic warming of the Pacific Ocean west of 
South America, affects the monsoon rains that are es- 
sential for agriculture in India. Here are the monsoon 
rains (in millimeters) for the 23 strong El Nifio years 
between 1871 and 2004:7° 


628 
790 


669 740 651 710 736 717 698 653 604 781 784 
811 830 858 858 896 806 790 792 957 872 


48. 


To make a stemplot of these rainfall amounts, round 
the data to the nearest 10, so that stems are hundreds 
of millimeters and leaves are tens of millimeters. 
Make two stemplots, with and without splitting the 
stems. Which plot do you prefer? Why? 


Describe the shape, center, and spread of the 
distribution. 


The average monsoon rainfall for all years from 1871 
to 2004 is about 850 millimeters. What effect does El 


Nifio appear to have on monsoon rains? 


Shopping spree A marketing consultant observed 
50 consecutive shoppers at a supermarket. One vari- 
able of interest was how much each shopper spent in 
the store. Here are the data (in dollars), arranged in 
increasing order: 


3.11 
18.36 
24.58 
36.37 
50.39 


8.88 
18.43 
29.13 
38.64 
52.75 


9.26 
19.27 
26.24 
39.16 
54.80 


10.81 
19.50 
26.26 
41.02 
59.07 


12.69 
19.54 
27.65 
42.97 
61.22 


13.78 
20.16 
28.06 
44.08 
70.32 


15.23 
20.59 
28.08 
44.67 
82.70 


15.62 
22.22 
28.38 
45.40 
85.76 


17.00 17.39 
23.04 24.47 
32.03 34.98 
46.69 48.65 
86.37 93.34 


(a) 


ao. 


Round each amount to the nearest dollar. Then make 
a stemplot using tens of dollars as the stems and dol- 
lars as the leaves. 


Make another stemplot of the data by splitting stems. 
Which of the plots shows the shape of the distribution 
better? 


Write a few sentences describing the amount of 
money spent by shoppers at this supermarket. 


Do women study more than men? We asked the 
students in a large first-year college class how many 
minutes they studied on a typical weeknight. Here are 
the responses of random samples of 30 women and 30 
men from the class: 


180 
120 
150 
200 
120 

90 


Women Men 
120 180 360 240 90 120 30 90 200 
180 120 240 170 SOAS oO 20 ero 


120 180 180 150 150 120 60 
150 180 150 180 

60 120 180 180 30 
240 180 115 120 0 


(a) 


(b) 


50. 


(a) 


(b) 
He 


(c) 
(d) 


Examine the data. Why are you not surprised that 
most responses are multiples of 10 minutes? Are there 
any responses you consider suspicious? 

Make a back-to-back stemplot to compare the two sam- 


ples. Does it appear that women study more than men 
(or at least claim that they do)? Justify your answer. 


Basketball playoffs Here are the numbers of points 
scored by teams in the California Division LAAA high 
school basketball playoffs in a single day’s games:”” 


71 38 52 47 55 53 76 65 77 63 65 63 68 
54 64 62 87 47 64 56 78 64 58 51 91 74 
71 41 67 62 106 46 


On the same day, the final scores of games in Divi- 
sion V-AA were 


98 45 67 44 74 60 96 54 92 72 93 46 
98 67 62 37 37 36 69 44 86 66 66 58 


Construct a back-to-back stemplot to compare the 
points scored by the 32 teams in the Division LAAA 
playoffs and the 24 teams in the Division V-AA playoffs. 


Write a few sentences comparing the two distributions. 


Returns on common stocks ‘The return on a stock 

is the change in its market price plus any dividend 
payments made. Total return is usually expressed as a 
percent of the beginning price. The figure below shows 
a histogram of the distribution of the monthly returns 
for all common stocks listed on U.S. markets over a 
273-month period.”* The extreme low outlier represents 
the market crash of October 1987, when stocks lost 23% 
of their value in one month. 


80 + 


Number of months 


25 -20 -15 -10 -5 0 5 10 15 
Monthly percent return on common stocks 


Describe the overall shape of the distribution of 
monthly returns. 

What is the approximate center of this distribution? 
Approximately what were the smallest and largest 
monthly returns, leaving out the outliers? 


A return less than zero means that stocks lost value 
in that month. About what percent of all months had 
returns less than zero? 
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52. Shakespeare The histogram below shows the dis- (CO2), which contributes to global warming. The 
tribution of lengths of words used in Shakespeare’s table below displays CO2 emissions per person from 
plays.”” Describe the shape, center, and spread of this countries with populations of at least 20 million. ’! 
distribution. 


(a) Make a histogram of the data using classes of width 2, 
starting at 0. 


N 
Nn 


(b) Describe the shape, center, and spread of the distribu- 
tion. Which countries are outliers? 


NY 
o 


= 
nn 


Carbon dioxide emissions 
(metric tons per person) 


eS 
Oo 


Country CO, Country C0, 


Percent of Shakespeare’s words 


: Algeria 2.6 Mexico 3.7 
; Argentina 3.6 Morocco 1.4 
il 29 SS @ S 6 FF B © 1 hh wm Australia 18.4 Myanmar 0.2 
Number of letters in word Bangladesh 0.3 Nepal 0.1 
Brazil 1.8 Nigeria 0.4 
53. Traveling to work How long do people travel each Caiaaa 17.0 Pakistan 08 
day to get to work? The following table gives the : 
average travel times to work (in minutes) for workers Ghia 3.9 Peku ap 
in each state and the District of Columbia who are at Colombia 1.3 Philippines 0.9 
least 16 years old and don’t work at home.*” Congo 0.2 Poland 7.8 
Egypt 2.0 Romania 4.2 
pe 228 ce, ieee | Ethiopia 04 Russia 10.8 
as eee esas aes France 6.2 Saudi Arabia 13.8 
a ace be =e pe are Germany 9.9 South Africa 7.0 
AR 20.7 MA 26.6 PA 25.0 Ghana 03 Spain 79 
eee eet ee India 14 Sudan 03 
oe co a Gee me HES Indonesia 1.6 Tanzania 0.1 
CT 24.1 MS 24.0 SD 15.9 an 6.0 Tiaiend 33 
DE 23.6 MO 22.9 TN Zod Iraq 29 Turkey 3.0 
FL 25.9 MT 17.6 TX 24.6 Italy 78 ane 63 
Ge Phe NE Oder UT 208 Japan 9.5 United Kingdom 8.8 
Hie eee ame Meus Kenya 0.3 United States 19.6 
Bet eee Ue 29 Korea, North 3.3 Uzbekistan 42 
I a Ne aol ve ac Korea, South 9.3 Venezuela 5.4 
IN aoe NM aus ad aol Malaysia 5.5 Vietnam 1.0 
IA 18.2 NY 30.9 WI 20.8 
KS 18.5 NC 23.4 WY 17.9 
KY 20 A ND 155 De 99.2 55. DRP test scores There are many ways to measure 
the reading ability of children. One frequently used 
; ; ; test is the Degree of Reading Power (DRP). Ina 
(a) Make a histogram of the travel times using classes of research study on third-grade students, the DRP was 
width 2 minutes, starting at 14 minutes. That is, the administered toa smilie” ei crores were: 
first class is 14 to 16 minutes, the second is 16 to 18 
minutes, and so on. 40 26 39 14 42 18 25 43 46 27 19 


47 19 26 35 34 15 44 40 38 31 46 
52 25 35 35 33 29 34 41 49 28 52 
47 35 48 22 33 41 51 27 14 54 45 


(b) The shape of the distribution is a bit irregular. Is it 
closer to symmetric or skewed? Describe the center 
and spread of the distribution. Are there any outliers? 


54. Carbon dioxide emissions Burning fuels in power Make a histogram to display the data. Write a para- 
plants and motor vehicles emits carbon dioxide graph describing the distribution of DRP scores. 
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56. Drive time Professor Moore, who lives a few miles 
outside a college town, records the time he takes to 
drive to the college each morning. Here are the times 
(in minutes) for 42 consecutive weekdays: 


8.25 
9.00 
8.33 
8.67 


7.83 
8.50 
7.83 
10.17 


8.30 
9.00 
7.92 
8.75 


8.42 
7.15 
8.58 
8.58 


8.50 
7.92 
7.83 
8.67 


8.67 
8.00 
8.42 
9.17 


8.17 
8.08 
TET 
9.08 


9.00 
8.42 
7.42 
8.83 


9.00 
8.75 
6.75 
8.67 


8.17 7.92 
8.08 9.75 
7.42 8.50 


Make a histogram to display the data. Write a para- 
graph describing the distribution of Professor Moore’s 
drive times. 


57. The statistics of writing style Numerical data can 
distinguish different types of writing and, sometimes, 
even individual authors. Here are data on the percent 
of words of | to 15 letters used in articles in Popular 
Science magazine:** 


Lee il 2 8 4 oh & 7 & 8 WO Wl 12 Ws Wd Wb 
Percent: 3.6 14.818.7 16.0 12.5 8.2 8.15.9 4.4 3.6 2.1 0.9 0.6 0.4 0.2 


(a) Make a histogram of this distribution. Describe its 
shape, center, and spread. 


(b) How does the distribution of lengths of words used in 
Popular Science compare with the similar distribution for 
Shakespeare’s plays in Exercise 52? Look in particular at 
short words (2, 3, and 4 letters) and very long words (more 
than 10 letters). 


58. Chest out, Soldier! In 1846, a published paper pro- 
vided chest measurements (in inches) of 5738 Scottish 


militiamen. The table below summarizes the data. ** 


Chest size Count Chestsize Count 
68 3 4 934 
34 18 42 658 
35 81 43 370 
36 185 44 92 
37 420 45 50 
38 749 46 21 
39 1073 47 4 
40 1079 48 1 


Make a histogram of this distribution. 


Describe the shape, center, and spread of the chest 
measurements distribution. Why might this information 
be useful? 


. Paying for championships Does paying high salaries 
lead to more victories in professional sports? ‘The 
New York Yankees have long been known for having 
Major League Baseball’s highest team payroll. And 
over the years, the team has won many champion- 
ships. This strategy didn’t pay off in 2008, when the 
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Philadelphia Phillies won the World Series. Maybe 
the Yankees didn’t spend enough money that year. 
The graph below shows histograms of the salary dis- 
tributions for the two teams during the 2008 season. 
Why can’t you use this graph to effectively compare 
the team payrolls? 
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. Paying for championships Refer to Exercise 59. 


Here is another graph of the 2008 salary distributions 
for the Yankees and the Phillies. Write a few sen- 
tences comparing these two distributions. 


Yankees 2008 Phillies 2008 
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Birth months Imagine asking a random sample of 60 
students from your school about their birth months. 
Draw a plausible graph of the distribution of birth 
months. Should you use a bar graph or a histogram to 
display the data? 


Die rolls Imagine rolling a fair, six-sided die 60 
times. Draw a plausible graph of the distribution of 
die rolls. Should you use a bar graph or a histogram 
to display the data? 


Who makes more? A manufacturing company is 
reviewing the salaries of its full-time employees below 
the executive level at a large plant. The clerical staff 
is almost entirely female, while a majority of the pro- 
duction workers and technical staff is male. As a re- 
sult, the distributions of salaries for male and female 
employees may be quite different. The following 
table gives the frequencies and relative frequencies 
for women and men. 
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Women Men 
Salary ($1000) | Number % Number % 
10-15 89 11.8 26 bil 
15-20 192 25.4 221 9.0 
20-25 236 Sle 677 27.6 
25-30 111 14.7 823 33.6 
30-35 86 11.4 365 14.9 
35-40 25 on 182 7.4 
40-45 11 15 91 Sid 
45-50 3 0.4 33 es) 
50-55 2 0.3 19 0.8 
55-60 0 0.0 11 0.4 
60-65 0 0.0 0 0.0 
65-70 1 0.1 3 0.1 
Total 756 100.1 2451 99.9 


(a) Explain why the total for women is greater than 100%. 


(b) Make histograms for these data, choosing the vertical 
scale that is most appropriate for comparing the two 
distributions. 


(c) Write a few sentences comparing the salary distribu- 
tions for men and women. 


64. Comparing AP® scores The table below gives the 
distribution of grades earned by students taking the 
AP® Calculus AB and AP® Statistics exams in 2012.* 


No. of Grade 
exams 5 4 3 2 1 
Calculus AB 266,994 67,394 45,523 46,526 27,216 80,335 


19,267 32,521 39,355 27,684 35,032 


Statistics 153,859 


(a) Make an appropriate graphical display to compare the grade 


distributions for AP® Calculus AB and AP® Statistics. 


(b) Write a few sentences comparing the two distributions 
of exam grades. 


65. Population pyramids A population pyramid is a 
helpful graph for examining the distribution of a 
country’s population. Here is a population pyramid for 
Vietnam in the year 2010. Describe what the graph tells 
you about Vietnam’s population that year. Be specific. 


Male Vietnam 2010 Female 


i) 
90 
[| 85 


80 
1/5 
70 
165 
60 
By 
50 
145 
40 
135 
30 
125 
20 
15 
10 
5 
0 


1 00 1 2 3 4 5 
Population (in millions) 


66. Population pyramids Refer to Exercise 65. Here 
is a graph of the projected population distribution 
for China in the year 2050. Describe what the 
graph suggests about China’s future population. Be 
specific. 

Male China 2050 Female 
10 
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60 48 36 24 12 0 0 12) 24 36 48 60 
Population (in millions) 


67. Student survey A survey of a large high school class 
asked the following questions: 

(i) Are you female or male? (In the data, male = 0, 
female = 1.) 

(ii) Are you right-handed or left-handed? (In the data, 
right = 0, left = 1.) 

(iii) What is your height in inches? 

(iv) How many minutes do you study on a typical 
weeknight? 


The figure below shows graphs of the student respons- 
es, in scrambled order and without scale markings. 
Which graph goes with each variable? Explain your 


reasoning. 
|_| 
(a) (b) 
(c) (d) 


68. Choose a graph What type of graph or graphs would 
you make in a study of each of the following issues at 
your school? Explain your choices. 


(a) Which radio stations are most popular with students? 


(b) How many hours per week do students study? 


— 
fe) 
— 


How many calories do students consume per day? 


Section 1.2 Displaying Quantitative Data with Graphs “yy 47 


Multiple choice: Select the best answer for Exercises 69 to 74. 


69. Here are the amounts of money (cents) in coins car- 
ried by 10 students in a statistics class: 50, 35, 0, 97, 
76, 0, 0, 87, 23, 65. ‘To make a stemplot of these data, 
you would use stems 

(@) ©, 1,2, 3. 4h 5.6, 7,8, ©. 

(b) 0, 2,3, 5, 6, 7,8, 9 

@) , 3,5, 6, 7 

(d) 00, 10, 20, 30, 40, 50, 60, 70, 80, 90. 

(e) None of these. 


70. The histogram below shows the heights of 300 ran- 
domly selected high school students. Which of the 
following is the best description of the shape of the 
distribution of heights? 


604 


> 
So 
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Frequency 
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68 72 /6 


64 
Height 


(a) Roughly symmetric and unimodal 

(b) Roughly symmetric and bimodal 

(c) 

(d) Skewed to the left 

(e) Skewed to the right 

71. You look at real estate ads for houses in Naples, Flor- 
ida. There are many houses ranging from $200,000 
to $500,000 in price. The few houses on the water, 
however, have prices up to $15 million. The distribu- 
tion of house prices will be 


a) skewed to the left. 


Roughly symmetric and multimodal 


te) 


( 
(b) roughly symmetric. 
(c) skewed to the right. 
(d) unimodal. 

(e) too high. 

72. The following histogram shows the distribution of the 
percents of women aged 15 and over who have never 
married in each of the 50 states and the District of 
Columbia. Which of the following statements about 
the histogram is correct? 

The center of the distribution is about 36%. 


There are more states with percents above 32 than 
there are states with percents less than 24. 


gs 


(c) It would be better if the values from 34 to 50 were 
deleted on the horizontal axis so there wouldn’t be a 
large gap. 

(d) There was one state with a value of exactly 33%. 

(e) About half of the states had percents between 24% 
and 28%. 


Number of states 


20 24 28 32; 36 40 44 48 By 


Percent of women over 
age 15 who never married 


73. When comparing two distributions, it would be best 
to use relative frequency histograms rather than fre- 
quency histograms when 


(a) the distributions have different shapes. 

(b) the distributions have different spreads. 

(c) the distributions have different centers. 

(d) the distributions have different numbers of observations. 
(e) at least one of the distributions has outliers. 


74. Which of the following is the best reason for choos- 
ing a stemplot rather than a histogram to display the 
distribution of a quantitative variable? 


(a) Stemplots allow you to split stems; histograms don’t. 


(b) Stemplots allow you to see the values of individual 
observations. 


(c) Stemplots are better for displaying very large sets of data. 

(d) Stemplots never require rounding of values. 

(e) Stemplots make it easier to determine the shape of a 
distribution. 

75. Baseball players (Introduction) Here is a small part 

» of a data set that describes Major League Baseball 


€ players as of opening day of the 2012 season: 


Player Team Position Age Height Weight Salary 
Rodriguez, 

Alex Yankees _ Infielder 37s «6-3 225 29,000,000 
Gonzalez, 

Adrian Dodgers _ Infielder 30 =—-6-2 225 21,000,000 
Cruz, 

Nelson Rangers Outfielder 32 6-2 240 5,000,000 
Lester, 

Jon Red Sox _ Pitcher 28 ~=««6-4 240 7,625,000 
Strasburg, 

Stephen Nationals Pitcher 24 «6-4 220 3,000,000 
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(a) What individuals does this data set describe? 


(b) In addition to the player’s name, how many variables 
does the data set contain? Which of these variables 


. Risks of playing soccer (1.1) A study in Sweden 
looked at former elite soccer players, people who had 
played soccer but not at the elite level, and people of 


d= 


are categorical and which are quantitative? the same age who did not play soccer. Here is a two- 
‘ , way table that classifies these individuals by whether 
76. I love my iPod! (1.1) The rating service Arbitron or not they had arthritis of the hip or knee by their 
> asked adults who used several high-tech devices and a aeaese! 
“© "services whether they “loved” using them. Below is a 
graph of the percents who said they did.*° Elite  Non-Elite Did not play 
(a) Summarize what this graph tells you in a sentence or Arthritis 10 9 24 
two. No arthritis 61 206 548 
b) Would itb iate to make a pie chart of th ate 
(b) Hee a pee Os ahaa ce (a) What percent of the people in this study were elite 
aoe y se soccer players? What percent had arthritis? 
b) What percent of the elite soccer players had arthritis? 
‘ P play 
2 Pe What percent of those who had arthritis were elite soc- 
& cer players? 
S -— 
= 30 4 [| 78. Risks of playing soccer (1.1) Refer to Exercise 77. 
a > We suspect that the more serious soccer players have 
5 20 4 © more arthritis later in life. Do the data confirm this 
= suspicion? Give graphical and numerical evidence to 
P grap 
 10- support your answer. 
< 
a SS © x a) = > Ss A A 
2° » s & s & roa ee . ay 
s a 3 cs 


High-tech device or service 


Describing Quantitative Data 
with Numbers 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 
Calculate measures of center (mean, median). Identify outliers using the 1.5 x /QR rule. 
Calculate and interpret measures of spread (range, /QR, Make and interpret boxplots of quantitative data. 


Use appropriate graphs and numerical summaries to 
Choose the most appropriate measure of center and compare distributions of quantitative variables. 
spread in a given setting. 


standard deviation). 


How long do people spend traveling to work? The answer may depend on where 
they live. Here are the travel times in minutes for 15 workers in North Carolina, 
chosen at random by the Census Bureau:*® 


30 20 10 40 25 20 10 60 15 40 5 30 12 10 10 
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We aren’t surprised that most people estimate their travel time in multiples of 
5 minutes. Here is a stemplot of these data: 


o|5 

1 | 000025 

2 | 005 Key: 2/5 isa NC 

3} 00 worker who travels 25 
4100 minutes to work. 

5 

6] 0 


The distribution is single-peaked and right-skewed. The longest travel time 
(60 minutes) may be an outlier. Our main goal in this section is to describe the 
center and spread of this and other distributions of quantitative data with numbers. 


Measuring Center: The Mean 


The most common measure of center is the ordinary arithmetic average, or mean. 


DEFINITION: The mean x 


To find the mean x (pronounced “x-bar”) of a set of observations, add their values 
and divide by the number of observations. If the n observations are xX;, Xo, ... , Xp; 
their mean is 
sum of observations =X; + Xp + +++ + Xp 

n n 


v= 


or, in more compact notation, 


ees 
xX =— 
n 


The > (capital Greek letter sigma) in the formula for the mean is short for “add 
them all up.” The subscripts on the observations x; are just a way of keeping the n 
observations distinct. They do not necessarily indicate order or any other special 
facts about the data. 

Actually, the notation x refers to the mean of a sample. Most of the time, the data 
we'll encounter can be thought of as a sample from some larger population. When 
we need to refer to a population mean, we'll use the symbol ju (Greek letter mu, 
pronounced “mew”). If you have the entire population of data available, then you 
calculate ju in just the way you’d expect: add the values of all the observations, and 
divide by the number of observations. 


Travel Times to Work in North Carolina 


Calculating the mean 


Here is a stemplot of the travel times to work for the sample of 15 North Carolinians. 


005 Key: 25 isa NC 
worker who travels 25 
00 minutes to work. 


anauURWN Oo 
oO 
Oo 
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PROBLEM: 
(a) Find the mean travel time for all 15 workers. 


(b) Calculate the mean again, this time excluding the person who reported a 60-minute travel time 
to work. What do you notice? 


SOLUTION: 
(a) The mean travel time for the sample of 15 North Carolina workers is 


DI Neh es ey OO Ot et dO Dor 


= = 22.5 minutes 
fn fn 15 5 


= 


(b) lfwe leave out the longest travel time, 60 minutes, the mean for the remaining 14 people is 


ie ae hee AU) 
2A ee se = 19.8 minutes 
n fn 14 


This one observation raises the mean by 2.7 minutes. 


eo 


For Practice Try Exercise 


The previous example illustrates an important weakness of the mean 
as a measure of center: the mean is sensitive to the influence of extreme 
observations. These may be outliers, but a skewed distribution that has 
no outliers will also pull the mean toward its long tail. Because the mean cannot 
resist the influence of extreme observations, we say that it is not a resistant mea- 
sure of center. 


THINK What does the mean mean? A group of elementary schoolchildren was 
asked how many pets they have. Here are their responses, arranged from lowest 


ABOUT IT to highest:*” 


[32 2 4 & 7 8.9 
What’s the mean number of pets for this group of children? It’s 


sum of observations 1+3+4+4+4+54+7+8+9 
n 7 9 
But what does that number tell us? Here’s one way to look at it: if every child in 


the group had the same number of pets, each would have 5 pets. In other words, 
the mean is the “fair share” value. 


o_O 


x= 


= 5 pets 


The mean tells us how large each data value would be if the total were split 
equally among all the observations. The mean of a distribution also has a physical 
interpretation, as the following Activity shows. 
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ACTIVITY | Mean as a “balance point” 


MATERIALS: In this Activity, you'll investigate an interesting property of the mean. 


Foot-long ruler, pencil, and]. Stack all 5 pennies above the 6-inch mark on your ruler. Place your pencil 

z bahar group of Stoner the ruler to make a “seesaw” on a desk or table. Move the pencil until the 
ruler balances. What is the relationship between the location of the pencil and 
the mean of the five data values: 6, 6, 6, 6, 6? 


2. Move one penny off the stack to the 8-inch mark on your 
tuler. Now move one other penny so that the ruler balances 
again without moving the pencil. Where did you put the 
other penny? What is the mean of the five data values repre- 
sented by the pennies now? 


3. Move one more penny off the stack to the 2-inch mark 
on your ruler. Now move both remaining pennies from the 
6-inch mark so that the ruler still balances with the pencil in 
the same location. Is the mean of the data values still 6? 


4. Do you see why the mean is sometimes called the “bal- 
ance point” of a distribution? 


Measuring Center: The Median 


In Section 1.2, we introduced the median as an informal measure of center that 
describes the “midpoint” of a distribution. Now it’s time to offer an official “rule” 
for calculating the median. 


DEFINITION: The median 

The median is the midpoint of a distribution, the number such that about half the ob- 

servations are smaller and about half are larger. To find the median of a distribution: 

1. Arrange all observations in order of size, from smallest to largest. 

2. lf the number of observations nis odd, the median is the center observation in 
the ordered list. 

3. If the number of observations nis even, the median is the average of the two 
center observations in the ordered list. 


Medians require little arithmetic, so they are easy to find by hand for small sets 
of data. Arranging even a moderate number of values in order is tedious, however, 
so finding the median by hand for larger sets of data is unpleasant. 
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Travel Times to Work in North Carolina 
Finding the median when n is odd 


What is the median travel time for our 15 North Carolina workers? Here are the 
data arranged in order: 
5 10 10 10 10 12 15 20 20 25 30 30 40 40 60 


‘The count of observations n = 15 is odd. The bold 20 is the center observation in 
the ordered list, with 7 observations to its left and 7 to its right. This is the median, 
20 minutes. 


The next example shows you how to find the median when there is an even 
number of data values. 


Stuck in Traffic 


Finding the median when n is even 


People say that it takes a long time to get to work in New York State due to the 
heavy traffic near big cities. What do the data say? Here are the travel times in 
minutes of 20 randomly chosen New York workers: 


NO 30) 3. 22> 40 20° 10 15. 30 20 
120 8a o> S60 60140" 45 


PROBLEM: 

(a) Make a stemplot of the data. Be sure to include a key. 
(b) Find the median by hand. Show your work. 
SOLUTION: 


(a) Here is a stemplot of the data. The stems indicate10 minutes and the leaves indicate 
minutes. 


5 . 7 . 7 
05555 (b) Because there is an even number of data values, there is no center observation. There is a center 


000s key: als isa pair—the bold 20 and 25 in the stemplot—which have 9 observations before them and 9 after 


my en Von werer. them in the ordered list. The median is the average of these two observations: 
005 who reported a 

45-minute travel 20 + 25 
005 time to work. 7 = 22.5 minutes 


CHA HAWN AS 


5 


For Practice Try Exercise 


of the distribution) is 20 minutes. The mean travel time is higher, 22.5 minutes. The 
0 mean is pulled toward the right tail of this right-skewed distribution. The median, 


aoe Comparing the Mean and the Median 

a Key: 2s isa ms Our discussion of travel times to work in North Carolina illustrates an important dif- 
; 00 Nannies * | ference between the mean and the median. The median travel time (the midpoint 
5 

6 


CONNAOUABWDY HO 


pebl & 


THINK 
ABOUT IT 


Key: 4|S isa 

New York worker 
who reported a 
45-minute travel 


time to work. 


Section 1.3 Describing Quantitative Data with Numbers 4 53 


unlike the mean, is resistant. If the longest travel time were 600 minutes rather than 
60 minutes, the mean would increase to more than 58 minutes but the median 
would not change at all. The outlier just counts as one observation above the center, 
no matter how far above the center it lies. The mean uses the actual value of each 
observation and so will chase a single large observation upward. 

You can compare the behavior of the mean and median by using the Mean and 
Median applet at the book’s Web site, www.whfreeman.com/tps5e. 


COMPARING THE MEAN AND MEDIAN 


‘The mean and median of a roughly symmetric distribution are close together. 
If the distribution is exactly symmetric, the mean and median are exactly the 
same. In a skewed distribution, the mean is usually farther out in the long tail 
than is the median.” 


The mean and median measure center in different ways, and both are useful. 


Should we choose the mean or the median? Many economic vari- 
ables have distributions that are skewed to the right. College tuitions, home pric- 
es, and personal incomes are all right-skewed. In Major League Baseball (MLB), 
for instance, most players earn close to the minimum salary (which was $480,000 
in 2012), while a few earn more than $10 million. The median salary for MLB 
players in 2012 was about $1.08 million—but the mean salary was about $3.44 
million. Alex Rodriguez, Prince Fielder, Joe Mauer, and several other highly paid 
superstars pull the mean up but do not affect the median. 

Reports about incomes and other strongly skewed distributions usually give 
the median (“midpoint”) rather than the mean (“arithmetic average”). However, 
a county that is about to impose a tax of 1% on the incomes of its residents cares 
about the mean income, not the median. The tax revenue will be 1% of total 
income, and the total is the mean times the number of residents. 


OR 


CHECK YOUR UNDERSTANDING 


Here, once again, is the stemplot of travel times to work for 20 randomly selected New 
Yorkers. Earlier, we found that the median was 22.5 minutes. 


1. Based only on the stemplot, would you expect the mean travel time to be less than, 
about the same as, or larger than the median? Why? 


2. Use your calculator to find the mean travel time. Was your answer to Question | 
correct? 


3. Would the mean or the median be a more appropriate summary of the center of this 
distribution of drive times? Justify your answer. 


Measuring Spread: 
Range and Interquartile Range (/QR) 


A measure of center alone can be misleading. The mean annual temperature in 
San Francisco, California, is 57°F —the same as in Springfield, Missouri. But the 
wardrobe needed to live in these two cities is very different! That’s because daily 
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temperatures vary a lot more in Springfield than in San Francisco. A useful numerical 
description of a distribution requires both a measure of center and a measure of spread. 
Note that the range of a data set The simplest measure of variability is the range. To compute the range of a 
is a single number that represents quantitative data set, subtract the smallest value from the largest value. For the 
ihe eres Seno een New York travel time data, the range is 85 — 5 = 80 minutes. The range shows 


and the minimum value. In everyday : é Bd 
inoea une concinsey the full spread of the data. But it depends on only the maximum and minimum 


things like, “The data values range values, which may be outliers. 

from 5 to 85.” Be sure to use the term We can improve our description of spread by also looking at the spread of the 
range correctly, now that you know its — middle half of the data. Here’s the idea. Count up the ordered list of observations, 
statistical definition. starting from the minimum. The first quartile Q, lies one-quarter of the way up 


the list. The second quartile is the median, which is halfway up the list. The third 
quartile Q; lies three-quarters of the way up the list. These quartiles mark out the 
middle half of the distribution. The interquartile range (IQR) measures the range 
of the middle 50% of the data. We need a tule to make this idea exact. The process 
for calculating the quartiles and the JOR uses the rule for finding the median. 


HOW TO CALCULATE THE QUARTILES Q, AND Q; AND THE INTERQUARTILE 
RANGE (/QR) 


Let’s look at how this process works using a familiar set of data. 


Travel Times to Work in North Carolina 
Calculating quartiles 


Our North Carolina sample of 15 workers’ travel times, arranged in increasing 


order, is 
5 10 10 10 10 12 15)20(20 25 30 30 40 40 60 
Median 


Q, is the median 
of the values to the 


Q; is the median 


of the values to the 
right of the median. 


left of the median. 
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There is an odd number of observations, so the median is the middle one, the 
bold 20 in the list. The first quartile is the median of the 7 observations to the left 
of the median. This is the 4th of these 7 observations, so OQ; = 10 minutes (shown 
in blue). The third quartile is the median of the 7 observations to the right of the 
median, Q; = 30 minutes (shown in green). So the spread of the middle 


50% of the travel times is IQR = Q3 — Q; = 30 — 10 = 20 minutes. rr) 


Be sure to leave out the overall median when you locate the quartiles. 


The quartiles and the interquartile range are resistant because they are not 
affected by a few extreme observations. For example, Q3 would still be 30 and the 
IOR would still be 20 if the maximum were 600 rather than 60. 


Stuck in Traffic Again 
Finding and interpreting the |QR 


In an earlier example, we looked at data on travel times to work for 20 randomly 
selected New Yorkers. Here is the stemplot once again: 


Key: 4|S isa 

New York worker 
who reported a 
45-minute travel 
time to work. 


ONAOURWN | OO 


PROBLEM: Find and interpret the interquartile range (/QR). 
SOLUTION: We begin by writing the travel times arranged in increasing order: 


5 10 10 15 1§ 15 15 20 20 aolas 30 30 40 40 45 60 60 65 85 


There is an even number of observations, so the median lies halfway between the middle pair. Its value 
is 22.5 minutes. (We marked the location of the median by |.) The first quartile is the median of the 
10 observations to the left of 22.5. So it’s the average of the two bold 15s: Q, = 15 minutes. The 
third quartile is the median of the 10 observations to the right of 22.5. It’s the average of the bold 
numbers 40 and 45: Qs; = 42.5 minutes. The interquartile range is 


IQR = Q3 — Q, = 42.5 — 15 = 27.5 minutes 


Interpretation: The range of the middle half of travel times for the New Yorkers in the sample is 
27.5 minutes. 


For Practice Try Exercise 
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Identifying Outliers 


In addition to serving as a measure of spread, the interquartile range (IOR) is used 
as part of a rule of thumb for identifying outliers. 


DEFINITION: The 1.5 < /QR rule for outliers 


Call an observation an outlier if it falls more than 1.5 < /QR above the third quartile 
or below the first quartile. 


Any values not falling between 


0} 5 Does the 1.5 X IQR rule identify any outliers for the New York travel time 
1 | 005555 . i 

2] 0005 [Key dsiva data? In the previous example, we found that Q; = 15 minutes, Q3 = 42.5 min- 
3] 00 New York worker | Utes, and IOR = 27.5 minutes. For these data, 

4 | 005 who reported a 

5 45-minute travel Lox IOR = 1,5(27.5) = 41.25 

6} 005 time to work. 

7 

8 


2 QO, — 1.5 X IOR = 15 — 41.25 = —26.25 and 
Q; + 1.5 X IOR = 42.5 + 41.25 = 83.75 


are flagged as outliers. Look again at the stemplot: the only outlier is the longest travel 
time, 85 minutes. The 1.5 X IOR rule suggests that the three next-longest travel times 
(60 and 65 minutes) are just part of the long right tail of this skewed distribution. 


Travel Times to Work in North Carolina 
Identifying outliers 


Earlier, we noted the influence of one long travel time of 60 minutes in our sam- 
ple of 15 North Carolina workers. 


PROBLEM: Determine whether this value is an outlier. 


SOLUTION: Earlier, we found that Q, = 10 minutes, Q, = 30 minutes, and IQR = 20 
minutes. To check for outliers, we first calculate 


Key: 2|5 isa NC 
worker who travels 25 


minutes to work. lroe idk 1.5(20) = 510) 
By the 1.5 X /ARrule, any value greater than 
Qz + 1.5 X IAR= 30 + 30 = 60 


or less than 
Q, — 1.5 X IAR= 10 — 30 = —20 


would be classified as an outlier. The maximum value of 6O minutes is not quite large enough to be an 
outlier because it falls right on the upper cutoff value. 


For Practice Try Exercise 


Whenever you find outliers in your data, try to find an explanation for them. 
Sometimes the explanation is as simple as a typing error, like typing 10.1 as 101. 
Sometimes a measuring device broke down or someone gave a silly response, like 
the student in a class survey who claimed to study 30,000 minutes per night. (Yes, 


AP® EXAM TIP You may be 
asked to determine whether a 
quantitative data set has any 


outliers. Be prepared to state 
and use the rule for identifying 
outliers. 
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that really happened.) In all these cases, you can simply remove the outlier from your 
data. When outliers are “real data,” like the long travel times of some New York work- 
ers, you should choose measures of center and spread that are not greatly affected by 
the outliers. 


The Five-Number Summary and Boxplots 


The smallest and largest observations tell us little about the distribution as a whole, 
but they give information about the tails of the distribution that is missing if we 
know only the median and the quartiles. To get a quick summary of both center 
and spread, use all five numbers. 


DEFINITION: The five-number summary 


The five-number summary of a distribution consists of the smallest observation, 
the first quartile, the median, the third quartile, and the largest observation, written in 
order from smallest to largest. That is, the five-number summary is 


Minimum Q, Median Q; Maximum 


These five numbers divide each distribution roughly into quarters. About 25% 
of the data values fall between the minimum and Q,, about 25% are between OQ; 
and the median, about 25% are between the median and Q3, and about 25% are 
between Q3 and the maximum. 

The five-number summary of a distribution leads to a new graph, the boxplot 
(sometimes called a box-and-whisker plot). 


HOW TO MAKE A BOXPLOT 


e¢ Acentral box is drawn from the first quartile (Q;) to the third quartile (Q3). 
e Aline in the box marks the median. 


e Lines (called whiskers) extend from the box out to the smallest and larg- 
est observations that are not outliers. 


e Outliers are marked with a special symbol such as an asterisk (*). 


Here’s an example that shows how to make a boxplot. 


Home Run King 
Making a boxplot 


Barry Bonds set the major league record by hitting 73 home runs in a single 
season in 2001. On August 7, 2007, Bonds hit his 756th career home run, which 
broke Hank Aaron’s longstanding record of 755. By the end of the 2007 season 
when Bonds retired, he had increased the total to 762. Here are data on the num- 
ber of home runs that Bonds hit in each of his 21 complete seasons: 


NOR ero Os ot AO ay a 
A 37 St ADB 20. A>. > 265 28 
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PROBLEM: Makea boxplot for these data. 
SOLUTION: Let's start by ordering the data values so that we can find the five-number summary. 


16 19 24 25@5 2628 33 33 34 34 37 37 40 42 G5 4546 46 49 73 


Min Q, = 25.5 Median Q;=45 Max 
min, Med R, max Now we check for outliers. Because IQR = 45 — 25.5 = 19.5, by 
16 ASS 54 45 #3 the 1.5 X /@Rrule, any value greater than Qs + 1.5 X IAR= 45+ 
Par 1.5 X 19.5 = 74.25 or less than Q, — 1.5 X |QR= 25.5 — 1.5 
X 19.5 = —3.75 would be classified as an outlier. So there are no 
1s 20 a5 30 55 40 45 50 SS 60 65 FO FS outliers in this data set. Now we are ready to draw the boxplot. See 
Wamber of home runs hit in a season by Barry Bonds the finished graph at left. 


For Practice Try Exercise 


THINK What are we actually doing when we make a boxplot? The top 
dotplot shows Barry Bonds’s home run data. We have marked the first quartile, the 
ABOUT IT median, and the third quartile with blue lines. The process of testing for outliers 
with the 1.5 X IOR rule is shown in visual form. Because there are no outliers, 
we draw the whiskers to the maximum and minimum data values, as shown in the 

finished boxplot at right. 


1.5 x JOR = 29.25 


Lower cutoff for outliers Upper cutoff for outliers 
> e IOR=195 e ° 
i oe—____"_-® ig 
Q, = 25.5 Med = 34 Q, = 45 i 
e| 38 e é e : 
ee eoee e e ee ee e' 
0 10 20 30 40 50 60 70 
Home runs 


Q,=25.55 Med=34 Q,=45 


Min = 16 Max = 73 
e eo 
e e ee e e e 


(0) 10 20 30 40 50 60 70 
Home runs 


Figure 1.19 shows boxplots (this time, they are oriented 


oo [ An outlier is any point vertically) comparing travel times to work for the samples of 
_ +«——] more than 1.5 box lengths) workers from North Carolina and New York. We will identify 
5 : | iii aca outliers as isolated points in the graph (like the * for the maxi- 
a [Whisker ends at ast mum value in the New York data set). 
4. data value that is not Boxplots show less detail than histograms or stemplots, so 
5 an outlier, 65. they are best used for side-by-side comparison of more than 
ia ; one distribution, as in Figure 1.19. As always, be sure to dis- 
£ 30- XJ The interquartile range cuss shape, center, spread, and outliers as part of your com- 
a H is the length of the box. parison. For the travel time to work data: 
ont 4 
aah - - Shape: We see from the graph that both distributions are 
- right-skewed. For both states, the distance from the minimum 


to the median is much smaller than the distance from the 
median to the maximum. 


North Carolina New York 


FIGURE 1.19 Boxplots comparing the travel times to work 


of samples of workers in North Carolina and New York. Center: It appears that travel times to work are generally a 


bit longer in New York than in North Carolina. The median, 
both quartiles, and the maximum are all larger in New York. 


TECHNOLOGY 
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Spread: ‘Travel times are also more variable in New York, as shown by the lengths 
of the boxes (the JOR) and the range. 

Outliers: Earlier, we showed that the maximum travel time of 85 minutes is an 
outlier for the New York data. There are no outliers in the North Carolina sample. 


CHECK YOUR UNDERSTANDING 
The 2011 roster of the Dallas Cowboys professional football team included 8 offensive 
linemen. Their weights (in pounds) were 


310 307 345 324 305 301 290 307 
Find the five-number summary for these data by hand. Show your work. 
Calculate the JOR. Interpret this value in context. 
Determine whether there are any outliers using the 1.5 X JOR rule. 
Draw a boxplot of the data. 


hWN 


MAKING CALCULATOR BOXPLOTS 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


The T1-83/84 and TI-89 can plot up to three boxplots in the same viewing window. Let’s use the calculator to make parallel 
boxplots of the travel time to work data for the samples from North Carolina and New York. 


1. Enter the travel time data for North Carolina in L1/listl and for New York in L2/list2. 


2. Setup two statistics plots: Plot1 to show a boxplot of the North Carolina data and Plot2 to show a boxplot of the New 
York data. The setup for Plot! is shown below. When you define Plot2, be sure to change LI/list] to L2/list2. 


TI-83/84 


NORMAL FLOAT AUTO REAL RADIAN CL 


o 


[AGES] Plot2 Plots 


Mark: FAl+ = - 
Color: izi= 


TI-89 


(____teingts — 


Flot Tvpe [Prod Fax FTot ey 
Marl: 


s : ae a inh 


£ 


Use Frea and Catederics? NO? 
Pre: 


io > TOL 


Note: The calculator offers two types of boxplots: one that shows outliers and one that doesn’t. We'll always use the 
type that identifies outliers. 


3. Use the calculator’s Zoom feature to display the parallel boxplots. Then Trace to view the five-number summary. 


TI-83/84 TL89 
Press [ZOOM] and select ZoomStat. e Press[F5] (ZoomData). 
Press [TRACE |. e Press[F3] (Trace). 


NORMAL FLOAT AUTO REAL RADIAN CL o 


Ploti:La 


iE 
oO 


Med=20 


led: 
Hain 


RAD AUTO FUME 
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CHAPTER 1 


EXPLORING DATA 


Measuring Spread: The Standard Deviation 


The five-number summary is not the most common numerical description of a 
distribution. That distinction belongs to the combination of the mean to measure 
center and the standard deviation to measure spread. The standard deviation and 
its close relative, the variance, measure spread by looking at how far the observa- 
tions are from their mean. Let’s explore this idea using a simple set of data. 


How Many Pets? 
Investigating spread around the mean 


In the Think About It on page 50, we examined data on the number of pets owned 
by a group of 9 children. Here are the data again, arranged from lowest to highest: 
3 ea ee 


Earlier, we found the mean number of pets to be x = 5. Let’s look at where the 
observations in the data set are relative to the mean. 


deviation = —4 


d 


Figure 1.20 displays the data in a dotplot, with the 
mean clearly marked. The data value | is 4 units be- 


eviation = 2 


s | x27 


low the mean. We say that its deviation from the mean 
« % is —4. What about the data value 7? Its deviation is 


 s 7 — 5 =2 (it is 2 units above the mean). The arrows 


in the figure mark these two deviations from the mean. 


5 
, 
i 


6 


“—~ Mean = balance point 


Number of pets The deviations show how much the data vary about 


FIGURE 1.20 Dotplot of the 
pet data with the mean and 
two of the deviations marked. 


their mean. They are the starting point for calculating 
the variance and standard deviation. 


The table below shows the deviation from the mean (x; — x) for each value in the 
data set. Sum the deviations from the mean. You should get 0, because the mean 
is the balance point of the distribution. Because the sum of the deviations from 
the mean will be 0 for any set of data, we need another way to calculate spread 
around the mean. 


Observations Deviations Squared deviations 
Xj xi— X (x; — x)? 
1 1-5=-4 (-4 = 16 
3 3-5=-2 (-2)? = 4 
+ 4-5=-1 (-1? =1 
# 4-5=-1 (-1? =1 
. 4-5=-1 (-1? =1 
5 5-5=0 0? =0 
7 7-5=2 2=4 
8 8-5=3 g=9 
9 9-5=4 4 = 16 


sum = 0 sum = 52 
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How can we fix the problem of the positive and negative deviations canceling 
out? We could take the absolute value of each deviation. Or we could square the 
deviations. For mathematical reasons beyond the scope of this book, statisticians 
choose to square rather than to use absolute values. 


We have added a column to the table that shows the square of each deviation 
(x; — x)’. Add up the squared deviations. Did you get 52? Now we compute the 
average squared deviation —sort of. Instead of dividing by the number of observa- 
tions n, we divide by n — 1: 


MGs Ai ate cleat Wear llist Ol cee tate atu a 


9-1 ye 


a ” . . 
average” squared deviation = 


This value, 6.5, is called the variance. 


Because we squared all the deviations, our units are in “squared pets.” That’s no 
good. We'll take the square root to get back to the correct units—pets. The result- 
ing value is the standard deviation: 


standard deviation = V variance = V6.5 = 2.55 pets 


This 2.55 is the “typical” distance of the values in the data set from the mean. In this 
case, the number of pets typically varies from the mean by about 2.55 pets. 


As you can see, the “average” in the standard deviation calculation is found in 
a rather unexpected way. Why do we divide by n — | instead of n when calculat- 
ing the variance and standard deviation? The answer is complicated but will be 
revealed in Chapter 7. 


DEFINITION: The standard deviation s, and variance s? 


The standard deviation s, measures the typical distance of the values in a distri- 
bution from the mean. It is calculated by finding an average of the squared devia- 
tions and then taking the square root. This average squared deviation is called the 
variance. In symbols, the variance s?is given by 

(x) — X) + (%) — XP + +++ + (xy, — XP | 


2 = = AZ 
Sx Raa nay ee 


and the standard deviation is given by 


= 1 -__ ye 
5 = X%— X) 


Here’s a brief summary of the process for calculating the standard deviation. 
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HOW TO FIND THE STANDARD DEVIATION 


Many calculators report two standard deviations. One is usually labeled o,, 


the symbol for the standard deviation of a population. This standard deviation is 
calculated by dividing the sum of squared deviations by n instead of n — | before 
taking the square root. If your data set consists of the entire population, then it’s 
appropriate to use a,. Most often, the data we’re examining come from a sample. 
In that case, we should use s,. 


DO YO" THINK THERES ANY 
PLACE ON THE PLANET THATS 


More important than the details of calculating s, are the properties that de- 
scribe the usefulness of the standard deviation: 


s, measures spread about the mean and should be used only when the mean is 
chosen as the measure of center. 


s, 1s always greater than or equal to 0. s, = 0 only when there is no variabil- 
ity. This happens only when all observations have the same value. Otherwise, 
s, > 0. As the observations become more spread out about their mean, s, gets 
larger. 


sy has the same units of measurement as the original observations. For example, 
if you measure metabolic rates in calories, both the mean x and the standard 
deviation s, are also in calories. This is one reason to prefer s, to the variance 
sz, which is in squared calories. 


Like the mean x, s, is not resistant. A few outliers can make s, very large. The 
use of squared deviations makes s, even more sensitive than x to a few extreme 
observations. For example, the standard deviation of the travel times 

for the 15 North Carolina workers is 15.23 minutes. If we omit the 
maximum value of 60 minutes, the standard deviation drops to 11.56 
minutes. 
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CHECK YOUR UNDERSTANDING 
The heights (in inches) of the five starters on a basketball team are 67, 72, 76, 76, and 84. 


1. Find the mean. Show your work. 


2. Make a table that shows, for each value, its deviation from the mean and its squared 
deviation from the mean. 


3. Show how to calculate the variance and standard deviation from the values in your 
table. 


4. Interpret the standard deviation in this setting. 


Numerical Summaries with Technology 


Graphing calculators and computer software will calculate numerical summaries 
for you. That will free you up to concentrate on choosing the right methods and 
interpreting your results. 


COMPUTING NUMERICAL SUMMARIES 


TECHNOLOGY Witt TECHNOLOGY 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s find numerical summaries for the travel times of North Carolina and New York workers from the previous ‘Tech- 
nology Corner (page 59). We'll start by showing you the necessary calculator techniques and then look at output from 
computer software. 


I. One-variable statistics on the calculator If you haven’t done so already, enter the North Carolina data in LI/list] 
and the New York data in L2/list2. 
1. Find the summary statistics for the North Carolina travel times. 


TI-83/84 TI-89 


e Press[STAT ||] (CALC) ; choose 1-VarStats. e  Press|F4|(Calc); choose 1-Var Stats. 


OS 2.55 or later: In the dialog box, press | 2nd e ‘Type list] in the list box. Press |ENTER |. 
[2] (L1) and [ENTER | to specify L1 as the List. 
Leave FreqList blank. Arrow down to Calculate 
and press [ENTER |. Older OS: Press [2nd] [1 
L1) and |ENTER |. 


Press |¥| to see the rest of the one-variable statistics for North Carolina. 


IMORMAL FLOAT AUTO REAL RADIAN CL fl MORMAL FLOAT AUTO REAL RADIAN CL fo 


1-Var Stats 

X=22.46666667 tSx=15. 23092093 
Ex=337 | ox=14. 71446756 
=x?=10819 | n=1S 

Sx=15, 23092093 minx=S 
ox=14,.71446756 Q1=10 

n=1S Med=20 

minx=S Q3=30 


Q1=10 maxx=60 
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2. Repeat Step | using L2/list2 to find the summary statistics for the New York travel times. 


i} 


II. Output from statistical software We used Minitab statistical software to produce descriptive statistics for the 
New York and North Carolina travel time data. Minitab allows you to choose which numerical summaries are 
included in the output. 


Descriptive Statistics: Travel time to work 


Variable N Mean StDev Minimum Qi Median Maximum 
NY Time 20 dl AS PAL faite) Se0lO) aS E1010 EXER 50) BSF 1010 
NC Time ALS) 22.47 AB, AS 5,00) ORO 20.00 60.00 


THINK What’s with that third quartile? Earlier, we saw that the quartiles of the 
New York travel times are Q; = 15 and Q; = 42.5. Look at the Minitab output in 
ABOUT IT the Technology Corner. Minitab says that Q; = 43.75. What happened? Minitab 
and some other software use different rules for locating quartiles. Results from the 
various rules are always close to each other, so the differences are rarely important 
in practice. But because of the slight difference, Minitab wouldn't identify the 

maximum value of 85 as an outlier by the 1.5 X IQR rule. 


OR 


Choosing Measures of Center and Spread 


We now have a choice between two descriptions of the center and spread of a 
distribution: the median and JOR, or x and s,. Because x and s, are sensitive to 
extreme observations, they can be misleading when a distribution is strongly 
skewed or has outliers. In these cases, the median and JOR, which are both 
resistant to extreme values, provide a better summary. We'll see in the next 
chapter that the mean and standard deviation are the natural measures of center 
and spread for a very important class of symmetric distributions, the Normal 
distributions. 


CHOOSING MEASURES OF CENTER AND SPREAD 
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Remember that a graph gives the best overall picture of a distribution. 
Numerical measures of center and spread report specific facts about a dis- 
tribution, but they do not describe its entire shape. Numerical summaries 
do not highlight the presence of multiple peaks or clusters, for example. Always 
plot your data. 


Organizing a Statistics Problem 


As you learn more about statistics, you will be asked to solve more complex prob- 
lems. Although no single strategy will work on every problem, it can be helpful to 
have a general framework for organizing your thinking. Here is a four-step process 
you can follow. 


ay HOW TO ORGANIZE A STATISTICS PROBLEM: A FOUR-STEP PROCESS 


State: What’s the question that you’re trying to answer? 


Plan: How will you go about answering the question? What statistical tech- 
niques does this problem call for? 
To keep the four steps straight, just 
remember: Statistics Problems Demand 
Consistency! Conclude: Give your conclusion in the setting of the real-world problem. 


Do: Make graphs and carry out needed calculations. 


Many examples and exercises in this book will tell you what to do—construct a 
graph, perform a calculation, interpret a result, and so on. Real statistics problems 
don’t come with such detailed instructions. From now on, you will encounter 
some examples and exercises that are more realistic. They are marked with the 
four-step icon. Use the four-step process as a guide to solving these problems, as 
the following example illustrates. 


Who Texts More—Males or Females? 


Putting it all together STEP 


For their final project, a group of AP® Statistics students wanted to L. 
compare the texting habits of males and females. They asked a random 

sample of students from their school to record the number of text messages sent 
and received over a two-day period. Here are their data: 


Males: 127 44 28 
Females: 112 203 102 


83 600 6 78 6 5 213 73 20 214 28 11 
54 379 305 179 24 127 65 41 27 298 6 130 0 


What conclusion should the students draw? Give appropriate evidence to support 
your answer. 


STATE: Do males and females at the school differ in their texting habits? 
PLAN: We'll begin by making parallel boxplots of the data about males and females. Then we'll 

calculate one-variable statistics. Finally, we'll compare shape, center, spread, and outliers for the 
two distributions. 
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DO: Figure 1.21 isa sketch of the boxplots we got from our calculator. The table below shows 
numerical summaries for males and females. 


x Sy Min Q, Med Q; Max IQR 
Male 62.4 71.4 0 6 28 83 214 77 
Female 128.3 116.0 0 34 107 191 379 157 
Due to the strong skewness and outliers, we'll use the median and IARinstead 
These two values appeared as of the mean and standard deviation when comparing center and spread. 
one dot in the calculator graph. 
We found them both by tracing. Shape: Both distributions are strongly right-skewed. 


/ Center: Females typically text more than males. The median number of 
Males oo texts for females (107) is about four times as high as for males (28). In 
0\6 28 «83 12% = 213.214 fact, the median for the females is above the third quartile for the males. 
This indicates that over 75% of the males texted less than the “typical” 
Females (median) female. 


Spread: There is much more variation in texting among the females than 
the males. The IQR for females (157) is about twice the IQR for males (77). 


0 100 200 300 400 
Number of text messages in 2-day period Outliers: There are two outliers in the male distribution: students who 
reported 213 and 214 texts in two days. The female distribution has no 
FIGURE 1.21 Parallel boxplots of the texting data. putters: 


CONCLUDE: The data from this survey project give very strong evidence that male and fe- 
male texting habits differ considerably at the school. A typical female sends and receives about 
79 more text messages in a two-day period than a typical male. The males as a group are also 
much more consistent in their texting frequency than the females. 


For Practice Try Exercise 


Now it’s time for you to put what you have learned into practice in the follow- 
ing Data Exploration. 


DATA EXPLORATION Did Mr. Starnes stack his class? 


Mr. Starnes teaches AP® Statistics, but he also does the class scheduling for the 
high school. There are two AP® Statistics classes—one taught by Mr. Starnes and 
one taught by Ms. McGrail. The two teachers give the same first test to their classes 
and grade the test together. Mr. Starnes’s students earned an average score that 
was 8 points higher than the average for Ms. McGrail’s class. Ms. McGrail won- 
ders whether Mr. Starnes might have “adjusted” the class rosters from the com- 
puter scheduling program. In other words, she thinks he might have “stacked” his 
class. He denies this, of course. 

To help resolve the dispute, the teachers collect data on the cumulative grade 
point averages and SAT Math scores of their students. Mr. Starnes provides the 
GPA data from his computer. The students report their SAT Math scores. The fol- 
lowing table shows the data for each student in the two classes. Note that the two 
data values in each row come from a single student. 
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Starnes GPA Starnes SAT-M McGrail GPA McGrail SAT-M 
2.9 670 2.9 620 
2.86 520 3.3 590 
2.6 570 3.98 650 
3.6 710 2.9 600 
3.2 600 3.2 620 
2.7 590 3.5 680 
3.1 640 2.8 500 
3.085 570 2.9 502.5 
3.75 710 3.95 640 
3.4 630 3.1 630 
3.338 630 2.85 580 
3.56 670 2.9 590 
3.8 650 3.245 600 
3.2 660 3.0 600 
3.1 510 3.0 620 

2.8 580 
2.9 600 
3.2 600 


Did Mr. Starnes stack his class? Give appropriate graphical and numerical 
evidence to support your conclusion. 


AP® EXAM TIP Use statistical terms carefully and correctly on the AP® exam. Don’t say 
“mean” if you really mean “median.” Range is a single number; so are Q,, Q3, and /QR. Avoid 
colloquial use of language, like “the outlier skews the mean.” Skewed is a shape. If you misuse 
a term, expect to lose some credit. 


Refer to the chapter-opening Case Study (page 1). You will now use 
what you have learned in this chapter to analyze the data. 


Construct an appropriate graph for comparing the heart rates of 
the women in the three groups. 

Calculate numerical summaries for each group’s data. Which mea- 
sures of center and spread would you choose to compare? Why? 
Determine if there are any outliers in each of the three groups. 
Show your work. 

Write a few sentences comparing the distributions of heart rates 
for the women in the three groups. 

Based on the data, does it appear that the presence of a pet or friend 
reduces heart rate during a stressful task? Justify your answer. 
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Summary 


e A numerical summary of a distribution should report at least its center and 
its spread, or variability. 

e ‘The mean x and the median describe the center of a distribution in differ- 
ent ways. The mean is the average of the observations, and the median is the 
midpoint of the values. 


e When you use the median to indicate the center of a distribution, describe its 
spread using the quartiles. The first quartile Q; has about one-fourth of the 
observations below it, and the third quartile Q3 has about three-fourths of 
the observations below it. The interquartile range (IQR) is the range of the 


middle 50% of the observations and is found by IQR = Q3 — Q). 


e An extreme observation is an outlier if it is smaller than Q; — (1.5 X JOR) or 
larger than Q; + (1.5 X JOR). 


e The five-number summary consisting of the median, the quartiles, and the 
maximum and minimum values provides a quick overall description of a dis- 
tribution. The median describes the center, and the JOR and range describe 
the spread. 


¢ Boxplots based on the five-number summary are useful for comparing dis- 
tributions. The box spans the quartiles and shows the spread of the middle 
half of the distribution. The median is marked within the box. Lines extend 
from the box to the smallest and the largest observations that are not outliers. 
Outliers are plotted as isolated points. 


e The variance s% and especially its square root, the standard devia- 
tion s,, are common measures of spread about the mean. The standard 
deviation s, is zero when there is no variability and gets larger as the spread 
increases. 


e The median is a resistant measure of center because it is relatively unaf- 
fected by extreme observations. The mean is nonresistant. Among measures 
of spread, the IOR is resistant, but the standard deviation and range are not. 


e The mean and standard deviation are good descriptions for roughly sym- 
metric distributions without outliers. They are most useful for the Normal 
distributions introduced in the next chapter. The median and IQR are a bet- 
ter description for skewed distributions. 


e Numerical summaries do not fully describe the shape of a distribution. 
Always plot your data. 


TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


3. Making calculator boxplots 


4. Computing numerical summaries with technology 
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Exercises 


Quiz grades Joey’s first 14 quiz grades in a marking 87. Domain names When it comes to Internet domain 


80. 


83. 


84. 


85. 


86. 


period were 


86 84 91 75 78 80 74 
87 76 96 82 90 98 93 


Calculate the mean. Show your work. 


Cowboys ‘The 2011 roster of the Dallas Cowboys 
professional football team included 7 defensive line- 
men. Their weights (in pounds) were 321, 285, 300, 
285, 286, 293, and 298. Calculate the mean. Show 


your work. 


. Quiz grades Refer to Exercise 79. 


Find the median by hand. Show your work. 

Suppose the heaviest lineman had weighed 341 
pounds instead of 321 pounds. How would this 
change affect the mean and the median? What prop- 
erty of measures of center does this illustrate? 


Incomes of college grads According to the Census 
Bureau, the mean and median income in a recent 
year of people at least 25 years old who had a bach- 
elor’s degree but no higher degree were $48,097 and 
$60,954. Which of these numbers is the mean and 


which is the median? Explain your reasoning. 


House prices ‘The mean and median selling prices 
of existing single-family homes sold in July 2012 were 
$263,200 and $224,200.*! Which of these numbers is 
the mean and which is the median? Explain how you 
know. 


Baseball salaries Suppose that a Major League Base- 
ball team’s mean yearly salary for its players is $1.2 
million and that the team has 25 players on its active 
roster. What is the team’s total annual payroll? If you 
knew only the median salary, would you be able to 
answer this question? Why or why not? 


Mean salary? Last year a small accounting firm paid 
each of its five clerks $22,000, two junior accountants 
$50,000 each, and the firm’s owner $270,000. What is 
the mean salary paid at this firm? How many of the em- 
ployees earn less than the mean? What is the median 
salary? Write a sentence to describe how an unethical 
recruiter could use statistics to mislead prospective 
employees. 


88. 


names, is shorter better? According to one ranking 

otf Web sites in 2012, the top 8 sites (by number of 
“hits”) were google.com, youtube.com, wikipedia.org, 
yahoo.com, amazon.com, ebay.com, craigslist.org, and 
facebook.com. These familiar sites certainly have short 
domain names. The histogram below shows the do- 
main name lengths (in number of letters in the name, 
not including the extensions .com and .org) for the 500 
most popular Web sites. 


Find the median by hand. Show your work. 100 
Suppose Joey has an unexcused absence for the 15th A 
quiz, and he receives a score of zero. Recalculate the z 
mean and the median. What property of measures of g 6 
center does this illustrate? 5 a 
. Cowboys Refer to Exercise 80. 
20 


2 4 6 8 10 12 14 16 
Domain name length 


Estimate the mean and median of the distribution. 
Explain your method clearly. 

If you wanted to argue that shorter domain names were 
more popular, which measure of center would you 
choose—the mean or the median? Justify your answer. 


Do adolescent girls eat fruit? We all know that fruit 
is good for us. Below is a histogram of the number of 
servings of fruit per day claimed by 74 seventeen-year- 
old girls in a study in Pennsylvania.” 


iS 


Number of subjects 


0 1 2 3 4 a 6 a 8 
Servings of fruit per day 


With a little care, you can find the median and the 
quartiles from the histogram. What are these num- 
bers? How did you find them? 

Estimate the mean of the distribution. Explain your 
method clearly. 
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89. 

pg fey (a) 

56 | (b) 
e 


90. 
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Quiz grades Refer to Exercise 79. 
Find and interpret the interquartile range (IOR). 


Determine whether there are any outliers. Show your 
work. 


Cowboys Refer to Exercise 80. 
Find and interpret the interquartile range (IQR). 


Determine whether there are any outliers. Show your 
work. 


. Don’t call me In a September 28, 2008, article titled 


“Letting Our Fingers Do the Talking,” the New York 
‘Times reported that Americans now send more text 
messages than they make phone calls. According to a 
study by Nielsen Mobile, “Teenagers ages 13 to 17 are 
by far the most prolific texters, sending or receiving 
1742 messages a month.” Mr. Williams, a high school 
statistics teacher, was skeptical about the claims in 
the article. So he collected data from his first-period 
statistics class on the number of text messages and 
calls they had sent or received in the past 24 hours. 
Here are the texting data: 


0 7 | Os 8 F 1 B® BY OY WO A 
8 118 72 O 92 52 14 3 3 44 5 42 
(a) Make a boxplot of these data by hand. Be sure to 


check for outliers. 


Explain how these data seem to contradict the claim 
in the article. 


. Acing the first test Here are the scores of Mrs. Liao’s 


students on their first statistics test: 


OSI SS 87255 Oil 4S 7:25 S65 9559 93!5 8 93!5 973 
82 45 88 80 86 855 875 81 78 86 89 
92 91 98 85 825 88 945 43 


Make a boxplot of the test score data by hand. Be sure 
to check for outliers. 


How did the students do on Mrs. Liao’s first test? 
Justify your answer. 


. Texts or calls? Refer to Exercise 91. A boxplot of the 


difference (texts — calls) in the number of texts and 
calls for each student is shown below. 


20 0 20 40 60 80 100 120 
Difference (texts — calls) 


Do these data support the claim in the article about 
texting versus calling? Justify your answer with appro- 
priate evidence. 


(b) 


hire 


(a) 


(b) 


95. 


(c) 


Can we draw any conclusion about the preferences 
of all students in the school based on the data from 
Mr. Williams’s statistics class? Why or why not? 


Electoral votes ‘To become president of the United 
States, a candidate does not have to receive a majority 
of the popular vote. The candidate does have to win a 
majority of the 538 electoral votes that are cast in the 
Electoral College. Here is a stemplot of the number of 
electoral votes for each of the 50 states and the District 
of Columbia. 


0 3333333344444 

0 55555666777788999 
dl 0000111123 

1 BD) Sy T/ 

2 || lal 

2 || 0 

3 14 

5 

4 

4 Key: 1Sisa 
5 state with 15 
5 5 electoral votes. 


Make a boxplot of these data by hand. Be sure to 
check for outliers. 


Which measure of center and spread would you use 
to summarize the distribution—the mean and stan- 
dard deviation or the median and IOR? Justify your 
answer. 


Comparing investments Should you put your mon- 
ey into a fund that buys stocks or a fund that invests in 
real estate? The boxplots compare the daily returns 
(in percent) on a “total stock market” fund and a 

real estate fund over a one-year period.” 


Daily percent return 
oo 
on 
Li 
LU 


Stocks Real estate 


Type of investment 


Read the graph: about what were the highest and low- 
est daily returns on the stock fund? 

Read the graph: the median return was about the 
same on both investments. About what was the me- 
dian return? 

What is the most important difference between the 
two distributions? 
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96. Income and education level Each March, the 
Bureau of Labor Statistics compiles an Annual 
Demographic Supplement to its monthly Cur- 
rent Population Survey.** Data on about 71,067 
individuals between the ages of 25 and 64 who 
were employed full-time were collected in one of 
these surveys. ‘The boxplots below compare the 
distributions of income for people with five levels of 
education. This figure is a variation of the boxplot 
idea: because large data sets often contain very 
extreme observations, we omitted the individuals 
in each category with the top 5% and bottom 5% of 
incomes. Write a brief description of how the distri- 
bution of income changes with the highest level of 
education reached. Give specifics from the graphs 
to support your statements. 


50 ; LH 


= «Cc 


Total income (thousands of dollars) 


LL 
Hi 


Not HS Some 
HS grad grad college 


Bachelor’s Advanced 


97. Phosphate levels ‘The level of various substances in 
the blood influences our health. Here are measure- 
ments of the level of phosphate in the blood of a pa- 
tient, in milligrams of phosphate per deciliter of blood, 
made on 6 consecutive visits to a clinic: 5.6, 5.2, 4.6, 
4.9, 5.7, 6.4. A graph of only 6 observations gives little 
information, so we proceed to compute the mean and 
standard deviation. 


(a) Find the standard deviation from its definition. ‘That 
is, find the deviations of each observation from the 
mean, square the deviations, then obtain the variance 
and the standard deviation. 


(b) Interpret the value of s, you obtained in part (a). 


98. Feeling sleepy? ‘The first four students to arrive for 
a first-period statistics class were asked how much 
sleep (to the nearest hour) they got last night. Their 
responses were 7, 7,9, and 9. 


(a) Find the standard deviation from its definition. ‘That 
is, find the deviations of each observation from the 
mean, square the deviations, then obtain the variance 
and the standard deviation. 


(b) Interpret the value of s, you obtained in part (a). 


(c) Do you think it’s safe to conclude that the mean 
amount of sleep for all 30 students in this class is 
close to 8 hours? Why or why not? 

99. Shopping spree The figure displays computer output 
for data on the amount spent by 50 grocery shoppers. 


Descriptive Statistics: Amount spent 


Total 
Variable Count Mean StVev Minimum Ql Median Q3 Maximur 
Amount spent 50 34.70 21.70 3.11 19.06 27.85 45.72 93.34 


(a) What would you guess is the shape of the distribu- 
tion based only on the computer output? Explain. 


(b) Interpret the value of the standard deviation. 
(c) Are there any outliers? Justify your answer. 


100. C-sections Do male doctors perform more cesarean 
sections (C-sections) than female doctors? A study 
in Switzerland examined the number of cesarean 
sections (surgical deliveries of babies) performed in 
a year by samples of male and female doctors. Here 
are summary statistics for the two distributions: 


xX. Sx Min Q, Med Q; Max /QR 
Male 
doctors 41.333 20.607 20 27 34 50 386 23 
Female 


doctors 19.1 10.126 5 10 185 29 33 19 


(a) Based on the computer output, which distribution 
would you guess has a more symmetrical shape? 
Explain. 

(b) Explain how the IQRs of these two distributions can 
be so similar even though the standard deviations 
are quite different. 


(c) Does it appear that male doctors perform more C- 
sections? Justify your answer. 


101. The IQR Is the interquartile range a resistant mea- 
sure of spread? Give an example of a small data set 
that supports your answer. 

102. What do they measure? For each of the following 

summary statistics, decide (i) whether it could be 

used to measure center or spread and (ii) whether it 
is resistant. 

Q; + Q; Max — Min 

Z eZ, 

103. SD contest This is a standard deviation contest. You 
must choose four numbers from the whole numbers 
0 to 10, with repeats allowed. 


(b) 


(a) Choose four numbers that have the smallest possible 
standard deviation. 

(b) Choose four numbers that have the largest possible 
standard deviation. 


(c) Is more than one choice possible in either part (a) or 
(b)? Explain. 
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104. Measuring spread Which of the distributions 


Frequency 


shown has a larger standard deviation? Justify your 
answer. 


Variable DB 


Variable A 


2 4 6 8 


105. SSHA scores Here are the scores on the Survey of 


ye} 65 


STEP A 


Study Habits and Attitudes (SSHA) for 18 first-year 
college women: 


154 109 137 115 152 140 154 178 101 
103 126 126 137 165 4165 129 200 148 


and for 20 first-year college men: 


108 140 +114 91 180 115 126 
92 169 146 109 132 75 88 
113 151 70 115 187 104 


Do these data support the belief that men and 
women differ in their study habits and attitudes to- 
ward learning? (Note that high scores indicate good 
study habits and attitudes toward learning.) Follow 
the four-step process. 


06. Hummingbirds and tropical flower Research- 


STEP J 
A 


ers from Amherst College studied the relationship 
between varieties of the tropical flower Heliconia on 
the island of Dominica and the different species of 
hummingbirds that fertilize the flowers.*° Over time, 
the researchers believe, the lengths of the flowers and 
the forms of the hummingbirds’ beaks have evolved to 
match each other. If that is true, flower varieties fertil- 
ized by different hummingbird species should have 
distinct distributions of length. 

The table below gives length measurements (in 
millimeters) for samples of three varieties of 
Heliconia, each fertilized by a different species of 
hummingbird. Do these data support the researchers’ 
belief? Follow the four-step process. 


H. bihai 


47.12 46.75 
48.07 48.34 48.15 50.26 


46.80 47.12 46.67 47.43 46.44 46.64 


50.12 46.34 46.94 48.36 


H. caribaea red 


41.90 42.01 41.93 43.09 41.47 41.69 39.78 40.57 
39.63 42.18 40.66 37.87 39.16 37.40 38.20 38.07 
38.10 37.97 38.79 38.23 38.87 37.78 38.01 


H. caribaea yellow 


36.78 37.02 36.52 36.11 36.03 35.45 38.13 37.10 
35.17 36.82 36.66 35.68 36.03 34.57 34.63 


Multiple choice: Select the best answer for Exercises 107 
to 110. 


107. Ifa distribution is skewed to the right with no outliers, 


(a) 


(d) mean > median. 
(e) We can’t tell without 
examining the data. 


mean < median. 
mean ~ median. 


mean = median. 


108. The scores on a statistics test had a mean of 81 and 


a standard deviation of 9. One student was absent 

on the test day, and his score wasn’t included in the 
calculation. If his score of 84 was added to the distri- 
bution of scores, what would happen to the mean and 
standard deviation? 


Mean will increase, and standard deviation will 
increase. 

Mean will increase, and standard deviation will 
decrease. 

Mean will increase, and standard deviation will stay 
the same. 

Mean will decrease, and standard deviation will 
increase. 


Mean will decrease, and standard deviation will 
decrease. 


109. ‘The stemplot shows the number of home runs hit by 


(a) 173 


each of the 30 Major League Baseball teams in 2011. 
Home run totals above what value should be consid- 
ered outliers? 


oe) 
Xe) 


Key: 14/8 is a 
team with 148 
home runs. 
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110. Which of the following boxplots best matches the 


distribution shown in the histogram? 


LOR 
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Exercises 111 and 112 refer to the following setting. 


We used CensusAtSchool’s “Random Data Selector” to 
choose a sample of 50 Canadian students who completed a 
survey in a recent year. 


How tall are you? (1.2) Here are the students’ heights 
(in centimeters). 


166.5 170 178 163 150.5 169 173 169 171 166 
190 183 178 161 #171 170 191 168.5 178.5 173 
175 «160.5 166 164 163 174 160 174 182 167 
166 #170 170 181 171.5 160 178 157 165 187 
168 157.5 145.5 156 182 168.5 177 162.5 160.5 185.5 


Make an appropriate graph to display these data. 
Describe the shape, center, and spread of the distribu- 
tion. Are there any outliers? 


112. Let’s chat (1.1) The bar graph displays data on 
students’ responses to the question “Which of these 
methods do you most often use to communicate with 


your friends?” 
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Method of communication 


Would it be appropriate to make a pie chart for these 
data? Why or why not? 

Jerry says that he would describe this bar graph as 
skewed to the right. Explain why Jerry is wrong. 


. Success in college (1.1) ‘The 2007 Freshman Survey 


asked first-year college students about their “habits of 
mind” — specific behaviors that college faculty have 
identified as being important for student success. One 
question asked students, “How often in the past year 
did you revise your papers to improve your writing?” 
Another asked, “How often in the past year did you 
seek feedback on your academic work?” The figure is 
a bar graph comparing male and female responses to 
these two questions. *° 


[ Male [fj Female 
100 4 
80 4 
60 4 54.9% 


% “Frequently” 


Seek feedback 
on work 


Revise papers to 
improve writing 


What does the graph tell us about the habits of mind 


of male and female college freshmen? 
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Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


Using data from the 2010 census, a random sample of 348 
US. residents aged 18 and older was selected. Among the 
variables recorded were gender (male or female), housing sta- 
tus (rent or own), and marital status (married or not married). 

The two-way table below summarizes the relationship 
between gender and housing status. 


Own 
Rent 
Total 


What percent of males in the sample own their 
home? 

Make a graph to compare the distribution of hous- 
ing status for males and females. 


Introduction: Data Analysis: Making Sense of Data 


In this brief section, you learned several fundamental 
concepts that will be important throughout the course: 
the idea of a distribution and the distinction between 
quantitative and categorical variables. You also learned a 
strategy for exploring data: 


e Begin by examining each variable by itself. ‘Then move 
on to study relationships between variables. 


e Start with a graph or graphs. Then add numerical 
summaries. 


Section 1.1: Analyzing Categorical Data 


In this section, you learned how to display the distribution of 
a single categorical variable with pie charts and bar graphs 


(c) Using your graph from part (b), describe the rela- 
tionship between gender and housing status. 

(d) ‘The two-way table below summarizes the relation- 
ship between marital status and housing status. 


Married Not Married Total 
Own 254 
Rent 94 
Total 348 


For the members of the sample, is the relationship between 
marital status and housing status stronger or weaker than 
the relationship between gender and housing status that 
you described in part (c)? Justify your choice using the data 
provided in the two-way tables. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 


and what to look for when describing these displays. Re- 
member to properly label your graphs! Poor labeling is an 
easy way to lose points on the AP® exam. You should also be 
able to recognize misleading graphs and be careful to avoid 
making misleading graphs yourself. 

Next, you learned how to investigate the association 
between two categorical variables. Using a two-way table, 
you learned how to calculate and display marginal and 
conditional distributions. Graphing and comparing con- 
ditional distributions allow you to look for an association 
between the variables. If there is no association between 
the two variables, graphs of the conditional distributions 
will look the same. However, if differences in the con- 
ditional distributions do exist, there is an association be- 
tween the variables. 


Section 1.2: Displaying Quantitative Data with Graphs 


In this section, you learned how to create three differ- 
ent types of graphs for a quantitative variable: dotplots, 
stemplots, and histograms. Each of the graphs has dis- 
tinct benefits, but all of them are good tools for examin- 
ing the distribution of a quantitative variable. Dotplots 
and stemplots are handy for small sets of data. Histograms 
are the best choice when there are a large number of 
observations. On the AP® exam, you will be expected to 
create each of these types of graphs, label them properly, 
and comment on their characteristics. 

When you are describing the distribution of a quanti- 
tative variable, you should look at its graph for the overall 
pattern (shape, center, spread) and striking departures 
from that pattern (outliers). Use the acronym SOCS 
(shape, outliers, center, spread) to help remember these 
four characteristics. Likewise, when comparing distri- 
butions, you should include explicit comparison words 
such as “is greater than” or “is approximately the same 
as.” When asked to compare distributions, a very com- 
mon mistake on the AP® exam is describing the charac- 
teristics of each distribution separately without making 
these explicit comparisons. 


What Did You Learn? 


Section 1.3: Describing Quantitative Data with Numbers 


To measure the center of a distribution of quantitative data, 
you learned how to calculate the mean and the median of a 
distribution. You also learned that the median is a resistant 
measure of center but the mean isn’t resistant because it can 
be greatly affected by skewness or outliers. 

To measure the spread of a distribution of quantitative 
data, you learned how to calculate the range, interquartile 
range, and standard deviation. The interquartile range (IOR) 
is a resistant measure of spread because it ignores the upper 
25% and lower 25% of the distribution, but the range isn’t 
resistant because it uses only the minimum and maximum 
value. The standard deviation is the most commonly used 
measure of spread and approximates the typical distance of a 
value in the data set from the mean. The standard deviation 
is not resistant— it is heavily affected by extreme values. 

‘To identify outliers in a distribution of quantitative data, 
you learned the 1.5 X IQR rule. You also learned that boxplots 
are a great way to visually summarize the distribution of quan- 
titative data. Boxplots are helpful for comparing distributions 
because they make it easy to compare both center (median) 
and spread (range, JOR). Yet boxplots aren’t as useful for dis- 
playing the shape of a distribution because they do not display 
modes, clusters, gaps, and other interesting features. 


Learning Objective Section Related Example Relevant Chapter 
on Page Review Exercise(s) 

Identify the individuals and variables in a set of data. Intro 3 R11 

Classify variables as categorical or quantitative. Intro 3 R11 

Display categorical data with a bar graph. Decide whether it 

would be appropriate to make a pie chart. el R1.2,R1.3 

Identify what makes some graphs of categorical data deceptive. ileal 10 R1.3 

Calculate and display the marginal distribution of a categorical 

variable from a two-way table. dell Is R1.4 

Calculate and display the conditional distribution of a categorical 

variable for a particular value of the other categorical variable in 

a two-way table. ale 15 R1.4 

Describe the association between two categorical variables by 

comparing appropriate conditional distributions. ale ali Rie 

Dotplots: 25 

Make and interpret dotplots and stemplots of quantitative data. 12 Stemplots: 31 R1.6 

Describe the overall pattern (shape, center, and spread) of a 

distribution and identify any major departures from the pattern 

(outliers). 12 Dotplots: 26 R1.6, R1.9 

Identify the shape of a distribution from a graph as roughly 

symmetric or skewed. 12 28 R1.6, R1.7, R1.8, R1.9 
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What Did You Learn? (continued) 


Learning Objective 


Section 


Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Make and interpret histograms of quantitative data. 


33 R1.7, R1.8 


Compare distributions of quantitative data using dotplots, 
stemplots, or histograms. 


Calculate measures of center (mean, median). 


30 R1.8, R1.10 


Mean: 49 
Median: 52 R1.6 


Calculate and interpret measures of spread (range, /QR, 
standard deviation). 


IQR: 55 
Std. dev.: 60 R1.9 


Choose the most appropriate measure of center and spread 
in a given setting. 


Identify outliers using the 1.5 * /QR rule. 


65 R1.7 
56 R1.6, R1.7, R1.9 


Make and interpret boxplots of quantitative data. 


Use appropriate graphs and numerical summaries to compare 
distributions of quantitative variables. 


of R1.7 


65 R1.8, R1.10 


Chapter 1 Chapter Review Exercises 


These exercises are designed to help you review the important 
ideas and methods of the chapter. 


R1.1 Hit movies According to the Internet Movie Data- 
base, Avatar is tops based on box office sales world- 
wide. ‘The following table displays data on several 
popular movies.*” 


Pen ee R1.3 


Avatar 2009 PG-13 = 162 Action 2,781,505,847 
Titanic 1997 PG-13 194 Drama _1,835,300,000 
Harry Potter and 

the Deathly 

Hallows: Part 2 2011 PG-13 130 Fantasy § 1,327,655,619 
Transformers: 

Dark of the Moon =.2011_—s- PG-13 154 Action 1,123,146,996 
The Lord of the 

Rings: The Return 

of the King 2003 PG-13 201 Action 1,119,929,521 
Pirates of the 

Caribbean: Dead 

Man's Chest 2006 PG-13 151 Action 1,065,896,541 
Toy Story 3 2010 G 103 Animation 1,062,984,497 


(a) What individuals does this data set describe? 
(b) Clearly identify each of the variables. Which are 


quantitative? 
(c) Describe the individual in the highlighted row. 


R1.2 Movie ratings ‘The movie rating system we use today 


was first established on November 1, 1968. Back 
then, the possible ratings were G, PG, R, and X. In 
1984, the PG-13 rating was created. And in 1990, 
NC-17 replaced the X rating. Here is a summary 

of the ratings assigned to movies between 1968 and 
2000: 8% rated G, 24% rated PG, 10% rated PG-13, 
55% rated R, and 3% rated NC-17.*8 Make an ap- 
propriate graph for displaying these data. 


Vd die without my phone! In a survey of over 2000 
US. teenagers by Harris Interactive, +7% said that 
“their social life would end or be worsened without 
their cell phone.” One survey question asked the 
teens how important it is for their phone to have 
certain features. ‘The figure below displays data on the 
percent who indicated that a particular feature is vital. 


60 5 
so 
40 + 
30 4 
20 4 


104] 


Send/receive Camera Send/receive 


Make/receive 
calls text messages pictures 
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(a) Explain how the graph gives a misleading impression. (a) Make a histogram of the data and describe its main 
(b) Would it be appropriate to make a pie chart to features. Does it show the expected right skew? 

display these data? Why or why not? (b) Now make a boxplot of the data. Be sure to check 
(c) Make a graph of the data that isn’t misleading. for outliers. 


(c) Which measure of center and spread would you 
use to summarize the distribution—the mean and 
standard deviation or the median and IOR? Justify 
your answer. 


R1.8 Household incomes Rich and poor households differ 


R1.4 Facebook and age Is there a relationship between 
Facebook use and age among college students? ‘The 
following two-way table displays data for the 219 
students who responded to the survey.” 


Age in ways that go beyond income. Following are histo- 
Facebook Younger Middle Older grams that compare the distributions of household size 
user? (18-22) (23-27) (28 and up) (number of people) for low-income and high-income 
Yes 78 49 21 households.’? Low-income households had annual in- 
No 4 m1 46 comes less than $15,000, and high-income households 


had annual incomes of at least $100,000. 
(a) What percent of the students who responded were 


Facebook users? Is this percent part of a marginal eo 
distribution or a conditional distribution? Explain. 50 
(b) What percent of the younger students in the sample 
were Facebook users? What percent of the Facebook ee 
users in the sample were younger students? 3 30 
R1.5 Facebook and age Use the data in the previous a a 
exercise to determine whether there is an associa- 
tion between Facebook use and age. Give appropri- 10 
ate graphical and numerical evidence to support 
your answer. yi Lg 2s es 
R1.6 Density of the earth In 1798, the English scientist Household size, low income 
Henry Cavendish measured the density of the earth ee 


several times by careful work with a torsion bal- 
ance. lhe variable recorded was the density of the 50 
earth as a multiple of the density of water. Here are 


Cavendish’s 29 measurements:”! ee 
————— =| 
5.50 5.61 4.88 5.07 5.26 5.55 5.36 5.29 5.58 5.65 B 30 
5.57 5.53 5.62 5.29 5.44 5.34 5.79 5.10 5.27 5.39 = Pe 
5.42 5.47 5.63 534 546 5.30 5.75 5.68 5.85 
(a) Present these measurements graphically in a stemplot. 10 
(b) Discuss the shape, center, and spread of the distri- é — 
bution. Are there any outliers? L293 2 3 8 7 
(c) What is your estimate of the density of the earth Household size, high income 


based on th ts? Explain. 
aie say eae Sues aes ks (a) About what percent of each group of households 


R1.7 Guinea pig survival times Here are the survival times consisted of two people? 
in days of 72 guinea pigs after they were injected with (b) What are the important differences between these 
infectious bacteria in a medical experiment.” Survival two distributions? What do you think explains these 
times, whether of machines under stress or cancer differences? 
patients after treatment, usually have distributions that 
are skewed to the right. Exercises R1.9 and R1.10 refer to the following setting. Do 


you like to eat tuna? Many people do. Unfortunately, some 


Be ED 88 20 28) ta oh ACO BL ee of the tuna that people eat may contain high levels of mer- 


80 80 81 81 81 682 683 83 84 8B BD cury. Exposure to mercury can be especially hazardous for 
91 92 92 97 99 99 100 100 101 102 102 102 pregnant women and small children. How much mercury 
103 104 107 108 109 113 114 118 121 123 126 128 is safe to consume? ‘The Food and Drug Administration will 


take action (like removing the product from store shelves) 
if the mercury concentration in a six-ounce can of tuna is 


1.00 ppm (parts per million) or higher. 


137 138 139 144 145 147 156 162 174 178 179 184 
191 198 211 214 243 249 329 380 403 511 522 598 
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What is the typical mercury concentration in cans of 
tuna sold in stores? A study conducted by Defenders of 
Wildlife set out to answer this question. Defenders col- 
lected a sample of 164 cans of tuna from stores across the 
United States. They sent the selected cans to a laboratory 
that is often used by the Environmental Protection Agency 
for mercury testing.** 

R1.9 Mercury in tuna A histogram and some computer 
output provide information about the mercury con- 
centration in the sampled cans (in parts per million, 
ppm). 


00 02 04 06 08 10 12 14 16 18 


Descriptive Statistics: Mercury_ppm 


Variable N Mean StDev Min 
Mercury 164 02285 0.300 0. 012 
Variable Q1 Med Q3 Max 


Mercury ORO 1 OF 1810 0.380 IL SOW 


(a) Interpret the standard deviation in context. 


(b) Determine whether there are any outliers. 


RI. 


(c) Describe the shape, center, and spread of the 
distribution. 


10 Mercury in tuna Is there a difference in the 
mercury concentration of light tuna and albacore 
tuna? Use the parallel boxplots and the computer 
output to write a few sentences comparing the two 
distributions. 


Mercury in tuna 


Albacore 


Descriptive Statistics: Mercury_ppm 


Type N Mean StDev Min 
Albacore 20 0.401 ORS 2 @), aLW/@ 
Light 144 0.269 Onsar2 QO. @12 
Type Qi Med Q3 Max 
Albacore 0.293 0.400 0.460 QO. 730) 
Light 0.059 W150 0.347 i. 500 


Chapter 1 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


T1.1 You record the age, marital status, and earned in- 
come of a sample of 1463 women. The number and 
type of variables you have recorded is 


a) 3 quantitative, 0 categorical. 
b) 
(c) 3 quantitative, 1 categorical. 
) 
) 


( 
(b) 4 quantitative, 0 categorical. 


(d) 2 quantitative, 1 categorical. 


(e) 2 quantitative, 2 categorical. 


T1.2 Consumers Union measured the gas mileage in 
miles per gallon of 38 vehicles from the same model 
year on a special test track. The pie chart provides 


(a) 


information about the country of manufacture of 
the model cars tested by Consumers Union. Based 
on the pie chart, we conclude that 


Japanese cars get significantly lower gas mileage than 
cars from other countries. 


US. cars get significantly higher gas mileage than 


cars from other countries. 


Swedish cars get gas mileages that are between those 
of Japanese and U.S. cars. 


cars from France have the lowest gas mileage. 


more than half of the cars in the study were from the 
United States. 
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Ibs 


10 - 


yermar | 5 7 
Sweden } 
/ a 


France Germany Italy Japan Sweden US. 


(d) 
20 
T1.3 Which of the following bar graphs is equivalent to 
the pie chart in Question T'1.2? al 
(a) 
155 


10s 
i i 
10 5 
Fs 
o 


France Germany Italy Japan Sweden US. 
5 — 
(e) None of these. 


called a seismograph, which is designed to be most 
France Germany Italy Japan Sweden USS. sensitive to earthquakes with intensities between 4.0 
and 9.0 on the Richter scale. Measurements of nine 
(b) earthquakes gave the following readings: 
85 


fe T1.4 Earthquake intensities are measured using a device 


4.5 L 1) H 8.7 8.9 6.0 H 52 


205 where L indicates that the earthquake had an inten- 
sity below 4.0 and an H indicates that the earthquake 
is 5 had an intensity above 9.0. The median earthquake 
intensity of the sample is 
Ins (ay ea 
(b) 6.00. 


(c) 6.47. 
| | —_ = (d) 8.70. 


France Germany Italy Japan Sweden Us. (e) Cannot be determined. 
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Questions T1.5 and T1.6 refer to the following setting. In a 
statistics class with 136 students, the professor records how 
much money (in dollars) each student has in his or her pos- 
session during the first class of the semester. ‘The histogram 
shows the data that were collected. 


Frequency 


ro. 
0 10 20 30 40 50 60 70 80 90 100 110 
Amount of money 


T1.5 The percentage of students with less than $10 in 
their possession is closest to 


(a) 30%. (b) 35%. (c) 45%. (d) 60%. (e) 70%. 


T1.6 Which of the following statements about this distri- 
bution is not correct? 


( 
T1.7 Forty students took a statistics examination having 
a maximum of 50 points. The score distribution is 
given in the following stem-and-leaf plot: 


28 

2245 
01333358889 
001356679 
22444466788 
000 


UnWN A] oO 


The third quartile of the score distribution is equal to 
(aio. (bya, (ce) 43. (dd) 32, (6) 23 


T1.8 The mean salary of all female workers is $35,000. The 
mean salary of all male workers is $41,000. What 
must be true about the mean salary of all workers? 

(a) It must be $38,000. 
(b) It must be larger than the median salary. 


(c) It could be any number between $35,000 and 
$41,000. 


(d) It must be larger than $38,000. 
(e) It cannot be larger than $40,000. 


Questions T1.9 and T1.10 refer to the following setting. A survey 
was designed to study how business operations vary according 
to their size. Companies were classified as small, medium, or 
large. Questionnaires were sent to 200 randomly selected busi- 
nesses of each size. Because not all questionnaires in a survey 
of this type are returned, researchers decided to investigate the 
relationship between the response rate and the size of the busi- 
ness. ‘The data are given in the following two-way table: 


Business size 


Response? Small Medium Large 
Yes 125 81 40 
No 5 119 160 


T1.9 What percent of all small companies receiving 
questionnaires responded? 


(a) 12.5% (c) 33.3% (e) 62.5% 
(b) 20.8% (d) 50.8% 
T1.10 Which of the following conclusions seems to be 
supported by the data? 


(a) There are more small companies than large compa- 
nies in the survey. 


(b) Small companies appear to have a higher response 
rate than medium or big companies. 


(c) Exactly the same number of companies responded 
as didn’t respond. 


(d) Overall, more than half of companies responded to 
the survey. 


(e) If we combined the medium and large companies, 
then their response rate would be equal to that of 
the small companies. 


T1.11 An experiment was conducted to investigate the effect 
of a new weed killer to prevent weed growth in onion 
crops. ‘Iwo chemicals were used: the standard weed 
killer (C) and the new chemical (W). Both chemicals 
were tested at high and low concentrations on a total 
of 50 test plots. The percent of weeds that grew in 
each plot was recorded. Here are some boxplots of the 
results. Which of the following is not a correct state- 
ment about the results of this experiment? 


W-—low conc. 
C—low conc. 
W—high conc. 


C—high conc. 


Percent of weeds that grew 


(a) At both high and low concentrations, the new chemical 
(W) gives better weed control than the standard weed 
killer (C). 

(b) Fewer weeds grew at higher concentrations of both 
chemicals. 


(c) The results for the standard weed killer (C) are less 
variable than those for the new chemical (W). 
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(d) High and low concentrations of either chemical have 
approximately the same effects on weed growth. 

(e) Some of the results for the low concentration of weed 
killer W show fewer weeds growing than some of the 
results for the high concentration of W. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T1.12 You are interested in how much time students 
spend on the Internet each day. Here are data on 
the time spent on the Internet (in minutes) for a 
particular day reported by a random sample of 30 
students at a large high school: 


GU A) Oh wi) 7H A 2 all 32. 3 
42 43 44 45 46 47 48 48 50a 


7 Tf i it} Te & Gy 8 i185 Sil 


(a) Construct a histogram of these data. 
(b) Are there any outliers? Justify your answer. 


(c) Would it be better to use the mean and standard 
deviation or the median and JOR to describe the 
center and spread of this distribution? Why? 


T1.13 A study among the Pima Indians of Arizona investi- 
gated the relationship between a mother’s diabetic 
status and the appearance of birth defects in her chil- 
dren. The results appear in the two-way table below. 


Diabetic Status 
Birth Defects Nondiabetic Prediabetic Diabetic Total 
None 754 362 38 
One or more 31 13 9 
Total 


(a) Fill in the row and column totals in the margins of 
the table. 

(b) Compute (in percents) the conditional distributions 
of birth defects for each diabetic status. 

(c) Display the conditional distributions in a graph. 
Don’t forget to label your graph completely. 

(d) Do these data give evidence of an association between 
diabetic status and birth defects? Justify your answer. 


T1.14 The back-to-back stemplot shows the lifetimes of 
several Brand X and Brand Y batteries. 


Brand X 


Brand Y 


1 
il || @ 
2 || 2 
2 || & 
Al@ || & 
MTS || 3 
3221 | 4 | 223334 
4 | 56889 
4);5]0 
yal |S) 


Key: 4|2 represents 
420-429 hours. 


(a) What is the longest that any battery lasted? 


(b) Give a reason someone might prefer a Brand 
X battery. 


(c) Give a reason someone might prefer a Brand 
Y battery. 


T1.15 During the early part of the 1994 baseball season, 
many fans and players noticed that the number of 
home runs being hit seemed unusually large. Here 
are the data on the number of home runs hit by 
American League and National League teams in 
the early part of the 1994 season: 


American League: 35 40 43 49 51 54 57 58 58 64 68 68 75 77 
National League: 29 31 42 46 47 48 48 53 55 55 55 63 63 67 


Compare the distributions of home runs for the two 
leagues graphically and numerically. Write a few 
sentences summarizing your findings. 


Chapter 


Introduction 


Section 2.1 


Section 2.2 


Free Response 
AP® Problem, YAY! 


Chapter 2 Review 
Chapter 2 Review Exercises 


Chapter 2 AP® Statistics 
Practice Test 


Modeling Distributions 
of Data 


Do You Sudoku? 


The sudoku craze has officially swept the globe. Here’s what Will Shortz, crossword puzzle editor for the 


New York Times, said about sudoku: 


As humans we seem to have an innate desire to fill up 
empty spaces. This might explain part of the appeal of su- 
doku, the new international craze, with its empty squares 
to be filled with digits. Since April 2005, when sudoku was 
introduced to the United States in ‘The New York Post, 
more than half the leading American newspapers have 
begun printing one or more sudoku a day. No puzzle has 
had such a fast introduction in newspapers since the cross- 
word craze of 1924-25.! 


Since then, millions of people have made sudoku part of 
their daily routines. 


One of the authors played an online game of sudoku at 
www.websudoku.com. The graph provides information 
about how well he did. (His time is marked with an arrow.) 


Your time: 3 minutes, 19 seconds 


0 min | 30 mins 
Rank: Top 19% 


Easy level average time: 5 minutes, 6 seconds. 
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fe Introduction 


Suppose Jenny earns an 86 (out of 100) on her next statistics test. Should she be 
satisfied or disappointed with her performance? That depends on how her score 
compares with the scores of the other students who took the test. If 86 is the high- 
est score, Jenny might be very pleased. Maybe her teacher will “curve” the grades 
so that Jenny’s 86 becomes an “A.” But if Jenny’s 86 falls below the “average” in 
the class, she may not be so happy. 

Section 2.1 focuses on describing the location of an individual within a distri- 
bution. We begin by discussing a familiar measure of position: percentiles. Next, 
we introduce a new type of graph that is useful for displaying percentiles. Then 
we consider another way to describe an individual’s position that is based on the 
mean and standard deviation. In the process, we examine the effects of transform- 
ing data on the shape, center, and spread of a distribution. 

Sometimes it is helpful to use graphical models called density curves to describe 
the location of individuals within a distribution, rather than relying on actual data 
values. Such models are especially helpful when data fall in a bell-shaped pattern 
called a Normal distribution. Section 2.2 examines the properties of Normal dis- 
tributions and shows you how to perform useful calculations with them. 


ACTIVITY | Where do | stand? 


MATERIALS: In this Activity, you and your classmates will explore ways to describe where you 
Masking tape to mark stand (literally!) within a distribution. 


number line scale ‘ ; : 5 
1. Your teacher will mark out a number line on the floor with a scale running 


from about 58 to 78 inches. 
2. Make a human dotplot. Each member of the class should stand at the appro- 
priate location along the number line scale based on height (to the nearest inch). 
3. Your teacher will make a copy of the dotplot on the board for your reference. 
4. What percent of the students in the class have heights less than yours? This is 
your percentile in the distribution of heights. 
5. Work with a partner to calculate the mean and standard deviation of the 
class’s height distribution from the dotplot. Confirm these values with your 
classmates. 
6. Where does your height fall in relation to the mean: above or below? How 
far above or below the mean is it? How many standard deviations above or 
below the mean is it? This last number is the z-score corresponding to your 
height. 
7. Class discussion: What would happen to the class’s height distribution if 
} . ‘) } you converted each data value from inches to centimeters? (‘There are 2.54 
i} ii ) Se centimeters in | inch.) How would this change of units affect the measures 
a of center, spread, and location (percentile and z-score) that you calculated? 


Se 


Want to know more about where you stand—in terms of height, weight, or 
even body mass index? Do a Web search for “Clinical Growth Charts” at the 
National Center for Health Statistics site, www.cdc.gov/nchs. 
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[at Describing Location ina 


Distribution 


WHAT YOU WILL LEARN By the end of the section, you should be able to: 


e Find and interpret the percentile of an individual value e Find and interpret the standardized score (z-score) of 
within a distribution of data. an individual value within a distribution of data. 


e Estimate percentiles and individual values using a Describe the effect of adding, subtracting, multi- 
cumulative relative frequency graph. plying by, or dividing by a constant on the shape, 
center, and spread of a distribution of data. 


Here are the scores of all 25 students in Mr. Pryor’s statistics class on their first test: 


79 81 B80 77 73 83 74 93 78 80 75 67 7 
77 83 86 90 79 85 63 89 84 82 77 72 


The bold score is Jenny’s 86. How did she perform on this test relative to her 
classmates? 

The stemplot displays this distribution of test scores. Notice that the distribu- 
tion is roughly symmetric with no apparent outliers. From the stemplot, we can 
see that Jenny did better than all but three students in the class. 
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Key: 7|2 is a student who 
scored 72 on the test 


Measuring Position: Percentiles 


One way to describe Jenny’s location in the distribution of test scores is to tell what 
percent of students in the class earned scores that were below Jenny’s score. That is, 
we can calculate Jenny’s percentile. 


Using the stemplot, we see that Jenny’s 86 places her fourth from the top of the 
class. Because 21 of the 25 observations (84%) are below her score, Jenny is at the 
84th percentile in the class’s test score distribution. 
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Mr. Pryor’s First Test 


Finding percentiles 


PROBLEM: Use the scores on Mr. Pryor’s first statistics test to find the percentiles for the 
following students: 


(a) Norman, who earned a 72. 

(b) Katie, who scored 93. 

(c) The two students who earned scores of 80. 
SOLUTION: 


(a) Only 1 of the 25 scores in the class is below Norman's 72. His percentile is computed as 
follows: 1/25 = 0.04, or 4%. So Norman scored at the 4th percentile on this test. 

(b) Katie's 93 puts her at the 96th percentile, because 24 out of 25 test scores fall below her 
result. 


(c) Two students scored an 80 on Mr. Pryor’s first test. Because 12 of the 25 scores in the class 
were less than 80, these two students are at the 48th percentile. 


For Practice Try Exercise 


Note: Some people define the pth percentile of a distribution as the value with 
p percent of observations less than or equal to it. Using this alternative definition 
of percentile, it is possible for an individual to fall at the 100th percentile. If we 
used this definition, the two students in part (c) of the example would fall at the 
56th percentile (14 of 25 scores were less than or equal to 80). Of course, because 
80 is the median score, it is also possible to think of it as being the 50th percen- 
tile. Calculating percentiles is not an exact science, especially with small data 
sets! We'll stick with the definition of percentile we gave earlier for consistency. 


Cumulative Relative Frequency Graphs 


Age Frequency There are some interesting graphs that can be made with percentiles. One of the 
40-44 9 most common graphs starts with a frequency table for a quantitative variable. For 
instance, the frequency table in the margin summarizes the ages of the first 44 


45-49 d US. presidents when they took office. 

50-54 13 Let’s expand this table to include columns for relative frequency, cumulative 
55-59 12 frequency, and cumulative relative frequency. 

60-64 7 e ‘To get the values in the relative frequency column, divide the count in each 
65-69 3 class by 44, the total number of presidents. Multiply by 100 to convert to a 


percent. 
e To fill in the cumulative frequency column, add the counts in the frequency 
column for the current class and all classes with smaller values of the variable. 
e For the cumulative relative frequency column, divide the entries in the cumu- 


lative frequency column by 44, the total number of individuals. Multiply by 
100 to convert to a percent. 
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Here is the original frequency table with the relative frequency, cumulative fre- 
quency, and cumulative relative frequency columns added. 


Relative Cumulative Cumulative relative 

Age Frequency frequency frequency frequency 

40-44 2 2/44 = 0.045, or 4.5% 2 2/44 = 0.045, or 4.5% 
45-49 7 7/44 = 0.159, or 15.9% 9 9/44 = 0.205, or 20.5% 
50-54 13 13/44 = 0.295, or 29.5% 22 22/44 = 0.500, or 50.0% 
55-59 12 12/44 = 0.273, or 27.3% 34 34/44 = 0.773, or 77.3% 
60-64 7 7/44 = 0.159, or 15.9% 41 41/44 = 0.932, or 93.2% 
65-69 3 3/44 = 0.068, or 6.8% 44 44/44 = 1.000, or 100% 


Some people refer to cumulative 
relative frequency graphs as “ogives” 
(pronounced “o-jives”). 


To make a cumulative relative frequency graph, we plot a point correspond- 
ing to the cumulative relative frequency in each class at the smallest value of the 
next class. For example, for the 40 to 44 class, we plot a point at a height of 4.5% 
above the age value of 45. This means that 4.5% of presidents were inaugurated 

before they were 45 years old. (In other words, age 45 is the 4.5th 
100 — percentile of the inauguration age distribution.) 

It is customary to start a cumulative relative frequency graph with 
a point at a height of 0% at the smallest value of the first class (in this 
case, 40). The last point we plot should be at a height of 100%. We 
connect consecutive points with a line segment to form the graph. 
40 + Figure 2.1 shows the completed cumulative relative frequency graph. 

Here’s an example that shows how to interpret a cumulative 
relative frequency graph. 


oo 
5 
! 


Cumulative relative frequency (%) 
3 
| 


40 45 50 55 60 65 70 FIGURE 2.1 Cumulative relative frequency graph for 
Age at inauguration the ages of U.S. presidents at inauguration. 


Age at Inauguration 


Interpreting a cumulative relative frequency graph 


What can we learn from Figure 2.1? The graph grows very gradually at first because 
few presidents were inaugurated when they were in their 40s. Then the graph gets 
very steep beginning at age 50. Why? Because most U.S. presidents were in their 50s 
when they were inaugurated. The rapid growth in the graph slows at age 60. 


Suppose we had started with only the graph in Figure 2.1, without any of the 
information in our original frequency table. Could we figure out what percent of 
presidents were between 55 and 59 years old at their inaugurations? Sure. Because 
the point at age 60 has a cumulative relative frequency of about 77%, we know 
that about 77% of presidents were inaugurated before they were 60 years old. 
Similarly, the point at age 55 tells us that about 50% of presidents were younger 
than 55 at inauguration. As a result, we’d estimate that about 77% — 50% = 27% 
of U.S. presidents were between 55 and 59 when they were inaugurated. 
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A cumulative relative frequency graph can be used to describe the position of 
an individual within a distribution or to locate a specified percentile of the distri- 
) bution, as the following example illustrates. 


Ages of U.S. Presidents 


Interpreting cumulative relative frequency graphs 


PROBLEM: Use the graph in Figure 2.1 on the previous page to help you answer each question. 
(a) Was Barack Obama, who was first inaugurated at age 47, unusually young? 

(b) Estimate and interpret the 65th percentile of the distribution. 

SOLUTION: 

(a) To find President Obama’s location in the distribution, we draw a vertical line up from his age (47) 
on the horizontal axis until it meets the graphed line. Then we draw a horizontal line from this point 

of intersection to the vertical axis. Based on Figure 2.2(a), we would estimate that Barack Obama's 
inauguration age places him at the 11% cumulative relative frequency mark. That is, he’s at the 11th 


percentile of the distribution. In other words, about 11% of all U.S. presidents were younger than 
Barack Obama when they were inaugurated and about 89% were older. 


Cumulative relative frequency (%) 
Cumulative relative frequency (%) 


T— 5g 1 
50 55 60 


Age at inauguration Age at inauguration 
(a) (b) 


FIGURE 2.2 The cumulative relative frequency graph of presidents’ ages at inauguration is used to (a) locate 
President Obama within the distribution and (b) determine the 65th percentile, which is about 58 years. 


(b) The 65th percentile of the distribution is the age with cumulative relative frequency 65%. To 
find this value, draw a horizontal line across from the vertical axis at a height of 65% until it meets 
the graphed line. From the point of intersection, draw a vertical line down to the horizontal axis. In 
Figure 2.2(b), the value on the horizontal axis is about 58. So about 65% of all U.S. presidents were 
younger than 56 when they took office. 


For Practice Try Exercise 9 | 


THINK Percentiles and quartiles: Have you made the connection between per- 
centiles and the quartiles from Chapter 1? Earlier, we noted that the median (second 
ABOUT IT quartile) corresponds to the 50th percentile. What about the first quartile, Q;? It’s at 
the median of the lower half of the ordered data, which puts it about one-fourth of 
the way through the distribution. In other words, Q, is roughly the 25th percentile. 
By similar reasoning, Q3 is approximately the 75th percentile of the distribution. 
$a 
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ae 


CHECK YOUR UNDERSTANDING 


1. Multiple choice: Select the best answer. Mark receives a score report detailing his per- 
formance on a statewide test. On the math section, Mark earned a raw score of 39, which 
placed him at the 68th percentile. This means that 


(a) Mark did better than about 39% of the students who took the test. 
(b) Mark did worse than about 39% of the students who took the test. 
(c) Mark did better than about 68% of the students who took the test. 
(d) Mark did worse than about 68% of the students who took the test. 
(e) Mark got fewer than half of the questions correct on this test. 


Rt Ne Se 


2. Mrs. Munson is concerned about how her daughter’s height 
and weight compare with those of other girls of the same age. 
She uses an online calculator to determine that her daughter is at 
the 87th percentile for weight and the 67th percentile for height. 
Explain to Mrs. Munson what this means. 

Questions 3 and 4 relate to the following setting. The graph dis- 
plays the cumulative relative frequency of the lengths of phone calls 
made from the mathematics department office at Gabalot High last 
month. 


Cumulative relative frequency (%) 


0 5 10 15 20 


Call length (minutes) 
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Key: 7/2 is a student who 
scored 72 on the test 


The relationship between the mean and 
the median is about what you’d expect 
in this fairly symmetric distribution. 


25 


1 3. About what percent of calls lasted less than 30 minutes? 30 
30,35 40, 45s minutes or more? 


4. Estimate Q), Q3, and the IOR of the distribution. 


Measuring Position: z-Scores 


Let’s return to the data from Mr. Pryor’s first statistics test, which are shown in 
the stemplot. Figure 2.3 provides numerical summaries from Minitab for these 
data. Where does Jenny’s score of 86 fall relative to the mean of this distribution? 
Because the mean score for the class is 80, we can see that Jenny’s score is “above 
average.” But how much above average is it? 

We can describe Jenny’s location in the distribution of her class’s test scores 
by telling how many standard deviations above or below the mean her score is. 
Because the mean is 80 and the standard deviation is about 6, Jenny’s score of 86 
is about one standard deviation above the mean. 

Converting observations like this from original values to standard deviation 
units is known as standardizing. To standardize a value, subtract the mean of the 
distribution and then divide the difference by the standard deviation. 


Minitab 


Descriptive Statistics: Test 1 scores 


Variable N Mean Median TrMean StDev SE Mean 
Test 1 scores 25 80.00 80.00 80.00 6.07 £22 


Variable Minimum Maximum Ql Q3 
Test 1 scores 67.00 93.00 76.00 83.50 


FIGURE 2.3 Minitab output for the scores of Mr. Pryor’s students on their first statistics test. 
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De 


DEFINITION: Standardized score (z-score) 


If xis an observation from a distribution that has known mean and standard devia- 
tion, the standardized score for x is 
= X — mean 
7 standard deviation 
A standardized score is often called a z-score. 


A zscore tells us how many standard deviations from the mean an observa- 
tion falls, and in what direction. Observations larger than the mean have positive 
z-scores. Observations smaller than the mean have negative z-scores. For example, 
Jenny’s score on the test was x = 86. Her standardized score (z-score) is 


x — mean _ 86-80 


a standard deviation 6.07 ollie 


That is, Jenny’s test score is 0.99 standard deviations above the mean score of the 
class. 


Mr. Pryor’s First Test, Again 


Finding and interpreting z-scores 


PROBLEM: Use Figure 2.3 on the previous page to find the standardized scores (z-scores) for 
each of the following students in Mr. Pryor’s class. Interpret each value in context. 


(a) Katie, who scored 93. 
(b) Norman, who earned a 72. 
SOLUTION: 


(a) Katie’s 93 was the highest score in the class. Her corresponding z-score is 


93 — 80 
BS = BAC 
6.07 
In other words, Katie’s result is 2.14 standard deviations above the mean score for this test. 


(b) For Norman's 72, his standardized score is 


72-80 _ 


oy 
6.07 


Norman’s score is 1.32 standard deviations below the class mean of 80. 


For Practice Try Exercise 1B) 


We can also use z-scores to compare the position of individuals in different 
distributions, as the following example illustrates. 
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Jenny Takes Another Test 


Using z-scores for comparisons 


The day after receiving her statistics test result of 86 from Mr. Pryor, Jenny earned 
an 82 on Mr. Goldstone’s chemistry test. At first, she was disappointed. Then Mr. 
Goldstone told the class that the distribution of scores was fairly symmetric with a 
mean of 76 and a standard deviation of 4. 

PROBLEM: Onwhich test did Jenny perform better relative to the class? Justify your answer. 
SOLUTION: Jenny's z-score for her chemistry test result is 


62 —76 
Zz = ——— 
4 


= 1.50 


Her 82 in chemistry was 1.5 standard deviations above the mean score for the class. Because she 
scored only 0.99 standard deviations above the mean on the statistics test, Jenny did better rela- 
tive to the class in chemistry. 


For Practice Try Exercise 


We often standardize observations to express them on a common scale. We 
might, for example, compare the heights of two children of different ages by cal- 
culating their z-scores. At age 2, Jordan is 89 centimeters (cm) tall. Her height 
puts her at a z-score of 0.5; that is, she is one-half standard deviation above the 
mean height of 2-year-old girls. Zayne’s height at age 3 is 101 cm, which yields a 
z-score of 1. In other words, he is one standard deviation above the mean height 
of 3-year-old boys. So Zayne is taller relative to boys his age than Jordan is relative 
to girls her age. The standardized heights tell us where each child stands 
(pun intended!) in the distribution for his or her age group. 


CHECK YOUR UNDERSTANDING 

Mrs. Navard’s statistics class has just completed the first three steps of the “Where Do | 
Stand?” Activity (page 8+). The figure below shows a dotplot of the class’s height distribu- 
tion, along with summary statistics from computer output. 


1. Lynette, a student in the class, is 65 inches tall. Find and interpret her z-score. 
2. Another student in the class, Brent, is 74 inches tall. How tall 


P : ; 2s. il ie .S tt is Brent compared with the rest of the class? Give appropriate 
numerical evidence to support your answer. 
6 62 64 «426 6 70 #72 74 
Height dinches) 3. Brent is a member of the school S basketball team. The mean 
height of the players on the team is 76 inches. Brent’s height 
Variable on x 5s, Min Q, Med Q; Max translates to a z-score of —0.85 in the team’s height distribution. 


Height 25 67 429 60 63 66 69 75 What is the standard deviation of the team members’ heights? 
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FIGURE 2.4 Fathom dotplot and 
summary statistics for Australian 
students’ guesses of the classroom 
width. 


Transforming Data 


To find the standardized score (z-score) for an individual observation, we transform 
this data value by subtracting the mean and dividing the difference by the standard 
deviation. Transforming converts the observation from the original units of 
measurement (inches, for example) to a standardized scale. What effect do these 
kinds of transformations—adding or subtracting; multiplying or dividing—have 
on the shape, center, and spread of the entire distribution? Let’s investigate using 
an interesting data set from “down under.” 

Soon after the metric system was introduced in Australia, a group of students 
was asked to guess the width of their classroom to the nearest meter. Here are their 
guesses in order from lowest to highest:? 


8 9 10 10 10 10 10 10 11 11 1 12 
Wee Bee eb bb ob & 
15 15 16 16 16 17 17 #17 «#17 18 18 20 22 
25 27 35 38 40 


Figure 2.4 includes a dotplot of the data and some numerical summaries. 


Guess_m 


n > s, Min Q, Med Q; Max IQR Range 
Guess 44 16.02 7.14 8 11 15 L7 40 6 32 


Let’s practice what we learned in Chapter | and describe what we see. 


Shape: The distribution of guesses appears skewed to the right and bimodal, with 
peaks at 10 and 15 meters. 


Center: The median guess was 15 meters and the mean guess was about 16 meters. 
Due to the clear skewness and potential outliers, the median is a better choice for 
summarizing the “typical” guess. 


Spread: Because Q; = 11, about 25% of the students estimated the width of the 
room to be fewer than 11 meters. The 75th percentile of the distribution is at about 
Q; = 17. The IQR of 6 meters describes the spread of the middle 50% of stu- 
dents’ guesses. The standard deviation tells us that the typical distance of students’ 
guesses from the mean was about 7 meters. Because s, is not resistant to extreme 
values, we prefer the IQR to describe the variability of this distribution. 


Outliers: By the 1.5 X IOR rule, values greater than 17 + 9 = 26 meters or less 
than 11 — 9 = 2 meters are identified as outliers. So the four highest guesses — 
which are 27, 35, 38, and 40 meters —are outliers. 
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Effect of adding or subtracting a constant: By now, you're probably 
wondering what the actual width of the room was. In fact, it was 13 meters wide. 
How close were students’ guesses? The student who guessed 8 meters was too low 
by 5 meters. The student who guessed 40 meters was too high by 27 meters (and 
probably needs to study the metric system more carefully). We can examine the 
distribution of students’ guessing errors by defining a new variable as follows: 


error = guess — 13 


That is, we'll subtract 13 from each observation in the data set. Try to predict what 
the shape, center, and spread of this new distribution will be. Refer to Figure 2.4 
as needed. 


Estimating Room Width 


Effect of subtracting a constant 


Let’s see how accurate your predictions were (you did make predictions, right?). 
Figure 2.5 shows dotplots of students’ original guesses and their errors on the same 
scale. We can see that the original distribution of guesses has been shifted to the left. 
By how much? Because the peak at 15 meters 
—_ in the original graph is located at 2 meters in 
(Dot Piot 89 the error distribution, the original data values 
have been translated 13 units to the left. That 
should make sense: we calculated the errors 
by subtracting the actual room width, 13 me- 
ters, from each student's guess. 


Estimates AU 


From Figure 2.5, it seems clear that sub- 
tracting 13 from each observation did not 
affect the shape or spread of the distribution. 
But this transformation appears to have de- 
creased the center of the distribution by 13 
FIGURE 2.5 Fathom dotplots of students’ original guesses of classroom meters. ‘The summary statistics in the table 
width and the errors in their guesses. below confirm our beliefs. 


E 
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Max 


Q3 IQR Range 
Guess (m) 44 16.02 7.14 8 Ad. 5, 17 40 6 32 
Error (m) 44 3.02 7.14 -5 -2 2 4 27 6 32 


The error distribution is centered at a value that is clearly positive—the median 
error is 2 meters and the mean error is about 3 meters. So the students generally 
tended to overestimate the width of the room. 


As the example shows, subtracting the same positive number from each value 
in a data set shifts the distribution to the left by that number. Adding a posi- 
tive constant to each data value would shift the distribution to the right by that 
constant. 
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Let’s summarize what we’ve learned so far about transforming data. 


EFFECT OF ADDING (OR SUBTRACTING) A CONSTANT 


Adding the same positive number a to (subtracting a from) each observation 


e adds a to (subtracts a from) measures of center and location (mean, 
median, quartiles, percentiles), but 


¢ does not change the shape of the distribution or measures of spread 
(range, IOR, standard deviation). 


Effect of multiplying or dividing by a constant: Because our 
group of Australian students is having some difficulty with the metric system, it 
may not be helpful to tell them that their guesses tended to be about 2 to 3 me- 
ters too high. Let’s convert the error data to feet before we report back to them. 
There are roughly 3.28 feet in a meter. So for the student whose error was —5 
meters, that translates to 

3,28 feet 


—5 meters X ———— = — 16.4 feet 
1 meter 


To change the units of measurement from meters to feet, we multiply each of the 
error values by 3.28. What effect will this have on the shape, center, and spread of 
the distribution? (Go ahead, make some predictions!) 


Estimating Room Width 
Effect of multiplying by a constant 


Figure 2.6 includes dotplots of the students’ guessing errors in meters and feet, 
along with summary statistics from computer software. The shape of the two distri- 
butions is the same—right-skewed and bimodal. However, the centers and spreads 


Dot Plot zs) 


n x Sx Min Qy Med Q3 Max IQR Range 
Error (m) 44 3/02 7.14 = 5 2 2 4 27 6 32 


Error (ft) 44 9.91 23.43 —16.4 —6.56 6.56 13.12 88.56 19.68 104.96 


FIGURE 2.6 Fathom dotplots and numerical summaries of students’ errors guessing the width of 
their classroom in meters and feet. 


It is not common to multiply (or divide) 
each observation in a data set by a 
negative number b. Doing so would 
multiply (or divide) the measures of 
spread by the absolute value of b. 

We can’t have a negative amount of 
variability! Multiplying or dividing by a 
negative number would also affect the 
shape of the distribution as all values 
would be reflected over the y axis. 
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of the two distributions are quite different. The bottom dotplot is centered at a 
value that is to the right of the top dotplot’s center. Also, the bottom dotplot shows 
much greater spread than the top dotplot. 


When the errors were measured in meters, the median was 2 and the mean was 
3.02. For the transformed error data in feet, the median is 6.56 and the mean 
is 9.91. Can you see that the measures of center were multiplied by 3.28? That 
makes sense. If we multiply all the observations by 3.28, then the mean and me- 
dian should also be multiplied by 3.28. 


What about the spread? Multiplying each observation by 3.28 increases the vari- 
ability of the distribution. By how much? You guessed it—by a factor of 3.28. The 
numerical summaries in Figure 2.6 show that the standard deviation, the inter- 
quartile range, and the range have been multiplied by 3.28. 


We can safely tell our group of Australian students that their estimates of the class- 
room’s width tended to be too high by about 6.5 feet. (Notice that we choose not 
to report the mean error, which is affected by the strong skewness and the three 
high outliers.) 


As before, let’s recap what we discovered about the effects of transforming 
data. 


EFFECT OF MULTIPLYING (OR DIVIDING) BY A CONSTANT 


Multiplying (or dividing) each observation by the same positive number b 


e multiplies (divides) measures of center and location (mean, median, 
quartiles, percentiles) by 5, 
¢ multiplies (divides) measures of spread (range, IQR, standard deviation) 


by b, but 
e does not change the shape of the distribution. 


Putting it all together: Adding/subtracting and multiplying/ 
dividing: What happens if we transform a data set by both adding or sub- 
tracting a constant and multiplying or dividing by a constant? For instance, if 
we need to convert temperature data from Celsius to Fahrenheit, we have to 
use the formula °F = 9/5(°C) + 32. That is, we would multiply each of the 
observations by 9/5 and then add 32. As the following example shows, we just 
use the facts about transforming data that we’ve already established. 


Too Cool at the Cabin? 


Analyzing the effects of transformations 


During the winter months, the temperatures at the Starnes’s Colorado cabin 
can stay well below freezing (32°F or 0°C) for weeks at a time. To prevent the 
pipes from freezing, Mrs. Starnes sets the thermostat at 50°F. She also buys a 
digital thermometer that records the indoor temperature each night at midnight. 
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Unfortunately, the thermometer is programmed to measure the temperature in 
degrees Celsius. A dotplot and numerical summaries of the midnight temperature 
readings for a 30-day period are shown below. 


12 14 16 
Temperature (Celsius) 


n Mean StDev Min Q, Median Q3 Max 
Temperature 30 8.43 2.27 3.00 7.00 8.5.0 10.00 14.00 


PROBLEM: Usethe fact that °F = (9/5)°C + 32 to help you answer the following questions. 
(a) Find the mean temperature in degrees Fahrenheit. Does the thermostat setting seem accurate? 


(b) Calculate the standard deviation of the temperature readings in degrees Fahrenheit. Interpret this 
value in context. 


(c) The 93rd percentile of the temperature readings was 12°C. What is the 93rd percentile tem- 
perature in degrees Fahrenheit? 


SOLUTION: 


(a) To convert the temperature measurements from Celsius to Fahrenheit, we multiply each value by 
9/5 and then add 32. Multiplying the observations by 9/5 also multiplies the mean by 9/5. Adding 
32 to each observation increases the mean by 32. So the mean temperature in degrees Fahrenheit is 
(9/5)(8.43) + 32 = 47.17°F. The thermostat doesn’t seem to be very accurate. It is set at 50°F, 
but the mean temperature over the 30-day period is about 47°F. 


(b) Multiplying each observation by 9/5 multiplies the standard deviation by 9/5. However, adding 
32 to each observation doesn't affect the spread. So the standard deviation of the temperature 
measurements in degrees Fahrenheit is (9/5)(2.27) = 4.09°F. This means that the typical distance 
of the temperature readings from the mean is about 4°F. That's a lot of variation! 

(c) Both multiplying by a constant and adding a constant affect the value of the 93rd percentile. To 


find the 93rd percentile in degrees Fahrenheit, we multiply the 93rd percentile in degrees Celsius by 
9/5 and then add 32: (9/5)(12) + 32 = 53.6°F. 


For Practice Try Exercise 


Let’s look at part (c) of the example more closely. The data value of 12°C is 
at the 93rd percentile of the distribution, meaning that 28 of the 30 tempera- 
ture readings are less than 12°C. When we transform the data, 12°C becomes 
53.6°F. The value of 53.6°F is at the 93rd percentile of the transformed distribu- 
tion because 28 of the 30 temperature readings are less than 53.6°F. What have 
we learned? Adding (or subtracting) a constant does not change an individual data 
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value’s location within a distribution. Neither does multiplying or dividing by a 
positive constant. 


THINK Connecting transformations and z-scores: What does all this 
ABOUT IT transformation business have to do with z-scores? ‘To standardize an observation, 
you subtract the mean of the distribution and then divide by the standard devia- 

tion. What if we standardized every observation in a distribution? 

Returning to Mr. Pryor’s statistics test scores, we recall that the distribution was 
roughly symmetric with a mean of 80 and a standard deviation of 6.07. To convert 
the entire class’s test results to z-scores, we would subtract 80 from each observa- 
tion and then divide by 6.07. What effect would these transformations have on the 
shape, center, and spread of the distribution? 


e Shape: The shape of the distribution of z-scores would be the same as the 


Mr. Pryorstest scores _ [Dot Plot 9) shape of the original distribution of test scores. Neither subtracting a constant 
nor dividing by a constant would change the shape of the graph. The dotplots 
confirm that the combination of these two transformations does not affect the 

65 70 75 80 8 90 95 shape. 
Test_scores ¢ Center: Subtracting 80 from each data value would also reduce the mean 
uecPryoretestacorcs: (TD by 80. Because the mean of the original distribution was 80, the mean of the 


transformed data would be 0. Dividing each of these new data values by 6.07 
would also divide the mean by 6.07. But because the mean is now 0, dividing 
by 6.07 would leave the mean at 0). That is, the mean of the z-score distribu- 


: 0 1 2 tion would be 0. 
zscores ‘a 


Spread: The spread of the distribution would not be affected by subtracting 
80 from each observation. However, dividing each data value by 6.07 would 
also divide our common measures of spread by 6.07. The standard deviation 
This is a result worth noting! If you of the distribution of z-scores would therefore be 6.07/6.07 = 1. 

start with any set of quantitative data 
and convert the values to standardized 
scores (z-scores), the transformed cade 
data set will have a mean of 0 anda standard deviation 1. 
standard deviation of 1. The shape of 


the two distributions willbe the same. Descriptive Statistics: Test scores, z-scores 
We will use this result to our advantage 


The Minitab computer output below confirms the result: If we standardize 
every observation in a distribution, the resulting set of z-scores has mean 0 and 


in Section 2.2 Variable n Mean StDev Minimum Q, Median Q3 Maximum 
~ Test scores 25 80.00 6.07 67.00 76.00 80.00 83.50 93.00 
z-scores 25 0.00 1.00 —2.14 —0.66 0.00 0.58 2.14 


OR 


Many other types of transformations can be very useful in analyzing data. We 
have only studied what happens when you transform data through addition, sub- 
traction, multiplication, or division. 


CHECK YOUR UNDERSTANDING 


The figure on the next page shows a dotplot of the height distribution for Mrs. Navard’s 
class, along with summary statistics from computer output. 


1. Suppose that you convert the class’s heights from inches to centimeters (1 inch = 
2.54 cm). Describe the effect this will have on the shape, center, and spread of the 
distribution. 
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2. IfMrs. Navard had the entire class stand on a 6-inch-high plat- 


I ES — form and then had the students measure the distance from the top of 
a a i i a their heads to the ground, how would the shape, center, and spread 
Height (inches) of this distribution compare with the original height distribution? 
Variable n s, Min Q; Med Q; Max 3. Now suppose that you convert the class’s heights to z-scores. 
Height 25 67 429 60 63 66 69 75 iene be the shape, center, and spread of this distribution? 
xplain. 


DATA EXPLORATION The speed of light 


Light travels fast, but it is not transmitted instantly. Light takes over a second to reach 
us from the moon and over 10 billion years to reach us from the most distant objects in 
the universe. Because radio waves and radar also travel at the speed of light, having an 
accurate value for that speed is importantin communicating with astronauts and orbit- 
ing satellites. An accurate value for the speed of light is also important 
to computer designers because electrical signals travel at light speed. 
The first reasonably accurate measurements of the speed of light 
were made more than a hundred years ago by A. A. Michelson and 
Simon Newcomb. The table below contains 66 measurements 
made by Newcomb between July and September 1882.’ 

Newcomb measured the time in seconds thata light signal took to 
pass from his laboratory on the Potomac River to a mirror at the base 
of the Washington Monument and back, a total distance of about 
7400 meters. Newcomb’s first measurement of the passage time of 
light was 0.000024828 second, or 24,828 nanoseconds. (‘There are 
10° nanoseconds in a second.) The entries in the table record only 
the deviations from 24,800 nanoseconds. 


Dio Lie Die 2a Se A De NG AO Lo 2 tee Jae Lh 
152, We feo de Bl ett 20 30. Sc. 30. 28 Zo 2 


The figure provides a histogram and numerical sum- 
maries (computed with and without the two outliers) from 
Minitab for these data. 


1. We could convert the passage time measurements to 


és | nanoseconds by adding 24,800 to each of the data values in 
the table. What effect would this have on the shape, center, 
104 and spread of the distribution? Be specific. 


2. After performing the transformation to nanoseconds, we 
could convert the measurements from nanoseconds to sec- 


a 22 ee onds by dividing each value by 10°. What effect would this 
Passage time (Deviations from 24,800 nanoseconds) have on the shape, center, and spread of the distribution? 


Be specific. 


Descriptive Statistics: Passage time : . : . 
3. Use the information provided to esti- 


mate the speed of light in meters per sec- 


P.Time 66 26.21 10.75 —44 24 27 31 40 ond. Be prepared to justify the method 
P.Time* 64 27.75 5.08 16 24.5 27.5 31 40 you used. 


Variable nm Mean Stdev Min Q1 Med Q3; Max 
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Summary 


e ‘Two ways of describing an individual’s location within a distribution are 
percentiles and z-scores. An observation’s percentile is the percent of the 
distribution that is below the value of that observation. To standardize any 
observation x, subtract the mean of the distribution and then divide the 
difference by the standard deviation. The resulting z-score 


x — mean 


A= rene 
standard deviation 


says how many standard deviations x lies above or below the distribution 
mean. We can also use percentiles and z-scores to compare the location of 
individuals in different distributions. 


e¢ Acumulative relative frequency graph allows us to examine location within 
a distribution. Cumulative relative frequency graphs begin by grouping 
the observations into equal-width classes (much like the process of making 
a histogram). The completed graph shows the accumulating percent of 
observations as you move through the classes in increasing order. 


e It is common to transform data, especially when changing units of 
measurement. When you add a constant a to all the values in a data set, 
measures of center (median and mean) and location (quartiles and percentiles) 
increase by a. Measures of spread do not change. When you multiply all the 
values in a data set by a positive constant b, measures of center, location, 
and spread are multiplied by b. Neither of these transformations changes the 
shape of the distribution. 


Exercises 


1. Shoes How many pairs of shoes do students have? 2. Old folks Here is a stemplot of the percents of resi- 
A] 86 | Do girls have more shoes than boys? Here are data dents aged 65 and older in the 50 states: 
& from a random sample of 20 female and 20 male 
students at a large high school: 7 | 0 Key: 15|2 means 15.2% of 
(33 || 3 this state’s residents are 65 
Female: 50 26 26 31 57 19 24 22 23 38 sealteee oie 
is BO) Ws s4 23 30 49 18 I Si 11 | 16777 
’ 12 | 01122456778999 
Male: 4 ¢ @ 5 12 & @& 7 10 10 13 | 0001223344455689 
10 #11 4 8 2 FF © 10 83 7 14 | 023568 
15 | 24 
(a) Find and interpret the percentile in the female distri- 1619 


UROL: Mice EGET (a) Find and interpret the percentile for Colorado, where 


(b) Find and interpret the percentile in the male distribu- 10.1% of the residents are aged 65 and older. 
tion for the boy with 22 pairs of shoes. (b) Find and interpret the percentile for Rhode Island, 


(c) Who is more unusual: the girl with 22 pairs of shoes where 13.9% of the residents are aged 65 and older. 
or the boy with 22 pairs of shoes? Explain. (c) Which of these two states is more unusual? Explain. 


100 CHAPTER 2 


3. Math test Josh just got the results of the statewide 
Algebra 2 test: his score is at the 60th percentile. 
When Josh gets home, he tells his parents that he got 
60 percent of the questions correct on the state test. 
Explain what’s wrong with Josh’s interpretation. 


4. Blood pressure Larry came home very excited after a 
visit to his doctor. He announced proudly to his wife, 
“My doctor says my blood pressure is at the 90th per- 
centile among men like me. That means I’m better off 
than about 90% of similar men.” How should his wife, 
who is a statistician, respond to Larry’s statement? 


5. Growth charts We used an online growth chart to 
find percentiles for the height and weight of a 16-year- 
old girl who is 66 inches tall and weighs 118 pounds. 
According to the chart, this girl is at the 48th per- 
centile for weight and the 78th percentile for height. 
Explain what these values mean in plain English. 


6. Run fast Peter is a star runner on the track team. In 
the league championship meet, Peter records a time 
that would fall at the 80th percentile of all his race 
times that season. But his performance places him at 
the 50th percentile in the league championship meet. 
Explain how this is possible. (Remember that lower 
times are better in this case!) 


Exercises 7 and 8 involve a new type of graph called a 
percentile plot. Each point gives the value of the variable 
being measured and the corresponding percentile for one 
individual in the data set. 


7. ‘Textme The percentile plot below shows the distri- 
bution of text messages sent and received in a two-day 
period by a random sample of 16 females from a large 


high school. 
(a) Describe the student represented by the highlighted 
point. 


(b) Use the graph to estimate the median number of 
texts. Explain your method. 


| Percentile Plot > } 
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8. Foreign-born residents The following percentile plot 
shows the distribution of the percent of foreign-born 
residents in the 50 states. 
(a) The highlighted point is for Maryland. Describe what 
the graph tells you about this state. 
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(b) Use the graph to estimate the 30th percentile of the 
distribution. Explain your method. 


Foreign-born residents 


{ Percentile Plot )$) 


Percentile 
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9. Shopping spree The figure below is a cumulative 


pole} == relative frequency graph of the amount spent by 
© 50 consecutive grocery shoppers at a store. 
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(a) Estimate the interquartile range of this distribution. 
Show your method. 


(b) What is the percentile for the shopper who spent 
$19.50? 


(c) Draw the histogram that corresponds to this graph. 


10. Light it up! The graph below is a cumulative 
relative frequency graph showing the lifetimes 
(in hours) of 200 lamps.* 
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(a) Estimate the 60th percentile of this distribution. 
Show your method. 


(b) What is the percentile for a lamp that lasted 900 hours? 


(c) Draw a histogram that corresponds to this graph. 
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13. 


14. 
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SAT versus ACT Eleanor scores 680 on the SAT’ 
Mathematics test. The distribution of SAT scores is 
symmetric and single-peaked, with mean 500 and 
standard deviation 100. Gerald takes the American 
College Testing (ACT) Mathematics test and 
scores 27. ACT scores also follow a symmetric, 
single-peaked distribution—but with mean 18 and 
standard deviation 6. Find the standardized scores 
for both students. Assuming that both tests measure 
the same kind of ability, who has the higher score? 


Comparing batting averages ‘Three landmarks of 
baseball achievement are ‘Ty Cobb’s batting average 
of 0.420 in 1911, Ted Williams’s 0.406 in 1941, and 
George Brett’s 0.390 in 1980. These batting averages 
cannot be compared directly because the distribution 
of major league batting averages has changed over the 
years. The distributions are quite symmetric, except 
for outliers such as Cobb, Williams, and Brett. While 
the mean batting average has been held roughly con- 
stant by rule changes and the balance between hitting 
and pitching, the standard deviation has dropped over 
time. Here are the facts: 


Decade Mean Standard deviation 
1910s 0.266 0.0371 
1940s 0.267 0.0326 
1970s 0.261 0.0317 


Find the standardized scores for Cobb, Williams, and 
Brett. Who was the best hitter?’ 


Measuring bone density Individuals with low bone 
density have a high risk of broken bones (fractures). 
Physicians who are concerned about low bone density 
(osteoporosis) in patients can refer them for special- 
ized testing. Currently, the most common method for 
testing bone density is dual-energy X-ray absorptiom- 
etry (DEXA). A patient who undergoes a DEXA test 
usually gets bone density results in grams per square 
centimeter (g/cm?) and in standardized units. 


Judy, who is 25 years old, has her bone density 
measured using DEXA. Her results indicate a bone 
density in the hip of 948 g/cm? and a standardized 
score of z = —1.45. In the reference population of 
25-year-old women like Judy, the mean bone density 
in the hip is 956 g/cm’. 


Judy has not taken a statistics class in a few years. Ex- 
plain to her in simple language what the standardized 
score tells her about her bone density. 


Use the information provided to calculate the standard 
deviation of bone density in the reference population. 


Comparing bone density Refer to the previous exer- 
cise. One of Judy’s friends, Mary, has the bone density 


(a) 


(b 


= 
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in her hip measured using DEXA. Mary is 35 years 
old. Her bone density is also reported as 948 g/cm’, 
but her standardized score is z = 0.50. The mean 
bone density in the hip for the reference population 
of 35-year-old women is 944 grams/cm?. 


Whose bones are healthier —Judy’s or Mary’s? Justify 


your answer. 


Calculate the standard deviation of the bone density in 
Marty’s reference population. How does this compare 
with your answer to Exercise 13(b)? Are you surprised? 


Exercises 15 and 16 refer to the dotplot and summary sta- 
tistics of salaries for players on the World Champion 2008 
Philadelphia Phillies baseball team. 


Salary (millions) 


Variable n 


Salary 


Mean Std. dev. Min Q, Med Q; Max 


29 3388617 3767484 390000 440000 1400000 6000000 14250000 


15 
re] 90 


17. 


18. 


. Baseball salaries Brad Lidge played a crucial role as 


the Phillies’ “closer,” pitching the end of many games 
throughout the season. Lidge’s salary for the 2008 
season was $6,350,000. 


Find the percentile corresponding to Lidge’s salary. 
Explain what this value means. 


Find the z-score corresponding to Lidge’s salary. Ex- 
plain what this value means. 


. Baseball salaries Did Ryan Madson, who was paid 


$1,400,000, have a high salary or a low salary com- 
pared with the rest of the team? Justify your answer 
by calculating and interpreting Madson’s percentile 
and z-score. 


The scores on Ms. Martin’s statistics quiz had a 
mean of 12 and a standard deviation of 3. Ms. Martin 
wants to transform the scores to have a mean of 75 
and a standard deviation of 12. What transformations 
should she apply to each test score? Explain. 


Mr. Olsen uses an unusual grading system in his 
class. After each test, he transforms the scores to have 
a mean of 0 and a standard deviation of 1. Mr. Olsen 
then assigns a grade to each student based on the 
transformed score. On his most recent test, the class’s 
scores had a mean of 68 and a standard deviation of 
15. What transformations should he apply to each 
test score? Explain. 
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19 
i 25 
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. Tall or short? Mr. Walker measures the heights (in 


inches) of the students in one of his classes. He uses a 
computer to calculate the following numerical 
summaries: 


Mean Std. dev. Min Q, Med Q3 Max 
69.188 3.20 oo Gris 695 7 74.5 


ats 


Next, Mr. Walker has his entire class stand on their 
chairs, which are 18 inches off the ground. Then he 
measures the distance from the top of each student’s 
head to the floor. 

Find the mean and median of these measurements. 
Show your work. 

Find the standard deviation and IOR of these mea- 
surements. Show your work. 


. Teacher raises A school system employs teachers at 


salaries between $28,000 and $60,000. The teachers’ 
union and the school board are negotiating the form 
of next year’s increase in the salary schedule. 


If every teacher is given a flat $1000 raise, what will 
this do to the mean salary? To the median salary? E'x- 
plain your answers. 


What would a flat $1000 raise do to the extremes and 
quartiles of the salary distribution? ‘To the standard 
deviation of teachers’ salaries? Explain your answers. 


. Tall or short? Refer to Exercise 19. Mr. Walker con- 


verts his students’ original heights from inches to feet. 


Find the mean and median of the students’ heights in 
feet. Show your work. 


Find the standard deviation and IOR of the students’ 
heights in feet. Show your work. 


. Teacher raises Refer to Exercise 20. If each teacher 


receives a 5% raise instead of a flat $1000 raise, the 
amount of the raise will vary from $1400 to $3000, 
depending on the present salary. 


What will this do to the mean salary? ‘To the median 
salary? Explain your answers. 


Will a 5% raise increase the IOR? Will it increase the 
standard deviation? Explain your answers. 


. Cool pool? Coach Ferguson uses a thermometer 


to measure the temperature (in degrees Celsius) at 
20 different locations in the school swimming pool. 
An analysis of the data yields a mean of 25°C anda 
standard deviation of 2°C. Find the mean and stan- 
dard deviation of the temperature readings in degrees 
Fahrenheit (recall that °F = (9/5)°C + 32). 


Measure up Clarence measures the diameter of each 
tennis ball in a bag with a standard ruler. Unfortu- 
nately, he uses the ruler incorrectly so that each of his 


measurements is 0.2 inches too large. Clarence’s data 
had a mean of 3.2 inches and a standard deviation of 
0.1 inches. Find the mean and standard deviation of 

the corrected measurements in centimeters (recall 


that 1 inch = 2.54 cm). 


Multiple choice: Select the best answer for Exercises 25 
to 30. 


25. Jorge’s score on Exam | in his statistics class was at 
the 64th percentile of the scores for all students. His 
score falls 


(a) between the minimum and the first quartile. 
(b) between the first quartile and the median. 

(c) between the median and the third quartile. 
(d) between the third quartile and the maximum. 
(e) at the mean score for all students. 


26. When Sam goes to a restaurant, he always tips the 
server $2 plus 10% of the cost of the meal. If Sam’s 
distribution of meal costs has a mean of $9 and a stan- 
dard deviation of $3, what are the mean and standard 
deviation of the distribution of his tips? 


(a) $2.90, $0.30 
(b) $2.90, $2.30 
(c) $9.00, $3.00 
(d) $11.00, $2.00 
(e) $2.00, $0.90 


27. Scores on the ACT college entrance exam follow a 
bell-shaped distribution with mean 18 and standard 
deviation 6. Wayne’s standardized score on the ACT 
was —0.5. What was Wayne’s actual ACT score? 


(Gia sath 2 (cy 15 eed) aera 


28. George has an average bowling score of 180 and 
bowls in a league where the average for all bowlers 
is 150 and the standard deviation is 20. Bill has an 
average bowling score of 190 and bowls in a league 
where the average is 160 and the standard deviation 
is 15. Who ranks higher in his own league, George 
or Bill? 


(a) Bill, because his 190 is higher than George’s 180. 


(b) Bill, because his standardized score is higher than 
George’s. 

(c) Bill and George have the same rank in their leagues, 
because both are 30 pins above the mean. 

(d) George, because his standardized score is higher than 
Bill’s. 

(e) George, because the standard deviation of bowling 
scores is higher in his league. 
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Exercises 29 and 30 refer to the following setting. The num- 
ber of absences during the fall semester was recorded for 
each student in a large elementary school. The distribu- 
tion of absences is displayed in the following cumulative 
relative frequency graph. 
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Cumulative Relative Frequency 
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Number of Absences 


29. What is the interquartile range (IOR) for the distribu- 
tion of absences? 

(a) | (c) 3 (e) 14 

(b)2  (d)5 

30. If the distribution of absences was displayed in a 


histogram, what would be the best description of the 
histogram’s shape? 


(a) Symmetric 
(b) Uniform 
(c) Skewed left 
( 

( 


d) Skewed right 


e) Cannot be determined 


Exercises 31 and 32 refer to the following setting. We used 
CensusAtSchool’s Random Data Selector to choose a 
sample of 50 Canadian students who completed a survey in 
a recent year. 


31. Travel time (1.2) The dotplot below displays data on 
> students’ responses to the question “How long does 
© _ it usually take you to travel to school?” Describe the 

shape, center, and spread of the distribution. Are 
there any outliers? 


ecococcce 
eccceecce 
—| eeececce 


== 
iS) 
i=) 
& 
oO 
Diese 
oO 
i=] 
= 
So 
i=) 


Travel time (minutes) 


. Lefties (1.1) Students were asked, “Are you right- 


32 
yr handed, left-handed, or ambidextrous?” The 
¢ 


responses are shown below (R = right-handed; 
L = left-handed; A = ambidextrous). 


Re IR IRR REI RRR RETR III 
OUR URUK AA 
IR UR Ik IR AY IR IR Ib IR IR IX UR I A 
IR IR IR IR IR IR IR IR 


(a) Make an appropriate graph to display these data. 


(b) Over 10,000 Canadian high school students took the 
CensusAtSchool survey that year. What percent of 
this population would you estimate is left-handed? 
Justify your answer. 


Density Curves and Normal 


Distributions 


WHAT YOU WILL LEARN 


e Estimate the relative locations of the median and mean 
on a density curve. 


Use the 68-95-99.7 rule to estimate areas (propor- 


tions of values) in a Normal distribution. 


Use Table A or technology to find (i) the proportion of 
z-values in a specified interval, or (ii) a ZScore from a 
percentile in the standard Normal distribution. 


By the end of the section, you should be able to: 


e Use Table A or technology to find (i) the proportion of 
values in a specified interval, or (ii) the value that cor- 
responds to a given percentile in any Normal 
distribution. 


Determine whether a distribution of data is approxi- 
mately Normal from graphical and numerical evidence. 
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In Chapter 1, we developed a kit of graphical and numerical tools for describing 
distributions. Our work gave us a clear strategy for exploring data from a single 
quantitative variable. 


EXPLORING QUANTITATIVE DATA 


1. Always plot your data: make a graph, usually a dotplot, stemplot, or histogram. 
2. Look for the overall pattern (shape, center, spread) and for striking depar- 
tures such as outliers. 


3. Calculate numerical summaries to briefly describe center and spread. 
In this section, we add one more step to this strategy. 


4. Sometimes the overall pattern of a large number of observations is so regu- 
lar that we can describe it by a smooth curve. 


Density Curves 


Figure 2.7 is a histogram of the scores of all 947 seventh-grade students in Gary, 
Indiana, on the vocabulary part of the Iowa Test of Basic Skills (ITBS).° Scores on 
this national test have a very regular distribution. The histogram is symmetric, 
and both tails fall off smoothly from a single center peak. There are no large 
gaps or obvious outliers. The smooth curve drawn through the tops of the 
histogram bars in Figure 2.7 is a good description of the overall pattern of 
the data. 


FIGURE 2.7 Histogram of the lowa Test of Basic Skills (ITBS) vocabulary 


2 4 6 8 
ITBS vocabulary score 


10 2 scores of all seventh-grade students in Gary, Indiana. The smooth curve shows 
the overall shape of the distribution. 


Seventh-Grade Vocabulary Scores 
From histogram to density curve 


Our eyes respond to the areas of the bars in a histogram. The bar areas represent 
relative frequencies (proportions) of the observations. Figure 2.8(a) is a copy of 
Figure 2.7 with the leftmost bars shaded. The area of the shaded bars in Figure 2.8(a) 
represents the proportion of students with vocabulary scores less than 6.0. There are 
287 such students, who make up the proportion 287/947 = 0.303 of all Gary seventh- 
graders. In other words, a score of 6.0 corresponds to about the 30th percentile. 


The total area of the bars in the histogram is 100% (a proportion of 1), because all 
of the observations are represented. Now look at the curve drawn through the tops 
of the bars. In Figure 2.8(b), the area under the curve to the left of 6.0 is shaded. 
In moving from histogram bars to a smooth curve, we make a specific choice: 
adjust the scale of the graph so that the total area under the curve is exactly 1. Now 
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ITBS vocabulary score ITBS vocabulary score 


(a) (b) 


FIGURE 2.8 (a) The proportion of scores less than or equal to 6.0 in the actual data is 0.303. (b) The proportion of 
scores less than or equal to 6.0 from the density curve is 0.293. 


the total area represents all the observations, just like with the histogram. We can 
then interpret areas under the curve as proportions of the observations. 


The shaded area under the curve in Figure 2.8(b) represents the proportion of 
students with scores lower than 6.0. This area is 0.293, only 0.010 away from the 
actual proportion 0.303. So our estimate based on the curve is that a score of 6.0 
falls at about the 29th percentile. You can see that areas under the curve give 
good approximations to the actual distribution of the 947 test scores. In practice, it 
might be easier to use this curve to estimate relative frequencies than to determine 
the actual proportion of students by counting data values. 


A curve like the one in the previous example is called a density curve. 


Density curves, like distributions, come in many shapes. A density curve is 
often a good description of the overall pattern of a distribution. Outliers, which 
are departures from the overall pattern, are not described by the curve. 

No set of real data is exactly described by a density curve. The curve is an 
approximation that is easy to use and accurate enough for practical use. 
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CHAPTER 2 


MODELING DISTRIBUTIONS OF DATA 


Describing Density Curves 


Our measures of center and spread apply to density curves as well as to actual sets of 
observations. Areas under a density curve represent proportions of the total number 
of observations. The median of a data set is the point with half the observations on ei- 
ther side. So the median of a density curve is the “equal-areas point,” the point with 
half the area under the curve to its left and the remaining half of the area to its right. 

Because density curves are idealized patterns, a symmetric density curve is 
exactly symmetric. The median of a symmetric density curve is therefore at its 
center. Figure 2.9(a) shows a symmetric density curve with the median marked. It 
isn’t so easy to spot the equal-areas point on a skewed curve. There are mathemati- 
cal ways of finding the median for any density curve. That’s how we marked the 
median on the skewed curve in Figure 2.9(b). 

What about the mean? The mean of a set of observations is their arithmetic 
average. As we saw in Chapter 1, the mean is also the “balance point” of a distri- 
bution. That is, if we think of the observations as weights strung out along a thin 
rod, the mean is the point at which the rod would balance. This fact is also true of 


The long right tail pulls 
the mean to the right 
of the median. 


Mean 
Median and mean Median 


(b) 


FIGURE 2.9 (a) The median and mean of a symmetric density curve both lie at the center of sym- 
metry. (b) The median and mean of a right-skewed density curve. The mean is pulled away from 
the median toward the long tail. 


FIGURE 2.10 The mean is the bal- 
ance point of a density curve. 


You probably noticed that we used 
the same notation for the mean and 
standard deviation of a population in 
Chapter 1, ~ and o, as we do here for 
the mean and standard deviation of a 
density curve. 


Total area under 
curve = 1 


Area = 0.12 
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density curves. The mean of a density curve is the point at which the curve would 
balance if made of solid material. Figure 2.10 illustrates this fact about the mean. 


——— — 


A symmetric curve balances at its center because the two sides are identical. 
The mean and median of a symmetric density curve are equal, as in Figure 2.9(a). 
We know that the mean of a skewed distribution is pulled toward the long tail. 
Figure 2.9(b) shows how the mean of a skewed density curve is pulled toward the 
long tail more than the median is. 


DISTINGUISHING THE MEDIAN AND MEAN OF A DENSITY CURVE 


Because a density curve is an idealized description of a distribution of data, we 
distinguish between the mean and standard deviation of the density curve and the 
mean x and standard deviation s, computed from the actual observations. The usual 
notation for the mean of a density curve is 4 (the Greek letter mu). We write the 
standard deviation of a density curve as o (the Greek letter sigma). We can roughly 
locate the mean yu of any density curve by eye, as the balance point. There is no easy 
way to locate the standard deviation o by eye for density curves in general. 


CHECK YOUR UNDERSTANDING 


Use the figure shown to answer the following questions. 
1. Explain why this is a legitimate density curve. 


2. About what proportion of observations lie between 7 
and 8? 


3. Trace the density curve onto your paper. Mark the approximate 
location of the median. 


4. Now mark the approximate location of the mean. Explain why 
the mean and median have the relationship that they do in this 
case. 
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Normal Distributions 


One particularly important class of density curves has already appeared in Figures 
2.7, 2.8, and 2.9(a). They are called Normal curves. The distributions they de- 
scribe are called Normal distributions. Normal distributions play a large role in 
statistics, but they are rather special and not at all “normal” in the sense of being 
usual or typical. We capitalize Normal to remind you that these curves are special. 

Look at the two Normal curves in Figure 2.11. They illustrate several important 
facts: 


e All Normal curves have the same overall shape: symmetric, single-peaked, 
and bell-shaped. 


e Any specific Normal curve is completely described by giving its mean ju and 
its standard deviation o. 


Bh Be 


FIGURE 2.11 Two Normal curves, showing the mean jy and standard deviation o. 


e The mean is located at the center of the symmetric curve and is the same as the 
median. Changing ys without changing o moves the Normal curve along the 
horizontal axis without changing its spread. 


e The standard deviation o controls the spread of a Normal curve. Curves with 
larger standard deviations are more spread out. 


The standard deviation a is the natural measure of spread for Normal distribu- 
tions. Not only do 4 and o completely determine the shape of a Normal curve, 
but we can locate a by eye on a Normal curve. Here’s how. Imagine that you are 
skiing down a mountain that has the shape of a Normal curve. At first, you de- 
scend at an ever-steeper angle as you go out from the peak: 


/ 


Fortunately, before you find yourself going straight down, the slope begins to grow 
flatter rather than steeper as you go out and down: 


ae —— 


The points at which this change of curvature takes place are located at a distance o on 
either side of the mean yt. (Advanced math students know these points as “inflection 
points.”) You can feel the change as you run a pencil along a Normal curve and so 
find the standard deviation. Remember that js and o alone do not specify _« 
the shape of most distributions. The shape of density curves in general does rr) 
not reveal a. These are special properties of Normal distributions. 


Normal curves were first applied to 
data by the great mathematician Carl 
Friedrich Gauss (1777-1855). He used 
them to describe the small errors made 
by astronomers and surveyors in 
repeated careful measurements of the 
same quantity. You will sometimes see 
Normal distributions labeled “Gaussian” 
in honor of Gauss. 


2.19 
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Why are the Normal distributions important in statistics? Here are three rea- 
sons. First, Normal distributions are good descriptions for some distributions of 
real data. Distributions that are often close to Normal include 


e scores on tests taken by many people (such as SAT exams and IQ tests), 


e repeated careful measurements of the same quantity (like the diameter of a 
tennis ball), and 


e characteristics of biological populations (such as lengths of crickets and yields 
of corn). 


Second, Normal distributions are good approximations to the results of many 
kinds of chance outcomes, like the number of heads in many tosses of a fair coin. 
Third, and most important, we will see that many statistical inference procedures 
are based on Normal distributions. 

Even though many sets of data follow a Normal distribution, many do not. Most 
income distributions, for example, are skewed to the right and so are not Nor- 
mal. Some distributions are symmetric but not Normal or even close to Normal. 
The uniform distribution of Exercise 35 (page 128) is one such example. 
Non-Normal data, like non-normal people, not only are common but are og 
sometimes more interesting than their Normal counterparts. 


The 68—95-99.7 Rule 


Earlier, we saw that the distribution of Iowa Test of Basic Skills (ITBS) vocabulary 
scores for seventh-grade students in Gary, Indiana, is symmetric, single-peaked, 
and bell-shaped. Suppose that the distribution of scores over time is exactly Nor- 
mal with mean yz = 6.84 and standard de- 
viation 0 = 1.55. (These are the mean and 
standard deviation of the 947 actual scores.) 
The figure shows the Normal density curve 
for this distribution with the points 1, 2, and 
3 standard deviations from the mean labeled 
on the horizontal axis. 

How unusual is it for a Gary seventh- 
grader to get an ITBS score above 9.94? As 
the following activity shows, the answer to 
this question is surprisingly simple. 
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3.74 5.29 6.84 8.39 9.94 11.49 
ITBS score 
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ACTIVITY | The Normal density curve applet 


MATERIALS: In this Activity, you will use the Normal Density Curve applet at the book’s Web 
Computer with site (www.whfreeman.com/tps5e) to explore an interesting property of Normal 
Internet access distributions. A graph similar to what you will see when you launch the applet is 
shown below. The applet finds the area under the curve in the region indicated 
by the green flags. If you drag one flag past the other, the applet will show the area 
under the curve between the two flags. When the “2-Tail” box is checked, the ap- 
plet calculates symmetric areas around the mean. 
Use the applet to help you answer the following questions. 


1. Ifyou put one flag at the extreme left of the curve and the second flag exactly 
in the middle, what proportion is reported by the applet? Why does this value 
make sense? 


2. If you place the two flags exactly one standard deviation on either side of the 
mean, what does the applet say is the area between them? 


3. What percent of the area under the Normal 
sai: Al , curve lies within 2 standard deviations of the mean? 


2 25a / \ 4. Use the applet to show that about 99.7% of the 
aie / area under the Normal density curve lies within 
| three standard deviations of the mean. Does this 
mean that about 99.7%/2 = 49.85% will lie within 
one and a half standard deviations? Explain. 


5. Change the mean to 100 and the standard de- 
viation to 15. Then click “Update.” What percent 
of the area under this Normal density curve lies 
within one, two, and three standard deviations of 
the mean? 


6. Change the mean to 6.84 and the standard deviation to 1.55. (These values 
are from the IT'BS vocabulary scores in Gary, Indiana.) Answer the question 
from Step 5. 


7. Summarize: Complete the following sentence: “For any Normal density 
curve, the area under the curve within one, two, and three standard deviations of 
the mean is about__%,__—-%, and__%.” 


Although there are many Normal curves, they all have properties in common. 
In particular, all Normal distributions obey the following rule. 


DEFINITION: The 68-95-99.7 rule 

In a Normal distribution with mean ;. and standard deviation o: 

e Approximately 68% of the observations fall within o of the mean ju. 

e Approximately 95% of the observations fall within 2c of the mean ju. 
e Approximately 99.7% of the observations fall within 3c of the mean pu. 
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Figure 2.12 illustrates the 68-95-99.7 rule. (Some people refer 
to this result as the “empirical rule.”) By remembering these three 
numbers, you can think about Normal distributions without con- 
stantly making detailed calculations. 

Here’s an example that shows how we can use the 68—95— 
99.7 rule to estimate the percent of observations in a specified 


/ \~ 68% of data>} | 


-__ 95% of data + 


interval. 
—- 99.7% of data +— 
3 -2 =i 0 1 2 3 
Standard deviations FIGURE 2.12 The 68-95—-99.7 rule for Normal distributions. 


ITBS Vocabulary Scores 


Using the 68-95-99. 7 rule 

PROBLEM: The distribution of TBS vocabulary scores for seventh-graders in Gary, Indiana, is 

N(6.64, 1.55). 

(a) What percent of the ITBS vocabulary scores are less than 3.74? Show your work. 

(b) What percent of the scores are between 5.29 and 9.94? Show your work. 

SOLUTION: 

(a) Notice that a score of 3.74 is exactly two standard deviations below the mean. By the 
68-95-99.7 rule, about 95% of all scores are between 

jt — 20 = 6.64 — (2)(1.55) = 6.84 — 3.10 = 3.74 

ju + 20 = 6.64 + (2)(1.55) = 6.84 + 3.10 = 9.94 
The other 5% of scores are outside this range. Because Normal distributions are symmetric, 
half of these scores are lower than 3.74 and half are higher than 9.94. That is, about 2.5% of 

the ITBS scores are below 3.74. Figure 2.1 3(a) shows this reasoning in picture form. 


(b) Let's start with a picture. Figure 2.13(b) shows the area under the Normal density curve between 5.29 
and 9.94. We can see that about 68% + 13.5% = 81.5% of ITBS scores are between 5.29 and 9.94. 


and 


= 95% of About 68% 
Scores of scores 
within 


Ac of pa 


About 2.5% 
of scores are 
less than 
3,74. 


ITBS score ITBS score 


FIGURE 2.13 (a) Finding the percent of lowa Test scores less than 3.74. (b) Finding the percent of lowa Test scores between 
5.29 and 9.94. 


For Practice Try Exercise 
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The 68-95-99.7 rule applies only to Normal distributions. Is there a similar 
rule that would apply to any distribution? Sort of. A result known as Chebyshev’s 


Chebyshev’s inequality is an interesting inequality says that in any distribution, the proportion of observations falling with- 
result, but it is not required for the AP® l 
Statistics exam. in k standard deviations of the mean is at least | — =>. Ifk = 2, for example, Cheby- 


k 


] 
shev’s inequality tells us that at least 1 — zo 0.75 of the observations in any distri- 


bution are within 2 standard deviations of the mean. For Normal distributions, we 
know that this proportion is much higher than 0.75. In fact, it’s approximately 0.95. 


THINK All models are wrong, but some are useful! The 68-95-99.7 rule 

describes distributions that are exactly Normal. Real data such as the actual ITBS 
ABOUT IT scores are never exactly Normal. For one thing, IT'BS scores are reported only to 
the nearest tenth. A score can be 9.9 or 10.0 but not 9.94. We use a Normal distri- 
bution because it’s a good approximation, and because we think of the knowledge 
that the test measures as continuous rather than stopping at tenths. 

How well does the 68—95—99.7 rule describe the actual ITBS scores? Well, 900 
of the 947 scores are between 3.74 and 9.94. That’s 95.04%, very accurate indeed. 
Of the remaining 47 scores, 20 are below 3.74 and 27 are above 9.94. The tails of 
the actual data are not quite equal, as they would be in an exactly Normal distri- 
bution. Normal distributions often describe real data better in the center of the 
distribution than in the extreme high and low tails. 

As famous statistician George Box once noted, “All models are wrong, but 
some are useful!” 


ao/ cueck YOUR UNDERSTANDING 
The distribution of heights of young women aged 18 to 24 is approximately N(6+.5, 2.5). 


1. Sketch a Normal density curve for the distribution of young women’s heights. Label 
the points one, two, and three standard deviations from the mean. 

2. What percent of young women have heights greater than 67 inches? Show your 
work. 


3. What percent of young women have heights between 62 and 72 inches? Show 
your work. 


The Standard Normal Distribution 


As the 68-95-99.7 rule suggests, all Normal distributions share many properties. 
In fact, all Normal distributions are the same if we measure in units of size o from 
the mean yp as center. Changing to these units requires us to standardize, just as 
we did in Section 2.1: 


Z= 
oO 


If the variable we standardize has a Normal distribution, then so does the new 
variable z. (Recall that subtracting a constant and dividing by a constant don’t 
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change the shape of a distribution.) This new distribution with mean p = 0 and 
standard deviation o = | is called the standard Normal distribution. 


An area under a density curve is a proportion of the observations in a distribu- 
tion. Any question about what proportion of observations lies in some range of 
values can be answered by finding an area under the curve. In a standard Normal 
distribution, the 68—95—99.7 rule tells us that about 68% of the observations fall 
between z = —1 and z = 1 (that is, within one standard deviation of the mean). 
What if we want to find the percent of observations that fall between z = —1.25 
and z = 1.25? The 68-95-99.7 rule can’t help us. 

Because all Normal distributions are the same when we standardize, we can find 
areas under any Normal curve from a single table, a table that gives areas under the 
curve for the standard Normal distribution. ‘Table A, the standard Normal table, gives 
areas under the standard Normal curve. You can find Table A in the back of the book. 


Table entry is 
area to left of z. 


For instance, suppose we wanted to find the proportion of observations from 
the standard Normal distribution that are less than 0.81. To find the area to the 
left of z = 0.81, locate 0.8 in the left-hand column of Table A, then locate the 
remaining digit 1 as .01 in the top row. The entry opposite 0.8 and under .01 is 
.7910. This is the area we seek. A reproduction of the relevant portion of ‘Table A 
is shown in the margin. 
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Table entry = 0.7910 
for z = 0.81. 


Table entry for z 
is always the area 
under the curve 

to the left of z. 


FIGURE 2.15 The area under a 
standard Normal curve to the left of 


the point z = 0.81 is 0.7910. 


Figure 2.15 illustrates the relationship between the value z = 0.81 and the area 
0.7910. Note that we have made a connection between z-scores and percentiles when the 
shape of a distribution is Normal. 


Standard Normal Distribution 
Finding area to the right 


What if we wanted to find the proportion of observations from the standard 
Normal distribution that are greater than —1.78? To find the area to the right 
of z = —1.78, locate —1.7 in the left-hand column of Table A, then locate the 

z 07 ; remaining digit 8 as .08 in the top row. The corresponding entry is .0375. (See 
-1.8 .0307 the excerpt from Table A in the margin.) 


This is the area to the left of z = —1.78. To find the area to the right of z = —1.78, we 
—1.6 .0475 use the fact that the total area under the standard Normal density curve is 1. So the 
desired proportion is 1 — 0.0375 = 0.9625. 


Figure 2.16 illustrates the relationship between the value z = —1.78 and the area 
0.9625. 


for z =-1.78. This is| | of z =—1.78 is 
the area to the left 1 — 0.0375 = 0.9625. 


Table entry = 0.0375 | | The area to the . 
of z =-1.78. 


FIGURE 2.16 The area 
under a standard Normal 
curve to the right of the 
point z = —1.78 is 0.9625. 
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A common student mistake is to look up a z-value in Table A and report 
the entry corresponding to that z-value, regardless of whether the problem 

asks for the area to the left or to the right of that z-value. To prevent making 

this mistake, always sketch the standard Normal curve, mark the z-value, and shade 
the area of interest. And before you finish, make sure your answer is reasonable in 
the context of the problem. 


Catching Some “z”s 


Finding areas under the standard Normal curve 


PROBLEM: Find the proportion of observations from the standard Normal distribution that are 
between —1.25 and0.81. 

SOLUTION: From Table A, the area to the left of z= 0.81 is 0.7910 and the area to the left of 
z=—1.25is 0.1056. So the area under the standard Normal curve between these two z-scores is 
0.7910—0.1056 = 0.6854. Figure 2.17 shows why this approach works. 


Area to left of Area to left of Area between z = -1.25 and Z = 0.81 is 
Z = 0.81 iS 0.7910 Z = 1.25 iS 0.1056. 0.7910 — 0.1056 = 0.6854. 


=3 -=2 =7 0 1 2 3 “3 =g <7 =9 SZ) 7 


FIGURE 2.17 One way to find the area between z = —1.25 and z = 0.81 under the standard Normal curve. 


Here’s another way to find the desired area. The area to the left of z= —1.25 under the standard 
Normal curve is 0.1056. The area to the right of z= 0.81 is 1 —0.7910 = 0.2090. So the area 
between these two z-scores is 


1—(0.1056 + 0.2090) = 1—-0.3146 = 0.6854 


Figure 2.18 shows this approach in picture form. 


The area to the The area to the 
left of Z = -1.25 right of Z = 0.81 
is 0.1056. i$ 0.2090. 


FIGURE 2.18 The area under the 
standard Normal curve between a 4 3 
Z=~1.25 and z = 0.81 is 0.6854. 


For Practice Try Exercise 
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Working backward: From areas to z-scores: So far, we have used 
Table A to find areas under the standard Normal curve from z-scores. What if we 
want to find the z-score that corresponds to a particular area? For example, let’s 
find the 90th percentile of the standard Normal curve. We’re looking for the z- 
score that has 90% of the area to its left, as shown in Figure 2.19. 


The area to the 
left of z is 0.90. 
What’s z? 


FIGURE 2.19 The z-score with 
area 0.90 to its left under the 


standard Normal curve. 
09 Because Table A gives areas to the left of a specified z-score, all we need to do 


8330 | 18 find the value closest to 0.90 in the middle of the table. From the reproduced 
portion of Table A, you can see that the desired z-score is z = 1.28. That is, the 


bal area to the left of z = 1.28 is approximately 0.90. 


9147 .9162 .9177 


CHECK YOUR UNDERSTANDING 

Use Table A in the back of the book to find the proportion of observations from a standard 
Normal distribution that fall in each of the following regions. In each case, sketch a stan- 
dard Normal curve and shade the area representing the region. 

1. z< 1.39 2.25215 3, —056 <2 =< 181 


Use ‘Table A to find the value z from the standard Normal distribution that satisfies each 
of the following conditions. In each case, sketch a standard Normal curve with your value 
of z marked on the axis. 


4. The 20th percentile 5. 45% ofall observations are greater than z 


You can use the Normal Density Curve applet at www.whfreeman.com/tps5e 
PPL, to confirm areas under the standard Normal curve. Just enter mean 0 and stan- 


~ 
& dard deviation 1, and then drag the flags to the appropriate locations. Of course, 


you can also confirm Normal curve areas with your calculator. 


TECHNOLOGY FROM z-SCORES TO AREAS, 


ia AND VICE VERSA 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Finding areas: The normalcd£ command on the T1-83/84 (normCdf on the TI-89) can be used to find areas under 
a Normal curve. The syntax is normalcdf (lower bound,upper bound,mean,standard deviation). 
Let’s use this command to confirm our answers to the previous two examples. 
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1. What proportion of observations from the standard Normal distribution are greater than —1.78? 
Recall that the standard Normal distribution has mean 0 and standard deviation 1. 
TI-83/84 TI-89 
Press (Distr) and choose nor- In the Stats/List Editor, Press (Distr) and 


malcdf (.OS2.55orlater: Inthe dialog box, enter choose Normal Cdf (. 


these values: lower:-1.78, upper: 100000, In the dialog box, enter these values: lower :-1.78, 


j1:0,0:1, choose Paste, and then press[ENTER |, upper:100000, sz: 0, 0:1, and then choose 
Older OS: Complete the command normalcdf£ 


(-1.78,100000,0,1) and press [ENTER |, 
NORMAL FLOAT AUTO REAL RADIAN CL (Fi~ [Feat Fue) Fee) Fee [Fr 


normalcdf(-1.78,100000.0.1 1] car 2.962462 


262862064 esa Ts 


=0 
=1 


list3=666 
MAIN RAD AUTO FUNC 3¢ 8 


Note: We chose 100000 as the upper bound because it’s many, many standard deviations above the mean. 
These results agree with our previous answer using ‘Table A: 0.9625. 


What proportion of observations from the standard Normal distribution are between —1.25 and 0.81? 


The screen shots below confirm our earlier result of 0.6854 using Table A. 


NORMAL FLOAT AUTO REAL RADIAN CL fl A re) 


normalcdf(-1.25,0.81,.0,1) 
» 6853801358 


1ist3=666 
MAIN RAD AUTO FUNC 378 


Working backward: The TI-83/84 and TI-89 invNorm function calculates the value corresponding to a given percentile in a 
Normal distribution. For this command, the syntax is invNorm (area to the left,mean, standard deviation). 


3. What is the 90th percentile of the standard Normal distribution? 
TI-83/84 TI-89 


e Press |2nd||/VARS| (Distr) and choose inv- In the Stats/List Editor, Press |F5] (Distr), 
Norm(. OS 2.55 or later: In the dialog box, en- choose Inverse, and Inverse Normal.... 
ter these values: area: .90, ~w:0, o:1, choose 
Paste, and then press [ENTER], Older OS: Com- 
plete the command invNorm(.90,0,1) and 
press | ENTER |. 


In the dialog box, enter these values: area: . 90, 
ju: 0, o:1, and then choose [ENTER |. 


These results match what we got using ‘Table A. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


invNorm(. 98.8.1) 
1.281551567 =L2BLEE 

=8 

=0 


MAIN Rab AuTO FUNC 
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Normal Distribution Calculations 


We can answer a question about areas in any Normal distribution by standard- 
izing and using Table A or by using technology. Here is an outline of the method 
for finding the proportion of the distribution in any region. 


HOW TO FIND AREAS IN ANY NORMAL DISTRIBUTION 


Step 1: State the distribution and the values of interest. Draw a Normal 
curve with the area of interest shaded and the mean, standard deviation, and 
boundary value(s) clearly identified. 


Step 2: Perform calculations—show your work! Do one of the following: 
(i) Compute a z-score for each boundary value and use Table A or technol- 
ogy to find the desired area under the standard Normal curve; or (1i) use the 
normalcedf command and label each of the inputs. 


Step 3: Answer the question. 


Here’s an example of the method at work. 


Tiger on the Range 


Normal calculations 


On the driving range, Tiger Woods practices his swing with a particular club by hitting 
many, many balls. Suppose that when Tiger hits his driver, the distance the ball trav- 
els follows a Normal distribution with mean 304 yards and standard deviation 8 yards. 


PROBLEM: What percent of Tiger’s drives travel at least 290 yards? 


X=290 
Z=-175 


SOLUTION: 


Step 1: State the distribution and the values of interest. The 
distance that Tiger’s ball travels follows a Normal distribution with 
[t = 304 and o = 8. We want to find the percent of Tiger's drives 
that travel 290 yards or more. Figure 2.20 shows the distribution 
with the area of interest shaded and the mean, standard deviation, 
and boundary value labeled. 


WN (304,8) 


Step 2: Perform calculations—show your work! For the boundary 
value x = 290, we have 


FIGURE 2.20 Distance traveled by 
Tiger Woods’s drives on the range. 


_ xp 290-304 _ 
=== 


lei 


Z 


So drives of 290 yards or more correspond to z= — 1.75 under the standard Normal curve. 


From Table A, we see that the proportion of observations less than — 1.75 is 0.0401. The area to 
the right of — 1.75 is therefore 1 — 0.0401 = 0.9599. This is about 0.96, or 96%. 


Using technology: The command normalcdf (lower:290,upper:100000, W:304, 
a: 8) also gives an area of 0.9599. 


Step 3: Answer the question. About 96% of Tiger Woods's drives on the range travel at least 290 
ards. 
: For Practice Try Exercise BSG) 


THINK 
ABOUT IT 
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What proportion of Tiger Woods’s drives go exactly 290 
yards? There is no area under the Normal density curve in Figure 2.20 exactly 
over the point 290. So the answer to our question based on the Normal model 
is 0. Tiger Woods’s actual data may contain a drive that went exactly 290 yards 
(up to the precision of the measuring device). The Normal distribution is just an 
easy-to-use approximation, not a description of every detail in the data. One more 
thing: the areas under the curve with x = 290 and x > 290 are the same. Accord- 
ing to the Normal model, the proportion of Tiger’s drives that go at least 290 yards 
is the same as the proportion that go more than 290 yards. 


aAr_,—_—@i—Oo™o00_?_ _ Oo 


The key to doing a Normal calculation is to sketch the area you want, then match 
that area with the area that the table (or technology) gives you. Here’s another example. 


Tiger on the Range (Continued) 


More complicated calculations 
PROBLEM: What percent of Tiger's drives travel between 305 and 325 yards? 


304 


x= 305 


SOLUTION: 


Step 1: State the distribution and the values of interest. As 
in the previous example, the distance that Tiger's ball travels follows 
aNormal distribution with ;2 = 304 and o = 8. We want to find the 
percent of Tiger's drives that travel between 305 and 325 yards. 
Figure 2.21 shows the distribution with the area of interest shaded 
and the mean, standard deviation, and boundary values labeled. 


N(304,8) 


312 320 -'| 328 © 336. | + Step 2: Performcalculations—show your work! For the boundary 


305 — 304 ’ 
x = 325 value x = 305, z = ay Oe = 0.13. The standardized score 


i i ’s dri 325 — 304 
FIGURE 2.21 Distance traveled by Tiger Woods’s drives on fory= 525ier = ; = 263. 


the range. 


From Table A, we see that the area between z = 0.13 and z= 2.63 under the standard Normal curve 
is the area to the left of 2.63 minus the area to the left of 0.13. Look at the picture below to check 
this. From Table A, area between 0.13 and 2.63 = area to the left of 2.63 — area to the left of 
0.13 = 0.9957 — 0.5517 = 0.4440. 


4 3 


Z=2.63 Z= 0.13 Z= 0.13 Z=2.63 


Using technology: The command normalcdf (lower:305,upper:325,W:304, 0:8) 
gives an area of 0.4459. 
Step 3: Answer the question. About 45% of Tiger's drives travel between 305 and 325 yards. 


For Practice Try Exercise BBY() 
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Table A sometimes yields a slightly Sometimes we encounter a value of z more extreme than those appearing in 

different answer from technology. Table A. For example, the area to the left of z = —4 is not given directly in the 

Thats necalve Ne nave iaeoud table. The z-values in Table A leave only area 0.0002 in each tail unaccounted 

Z-scores to two decimal places before fon ‘cal th : | ‘id 

using Table A. or. For practical purposes, we can act as if there is approximately zero area outside 
the range of Table A. 


Working backwards: From areas to values: The previous two ex- 
amples illustrated the use of Table A to find what proportion of the observations 
satisfies some condition, such as “Tiger’s drive travels between 305 and 325 yards.” 
Sometimes, we may want to find the observed value that corresponds to a given 
percentile. There are again three steps. 


HOW TO FIND VALUES FROM AREAS IN ANY NORMAL DISTRIBUTION 


Step 1: State the distribution and the values of interest. Draw a Normal 
curve with the area of interest shaded and the mean, standard deviation, and 
unknown boundary value clearly identified. 


Step 2: Perform calculations—show your work! Do one of the following: 
(i) Use Table A or technology to find the value of z with the indicated area 
under the standard Normal curve, then “unstandardize” to transform back to 
the original distribution; or (ii) Use the invNorm command and label each of 
the inputs. 


Step 3: Answer the question. 


Cholesterol in Young Boys 
Using Table A in reverse 


High levels of cholesterol in the blood increase the risk of heart disease. For 
14-year-old boys, the distribution of blood cholesterol is approximately Normal 
with mean 44 = 170 milligrams of cholesterol per deciliter of blood (mg/dl) and 
standard deviation ¢ = 30 mg/dl? 


PROBLEM: Whatis the 1st quartile of the distribution of blood cholesterol? 

SOLUTION: 

Step 1: State the distribution and the values of interest. The cholesterol level of 14-year-old 
boys follows a Normal distribution with 4. = 170 and o = 30. The 1st quartile is the boundary value x 
with 25% of the distribution to its left. Figure 2.22 shows a picture of what we are trying to find. 

Step 2: Perform calculations—show your work! Look in the body of Table A for the entry closest 
to 0.25. It’s 0.2514. This is the entry corresponding to z = —0.67.S0 z= —0.67 is the stan- 
dardized score with area 0.25 to its left. Now unstandardize. We know that the standardized score 


— 170 
for the unknown cholesterol level xis z= —0.67. So x satisfies the equation z 30. SON 


Solving for xgives 
x = 170 + (—0.67)(30) = 149.9 
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N(170,30) 
Area = 0.25 


FIGURE 2.22 Locating 
the 1st quartile of the 
cholesterol distribution oe 
for 14-year-old boys. 


Using technology: The command invNorm (area: 0.25, /6:170,0:30) givesx =149.8. 


Step 3: Answer the question. The 1st quartile of blood cholesterol levels in 14-year-old boys is 
about 150 mg/dl. 


For Practice Try Exercise BBX) 


CHECK YOUR UNDERSTANDING 

Follow the method shown in the examples to answer each of the following questions. 
Use your calculator or the Normal Curve applet to check your answers. 

1. Cholesterol levels above 240 mg/dl may require medical attention. What percent of 
14-year-old boys have more than 240 mg/dl of cholesterol? 

2. People with cholesterol levels between 200 and 240 mg/dl are at considerable risk for 
heart disease. What percent of 14-year-old boys have blood cholesterol between 200 and 
240 mg/dl? 

3. What distance would a ball have to travel to be at the 80th percentile of Tiger 
Woods’s drive lengths? 


Assessing Normality 


The Normal distributions provide good models for some distributions of real data. 
Examples include SAT and IQ test scores, the highway gas mileage of 2014 Cor- 
vette convertibles, state unemployment rates, and weights of 9-ounce bags of potato 
chips. The distributions of some other common variables are usually skewed and 
therefore distinctly non-Normal. Examples include economic variables such as per- 
sonal income and total sales of business firms, the survival times of cancer patients 
after treatment, and the lifetime of electronic devices. While experience can suggest 
whether or not a Normal distribution is a reasonable model in a particular case, it 
is risky to assume that a distribution is Normal without actually inspecting the data. 

In the latter part of this course, we will use various statistical inference proce- 
dures to try to answer questions that are important to us. These tests involve sam- 
pling individuals and recording data to gain insights about the populations from 
which they come. Many of these procedures are based on the assumption that 
the population is approximately Normally distributed. Consequently, we need to 
develop a strategy for assessing Normality. 


122 CHAPTER 2 MODELING DISTRIBUTIONS OF DATA 


FIGURE 2.23 Histogram of state 
unemployment rates. 


Unemployment in the States 
Are the data close to Normal? 


Let’s start by examining data on unemployment rates in the 50 states. Here are the 
data arranged from lowest (North Dakota’s 4.1%) to highest (Michigan’s 14.7%).!° 


Ale ates ee WO Ob OO Ot OO ao.) 0.) 6, os 0) 
LO Ae 8 SOL S82 Sat 85 8b 
C06.) 658 8 Ol Oo OG OZ OS 0s 
LOG MOS MOS OO ee 2s ee 2 2 lay 


e Plot the data. Make a dotplot, stemplot, or histogram. See if the graph is ap- 
proximately symmetric and bell-shaped. 

Figure 2.23 is a histogram of the state unemployment rates. The graph is roughly 
symmetric, single-peaked, and somewhat bell-shaped. 


Number of states 


8 10 12 


Unemployment rate 


¢ Check whether the data follow the 68—95—99.7 rule. 


We entered the unemployment rates into computer software and requested 
summary statistics. Here’s what we got: 


Mean = 8.682 Standard deviation = 2.225. 


Now we can count the number of observations within one, two, and three 
standard deviations of the mean. 


Mean + 1 SD: 6.457 to 10.907 36 out of 50 = 72% 
Mean + 2 SD: 4.232 to 13.132 48 out of 50= 96% 
Mean + 3 SD: 2.007 to 15.357. = 50 out of 50 = 100% 


These percents are quite close to the 68%, 95%, and 99.7% targets for a Normal 
distribution. 


Ifa graph of the data is clearly skewed, has multiple peaks, or isn’t bell-shaped, that’s 
evidence that the distribution is not Normal. However, just because a plot of the data 
looks Normal, we can’t say that the distribution is Normal. The 68-95—-99.7 rule can 
give additional evidence in favor of or against Normality. A Normal probability plot 
also provides a good assessment of whether a data set follows a Normal distribution. 
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Unemployment in the States 
Making a Normal probability plot 


Most software packages, including Minitab, Fathom, and JMP, can construct Nor- 
mal probability plots (sometimes called Normal quantile plots) from entered data. 
The T1-83/84 and TI-89 will also make these graphs. Here’s how a Normal prob- 
ability plot is constructed. 


1. Arrange the observed data values from smallest to largest. Record the percentile 
corresponding to each observation (but remember that there are several definitions 
of “percentile”). For example, the smallest observation in a set of 50 values is at either 
the Oth percentile (because 0 out of 50 values are below this observation) or the 2nd 
percentile (because | out of 50 values are at or below this observation). Technology 
usually “splits the difference,” declaring this minimum value to be at the (0 + 2)/2 = 


The highlighted point 
is (4.1, -2.326). North 
Dakota’s 4.1% 
unemployment rate is 
at the 1st percentile, 
which is at z = —2.326 
in the standard 
Normal distribution. 


Expected z-score 


Ist percentile. By similar reasoning, the second-smallest 
value is at the 3rd percentile, the third-smallest value is at 
the 5th percentile, and so on. The maximum value is at 


the (98 + 100)/2 = 99th percentile. 


2. Use the standard Normal distribution (Table A or 
invNorm) to find the z-scores at these same percentiles. 
For example, the Ist percentile of the standard Nor- 
mal distribution is z = —2.326. The 3rd percentile is 
2=— —1,60(5 the oth percentle is z = —1.045; 4,2; the 
99th percentile is z = 2.326. 


3. Plot each observation x against its expected z-score from 
Step 2. If the data distribution is close to Normal, the 


Unemployment rate 


plotted points will lie close to some straight line. Figure 
2.24 shows a Normal probability plot for the state unem- 


ployment data. There is a strong linear pattern, which 


FIGURE 2.24 Normal probability plot of the percent of unem- —- suggests that the distribution of unemployment rates is 
ployed individuals in each of the 50 states. close to Normal. 


Some software plots the data values on 
the horizontal axis and the z-scores on 
the vertical axis, while other software 
does just the reverse. The TI-83/84 and 
TI-89 give you both options. We prefer 
the data values on the horizontal axis, 
which is consistent with other types of 
graphs we have made. 


As Figure 2.24 indicates, real data almost always show some departure 4 
from Normality. When you examine a Normal probability plot, look for 
shapes that show clear departures from Normality. Don’t overreact to minor 
wiggles in the plot. When we discuss statistical methods that are based on the Nor- 
mal model, we will pay attention to the sensitivity of each method to departures 
from Normality. Many common methods work well as long as the data are approxi- 
mately Normal. 


INTERPRETING NORMAL PROBABILITY PLOTS 


If the points on a Normal probability plot lie close to a straight line, the data 
are approximately Normally distributed. Systematic deviations from a straight 
line indicate a non-Normal distribution. Outliers appear as points that are far 
away from the overall pattern of the plot. 
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AP® EXAM TIP Normal probability plots are not included on the AP® Statistics topic outline. 


However, these graphs are very useful for assessing Normality. You may use them on the AP® 
exam if you wish—just be sure that you know what you’re looking for (a linear pattern). 


Let’s look at an example of some data that are not Normally distributed. 


Guinea Pig Survival 
Assessing Normality 


In Chapter 1 Review Exercise R1.7 (page 77), we introduced data on the survival 
times in days of 72 guinea pigs after they were injected with infectious bacteria in 
a medical experiment. 

PROBLEM: Determine whether these data are approximately Normally distributed. 


SOLUTION: Let’s follow the first step in our strategy for assessing Normality: plot the data! 
Figure 2.25(a) shows a histogram of the guinea pig survival times. We can see that the distribution 
is heavily right-skewed. Figure 2.25(b) is a Normal probability plot of the data. The clear curvature in 
this graph confirms that these data do not follow a Normal distribution. 

We won't bother checking the 66-95-99.7 rule for these data because the graphs in Figure 
2.25 indicate serious departures from Normality. 


Frequency of survival time 
ses a 8S & 8B 
Expected z-score 


y 


0 100 200 «= 300 #00—«S ss $00-=— G00 0 100 200 300 ©400~=—s $00 600 
Survival time (days) Survival time (days) 


(a) (b) 
FIGURE 2.25 (a) Histogram and (b) Normal probability plot of the guinea pig survival data. 


For Practice Try Exercise 


THINK How can we determine shape from a Normal probability 
plot? Look at the Normal probability plot of the guinea pig survival data in 

ABOUT IT Figure 2.25(b). Imagine drawing a line through the leftmost points, which cor- 
respond to the smaller observations. The larger observations fall systematically 
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oe, 


to the right of this line. That is, the right-of-center observa- 
tions have much larger values than expected based on their 
percentiles and the corresponding z-scores from the standard 
Normal distribution. 

This Normal probability plot indicates that the guinea 
pig survival data are strongly right-skewed. In a right-skewed 
distribution, the largest observations fall distinctly to the right 
of a line drawn through the main body of points. Similarly, 
left skewness is evident when the smallest observations fall 
to the left the line. 


Expected z-score 


0 100 =. 200 300 400 500 600 
Survival time (days) 


$A 


If you’re wondering how to make a Normal probability plot on your calculator, 
the following Technology Corner shows you the process. 


TECHNOLOGY Qe MAL PROBABILITY PLOTS 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


‘To make a Normal probability plot for a set of quantitative data: 
e Enter the data values in LI /listl. We'll use the state unemployment rates data from page 122. 


e Define Plot! as shown. 


TI-83/84 


NORMAL FLOAT AUTO REAL RADIAN CL fo 
[ARES Plot2 Plot? 

Plot MUM ber! Plot 13 

List: 
Data Axis: a? 

Marke Box 

Store 25cores ko: 


ESC=CANCEL 


RAD AUTO FUME iff 


e Use ZoomStat (ZoomData on the TI-89) to see the finished graph. 


FE Ta TE 
Face/hesrapl en 


Ploti-La 


- 


wed. i uct -2. 32635 


USE €4t4 0R TYPE # CESCISCANCEL 


Y="2.326348 


Interpretation: The Normal probability plot is quite linear, so it is reasonable to believe that the data follow a Normal 
distribution. 


126 CHAPTER 2 MODELING DISTRIBUTIONS OF DATA 


DATA EXPLORATION The vending machine problem 


Have you ever purchased a hot drink from a vending machine? The intended 
sequence of events runs something like this. You insert your money into the ma- 
chine and select your preferred beverage. A cup falls out of the machine, landing 
upright. Liquid pours out until the cup is nearly full. You reach in, grab the piping- 
hot cup, and drink happily. 

Sometimes, things go wrong. The machine might swipe your money. Or the cup 
might fall over. More frequently, everything goes smoothly until the liquid begins to 
flow. It might stop flowing when the cup is only half full. Or the liquid might keep com- 
ing until your cup overflows. Neither of these results leaves you satisfied. 

The vending machine company wants to keep customers happy. So they have 
decided to hire you as a statistical consultant. They provide you with the following 
summary of important facts about the vending machine: 
¢ Cups will hold 8 ounces. 
e¢ The amount of liquid dispensed varies according to a Normal distribution 

centered at the mean yp that is set in the machine. 
e o =0.2 ounces. 

Ifa cup contains too much liquid, a customer may get burned from a spill. This 
could result in an expensive lawsuit for the company. On the other hand, custom- 
ers may be irritated if they get a cup with too little liquid from the machine. Given 
these issues, what mean setting for the machine would you recommend? Write a 
brief report to the vending machine company president that explains your answer. 


Do You Sudoku? 


In the chapter-opening Case Study (page 83), one of the authors played 
an online game of sudoku. At the end of his game, the graph on the next 
page was displayed. The density curve shown was constructed from a his- 
togram of times from 4,000,000 games played in one week at this Web 
site. You will now use what you have learned in this chapter to analyze 
how well the author did. 


State and interpret the percentile for the author’s time of 
3 minutes and 19 seconds. (Remember that smaller times 
indicate better performance.) 

Explain why you cannot find the z-score corresponding to 
the author’s time. 

Suppose the author’s time to finish the puzzle had been 5 
minutes and 6 seconds instead. 

(a) Would his percentile be greater than 50%, equal to 
50%, or less than 50%? Justify your answer. 

(b) Would his z-score be positive, negative, or zero? Explain. 


0 min 
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Your time: 3 minutes, 19 seconds 4. From long experience, the author’s times to fin- 


4 


Rank: Top 19% 


ish an easy sudoku puzzle at this Web site follow 
a Normal distribution with mean 4.2 minutes and 
standard deviation 0.7 minutes. In what percent of 
the games that he plays does the author finish an 
easy puzzle in less than 3 minutes and 15 seconds? 
Show your work. (Note: 3 minutes and 15 seconds 

is not the same as 3.15 seconds!) 
5. The author’s wife also enjoys playing sudoku 
online. Her times to finish an easy puzzle at this 
30 mins Web site follow a Normal distribution with mean 
3.8 minutes and standard deviation 0.9 minutes. In 
her most recent game, she finished in 3 minutes. 
Whose performance is better, relatively speaking: 


Easy level average time: 5 minutes, 6 seconds the author’s 3 minutes and 19 seconds or his wife’s 


3 minutes? Justify your answer. 


Summary 


We can describe the overall pattern of a distribution by a density curve. A 
density curve always remains on or above the horizontal axis and has total area | 
underneath it. An area under a density curve gives the proportion of observations 
that fall in an interval of values. 


A density curve is an idealized description of the overall pattern of a distribution 
that smooths out the irregularities in the actual data. We write the mean of 
a density curve as js and the standard deviation of a density curve as o to 
distinguish them from the mean x and the standard deviation s, of the actual data. 


The mean and the median of a density curve can be located by eye. The mean ju 
is the balance point of the curve. The median divides the area under the curve in 
half. The standard deviation o cannot be located by eye on most density curves. 


The mean and median are equal for symmetric density curves. The mean of a 
skewed curve is located farther toward the long tail than the median is. 


The Normal distributions are described by a special family of bell-shaped, 
symmetric density curves, called Normal curves. The mean ju and standard 
deviation o completely specify a Normal distribution N(y,c). The mean is the 
center of the curve, and @ is the distance from jy to the change-of-curvature 
points on either side. 

All Normal distributions obey the 68—95-99.7 rule, which describes what percent 
of observations lie within one, two, and three standard deviations of the mean. 


All Normal distributions are the same when measurements are standardized. If 
x follows a Normal distribution with mean yp and standard deviation o, we can 
standardize using 
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‘The variable z has the standard Normal distribution with mean 0 and stan- 
dard deviation 1. 


e Table A at the back of the book gives percentiles for the standard Normal 
curve. By standardizing, we can use Table A to determine the percentile 
for a given z-score or the z-score corresponding to a given percentile in any 
Normal distribution. You can use your calculator or the Normal Curve applet 
to perform Normal calculations quickly. 

e To perform certain inference procedures in later chapters, we will need to 
know that the data come from populations that are approximately Normally 
distributed. To assess Normality for a given set of data, we first observe the 
shape of a dotplot, stemplot, or histogram. Then we can check how well the 
data fit the 68—95-99.7 rule for Normal distributions. Another good method 
for assessing Normality is to construct a Normal probability plot. 


2.2) TECHNOLOGY 
CORNERS 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


5. From z-scores to areas, and vice versa 
6. Normal probability plots 


Exercises 


33. Density curves Sketch a density curve that might de- (b) The proportion of accidents that occur in the first 
scribe a distribution that is symmetric but has two peaks. mile of the path is the area under the density curve 


5 j He the ? 
34. Density curves Sketch a density curve that might Between Umiles/and Vmile Whatistusared: 


describe a distribution that has a single peak and is (c) Sue’s property adjoins the bike path between the 
skewed to the left. 0.8 mile mark and the 1.1 mile mark. What 
proportion of accidents happen in front of Sue’s 


Exercises 35 to 38 involve a special type of density curve — proper? Paola: 


one that takes constant height (looks like a horizontal line) 


over some interval of values. ‘This density curve describes a 36. Where’s the bus? Sally takes the same bus to work 
variable whose values are distributed evenly (uniformly) over every morning. The amount of time (in minutes) that 
some interval of values. We say that such a variable has a she has to wait for the bus to arrive is described by the 
uniform distribution. uniform distribution below. 


35. Biking accidents Accidents on a level, 3-mile bike 
path occur uniformly along the length of the path. 


The figure below displays the density curve that Height = 
describes the uniform distribution of accidents. os 
Height = 1/3 
0 t 2 3 (a) Explain why this curve satisfies the two requirements 
Distance along bike path (miles) fora density curve. 
(a) Explain why this curve satisfies the two requirements (b) On what percent of days does Sally have to wait more 


for a density curve. than 8 minutes for the bus? 
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(c) On what percent of days does Sally wait between 2.5 44. Potato chips Refer to Exercise 42. Use the 68-95— 
and 5.3 minutes for the bus? 99.7 rule to answer the following questions. Show 
your work! 


37. Biking accidents What is the mean p of the density 
curve pictured in Exercise 35? (‘That is, where would (a) Between what weights do the middle 68% of bags fall? 
the curve balance?) What is the median? (‘That is, 
where is the point with area 0.5 on either side?) 


38. Where’s the bus? What is the mean yu of the density (c) What percent of 9-ounce bags of this brand of potato 
curve pictured in Exercise 36? What is the median? chips weigh between 8.97 and 9.17 ounces? 


(b) What percent of bags weigh less than 9.02 ounces? 


39. Mean and median ‘The figure below displays two (d) A bag that weighs 9.07 ounces is at what percentile in 
density curves, each with three points marked. At this distribution? 
which of these points on each curve do the mean and ae . 
shermcdian tale 45. Estimating SD The figure below shows two Nor- 


mal curves, both with mean 0. Approximately what 


[IN IIN/|\ is the standard deviation of each of these curves? 
ABC A B Cc 


(a) (b) 


40. Mean and median ‘The figure below displays two 
density curves, each with three points marked. At 
which of these points on each curve do the mean and 
the median fall? 


VAIN yt -16 -12 -08 -04 0 O24 08 a is 
ABA AB C 


46. A Normal curve E'stimate the mean and standard de- 
(a) (b) viation of the Normal density curve in the figure below. 


41. Men’s heights The distribution of heights of adult 
American men is approximately Normal with mean 
69 inches and standard deviation 2.5 inches. Draw an 
accurate sketch of the distribution of men’s heights. 
Be sure to label the mean, as well as the points 1, 2, 
and 3 standard deviations away from the mean on the 
horizontal axis. 


42. Potato chips The distribution of weights of 9-ounce 
bags of a particular brand of potato chips is ap- 
proximately Normal with mean pp = 9.12 ounces 
and standard deviation ¢ = 0.05 ounce. Draw an 
accurate sketch of the distribution of potato chip bag 
weights. Be sure to label the mean, as well as the 
points 1, 2, and 3 standard deviations away from the 
mean on the horizontal axis. 


43. Men’s heights Refer to Exercise +1. Use the 68-95— 


ape acre ae ee ee | ee | aT ee | 
as 4 3 OF & GO Woh wil ae ales ale} ile they 1l7/ 


11 99.7 rule to answer the following questions. Show your For Exercises 47 to 50, use Table A to find the proportion 
) work! of observations from the standard Normal distribution that 


(a) Between what heights do the middle 95% of men fall? satisfies each of the following statements. In each case, 
(ib) Whi ceetoineneetullia tha: Minch sketch a standard Normal curve and shade the area under 
( 


the curve that is the answer to the question. 
c) What percent of men are between 64 and 66.5 inches 
tall? 47. ‘Table A practice 


(d) A height of 71.5 inches corresponds to what percen- (a) z< 2.85 (c) z> —1.66 
tile of adult male American heights? (b) z > 2.85 (d) —1.66<z<2.85 
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48. ‘Table A practice 

@) 2< 246 (c) 0.89 <z< 2.46 

(b) z> 2.46 (Gl) =2.95 <2< S27 
49. More Table A practice 

(a) zis between —1.33 and 1.65 

(b) zis between 0.50 and 1.79 

50. More Table A practice 

(a) zis between —2.05 and 0.78 

(b) zis between —1.1] and —0.32 


For Exercises 51 and 52, use Table A to find the value z 
from the standard Normal distribution that satisfies each of 
the following conditions. In each case, sketch a standard 
Normal curve with your value of z marked on the axis. 


51. Working backward 
(a) The 10th percentile. 
(b) 34% of all observations are greater than z. 
52. Working backward 
(a) The 63rd percentile. 
(b) 75% of all observations are greater than z. 


53. Length of pregnancies The length of human preg- 
nancies from conception to birth varies according 
to a distribution that is approximately Normal with 
mean 266 days and standard deviation 16 days. 


(a) At what percentile is a pregnancy that lasts 240 days 
(that’s about 8 months)? 


(b) What percent of pregnancies last between 240 
and 270 days (roughly between 8 months and 
9 months)? 


(c) How long do the longest 20% of pregnancies last? 


54. IQ test scores Scores on the Wechsler Adult Intel- 
ligence Scale (a standard IQ test) for the 20 to 34 age 
group are approximately Normally distributed with 
= 110 ando = 25. 


(a) At what percentile is an IQ score of 150? 


(b) What percent of people aged 20 to 34 have IQs 
between 125 and 150? 


(c) MENSA is an elite organization that admits as mem- 
bers people who score in the top 2% on IQ tests. 
What score on the Wechsler Adult Intelligence Scale 
would an individual aged 20 to 34 have to earn to 
qualify for MENSA membership? 


55. Puta lid on it! At some fast-food restaurants, custom- 
ers who want a lid for their drinks get them from a 
large stack left near straws, napkins, and condiments. 


MODELING DISTRIBUTIONS OF DATA 


The lids are made with a small amount of flexibility 
so they can be stretched across the mouth of the 

cup and then snugly secured. When lids are too 
small or too large, customers can get very frustrated, 
especially if they end up spilling their drinks. At one 
particular restaurant, large drink cups require lids 
with a “diameter” of between 3.95 and 4.05 inches. 
The restaurant’s lid supplier claims that the diameter 
of their large lids follows a Normal distribution with 
mean 3.98 inches and standard deviation 0.02 inches. 
Assume that the supplier’s claim is true. 


(a) What percent of large lids are too small to fit? Show 
your method. 


(b) What percent of large lids are too big to fit? Show 
your method. 


(c) Compare your answers to parts (a) and (b). Does it 
make sense for the lid manufacturer to try to make 
one of these values larger than the other? Why or why 
not? 


56. I think I can! An important measure of the perfor- 
mance of a locomotive is its “adhesion,” which is the 
locomotive’s pulling force as a multiple of its weight. 
The adhesion of one 4400-horsepower diesel locomo- 
tive varies in actual use according to a Normal distri- 
bution with mean ys = 0.37 and standard deviation 


a = 0.04. 


(a) Fora certain small train’s daily route, the locomotive 
needs to have an adhesion of at least 0.30 for the train 
to arrive at its destination on time. On what propor- 
tion of days will this happen? Show your method. 


(b) An adhesion greater than 0.50 for the locomotive will 
result in a problem because the train will arrive too 
early at a switch point along the route. On what pro- 
portion of days will this happen? Show your method. 


(c) Compare your answers to parts (a) and (b). Does it 
make sense to try to make one of these values larger 
than the other? Why or why not? 


57. Puta lid on it! Refer to Exercise 55. The supplier is 
considering two changes to reduce the percent of its 
large-cup lids that are too small to 1%. One strategy is 
to adjust the mean diameter of its lids. Another option 
is to alter the production process, thereby decreasing 
the standard deviation of the lid diameters. 


(a) If the standard deviation remains at 0 = 0.02 inches, 
at what value should the supplier set the mean diam- 
eter of its large-cup lids so that only 1% are too small 
to fit? Show your method. 


(b) Ifthe mean diameter stays at 4 = 3.98 inches, what 
value of the standard deviation will result in only 1% 
of lids that are too small to fit? Show your method. 
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(c) Which of the two options in parts (a) and (b) do you 


58. 


59. 


60. 


61. 


2s 


think is preferable? Justify your answer. (Be sure to 
consider the effect of these changes on the percent of 
lids that are too large to fit.) 


I think I can! Refer to Exercise 56. The locomotive’s 
manufacturer is considering two changes that could 
reduce the percent of times that the train arrives late. 
One option is to increase the mean adhesion of the 
locomotive. The other possibility is to decrease the 
variability in adhesion from trip to trip, that is, to 
reduce the standard deviation. 


If the standard deviation remains at o = 0.04, to what 
value must the manufacturer change the mean adhe- 
sion of the locomotive to reduce its proportion of late 
arrivals to only 2% of days? Show your work. 


If the mean adhesion stays at ys = 0.37, how much 
must the standard deviation be decreased to ensure 
that the train will arrive late only 2% of the time? 
Show your work. 


Which of the two options in parts (a) and (b) do you 
think is preferable? Justify your answer. (Be sure to 
consider the effect of these changes on the percent of 
days that the train arrives early to the switch point.) 


Deciles The deciles of any distribution are the values 
at the 10th, 20th, ... , 90th percentiles. ‘The first and 
last deciles are the 10th and the 90th percentiles, 
respectively. 


What are the first and last deciles of the standard Nor- 
mal distribution? 


The heights of young women are approximately Nor- 
mal with mean 64.5 inches and standard deviation 2.5 
inches. What are the first and last deciles of this distri- 
bution? Show your work. 


Outliers The percent of the observations that are 
classified as outliers by the 1.5 X IQR rule is the same 
in any Normal distribution. What is this percent? 
Show your method clearly. 


Flight times An airline flies the same route at the 
same time each day. The flight time varies according to 
a Normal distribution with unknown mean and stan- 
dard deviation. On 15% of days, the flight takes more 
than an hour. On 3% of days, the flight lasts 75 minutes 
or more. Use this information to determine the mean 
and standard deviation of the flight time distribution. 


Brush your teeth ‘The amount of time Ricardo 
spends brushing his teeth follows a Normal distribu- 
tion with unknown mean and standard deviation. 
Ricardo spends less than one minute brushing his 
teeth about 40% of the time. He spends more than 


63. 
a] 124 
x2) 
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two minutes brushing his teeth 2% of the time. Use 
this information to determine the mean and standard 
deviation of this distribution. 


Sharks Here are the lengths in feet of 44 great white 
sharks: !! 


18.7 12.3 186 16.4 15.7 183 14.6 15.8 14.9 17.6 12.1 
16.4 16.7 17.8 16.2 12.6 17.8 13.8 12.2 15.2 14.7 12.4 
13.2 15.8 143 166 9.4 18.2 13.2 13.6 15.3 16.1 13.5 
Wh WG2 2s Wes) Ge! Ws WSL USL US 1is2 Te 


Enter these data into your calculator and make a histo- 
gram. Include a sketch of the graph on your paper. Then 
calculate one-variable statistics. Describe the shape, cen- 
ter, and spread of the distribution of shark lengths. 


Calculate the percent of observations that fall within 1, 
2, and 3 standard deviations of the mean. How do these 
results compare with the 68-95-99.7 rule? 


Use your calculator to construct a Normal probability 
plot. Include a sketch of the graph on your paper. Inter- 
pret this plot. 


Having inspected the data from several different per- 
spectives, do you think these data are approximately 
Normal? Write a brief summary of your assessment 
that combines your findings from parts (a) through (c). 


Density of the earth In 1798, the English scientist 
Henry Cavendish measured the density of the earth 
several times by careful work with a torsion balance. 
The variable recorded was the density of the earth as a 
multiple of the density of water. Here are Cavendish’s 
29 measurements:!” 


5.00 5.61 4.88 5.07 
DO S58 Ol 128) 
9.42 5.47 5.63 5.34 


3:265 9:509).9:36) 91299719198 5/65 
5.44 5.34 5.79 5.10 5.27 5.39 
5.46 5.30 5.75 5.68 5.85 


Enter these data into your calculator and make a his- 
togram. Include a sketch of the graph on your paper. 
Then calculate one-variable statistics. Describe the 
shape, center, and spread of the distribution of density 
measurements. 


Calculate the percent of observations that fall within 
1, 2, and 3 standard deviations of the mean. How do 
these results compare with the 68-95-99.7 rule? 


Use your calculator to construct a Normal probability 
plot. Include a sketch of the graph on your paper. Inter- 
pret this plot. 


Having inspected the data from several different 
perspectives, do you think these data are approxi- 
mately Normal? Write a brief summary of your as- 
sessment that combines your findings from parts (a) 


through (c). 
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65. Runners’ heart rates ‘The figure below is a Normal 
probability plot of the heart rates of 200 male runners 
after six minutes of exercise on a treadmill.!® The 
distribution is close to Normal. How can you see 
this? Describe the nature of the small deviations from 
Normality that are visible in the plot. 
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Heart rate (beats per minute) 


66. Carbon dioxide emissions ‘The figure below is a 
Normal probability plot of the emissions of carbon 
dioxide per person in 48 countries.'* In what ways is 
this distribution non-Normal? 
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67. Is Michigan Normal? We collected data on the tuition 
charged by colleges and universities in Michigan. 
Here are some numerical summaries for the data: 


Mean Std. Dev. Min Max 
10614 8049 1873 30823 


Based on the relationship between the mean, stan- 

dard deviation, minimum, and maximum, is it rea- 
sonable to believe that the distribution of Michigan 
tuitions is approximately Normal? Explain. 


68. Weights aren’t Normal The heights of people of the 
same gender and similar ages follow Normal distribu- 
tions reasonably closely. Weights, on the other hand, 
are not Normally distributed. The weights of women 
aged 20 to 29 have mean 141.7 pounds and median 
133.2 pounds. The first and third quartiles are 118.3 
pounds and 157.3 pounds. What can you say about 
the shape of the weight distribution? Why? 


Multiple choice: Select the best answer for Exercises 69 to 74. 


69. ‘Two measures of center are marked on the density 
curve shown. Which of the following is correct? 


(a) The median is at the yellow line and the mean is at 
the red line. 

(b) The median is at the red line and the mean is at the 
yellow line. 

(c) The mode is at the red line and the median is at the 
yellow line. 

(d) The mode is at the yellow line and the median is at 
the red line. 

(e) The mode is at the red line and the mean is at the 
yellow line. 

Exercises 70 to 72 refer to the following setting. The 

weights of laboratory cockroaches follow a Normal dis- 

tribution with mean 80 grams and standard deviation 2 

grams. The following figure is the Normal curve for this 

distribution of weights. 


70. 
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Point C on this Normal curve corresponds to 


84 grams. (c) 78 grams. (e) 74 grams. 


82 grams. (d) 76 grams. 


. About what percent of the cockroaches have weights 


between 76 and 84 grams? 


99.7% (c) 68% (e) 34% 
95% (d) 47.5% 
. About what proportion of the cockroaches will have 


weights greater than 83 grams? 


0.0228 (c) 0.1587 (e) 0.0772 
0.0668 (d) 0.9332 


. A different species of cockroach has weights that 


follow a Normal distribution with a mean of 50 
grams. After measuring the weights of many of these 
cockroaches, a lab assistant reports that 14% of the 
cockroaches weigh more than 55 grams. Based on 
this report, what is the approximate standard deviation 
of weights for this species of cockroaches? 


46 (d) 14.0 
5.0 (e) Cannot determine without more information. 


6.2 


. The following Normal probability plot shows the 


distribution of points scored for the 551 players in the 
2011-2012 NBA season. 


Probability Plot of Points 
Normal 


? il 


Expected z-score 
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If the distribution of points was displayed in a his- 
togram, what would be the best description of the 
histogram’s shape? 


Approximately Normal 

Symmetric but not approximately Normal 
Skewed left 

Skewed right 


Cannot be determined 


. Gas it up! (1.3) Interested in a sporty car? Worried 


that it might use too much gas? The Environmen- 
tal Protection Agency lists most such vehicles in 

its “two-seater” or “minicompact” categories. The 
figure shows boxplots for both city and highway gas 
mileages for our two groups of cars. Write a few 
sentences comparing these distributions. 


40 4 
354 


305) 


Miles per gallon 


T T T 
Two city Twohwy Minicity Minihwy 


. Python eggs (1.1) How is the hatching of water 


python eggs influenced by the temperature of the 
snake’s nest? Researchers assigned newly laid eggs 
to one of three temperatures: hot, neutral, or cold. 
Hot duplicates the extra warmth provided by the 
mother python, and cold duplicates the absence 
of the mother. Here are the data on the number of 
eggs and the number that hatched:! 


Cold Neutral Hot 
Number of eggs 27 56 104 
Number hatched 16 38 ifs 


Make a two-way table of temperature by outcome 
(hatched or not). 


Calculate the percent of eggs in each group that 
hatched. The researchers believed that eggs would be 
less likely to hatch in cold water. Do the data support 
that belief? 
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Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


The distribution of scores on a recent test closely fol- 
lowed a Normal distribution with a mean of 22 points and 
a standard deviation of 4 points. 

(a) What proportion of the students scored at least 25 

points on this test? 

(b) What is the 31st percentile of the distribution of 

test scores? 

(c) ‘The teacher wants to transform the test scores so that 

they have an approximately Normal distribution with 
a mean of 80 points and a standard deviation of 10 
points. ‘To do this, she will use a formula in the form: 


new score = a + b (old score) 


Find the values of a and 5 that the teacher should 
use to transform the distribution of test scores. 

(d) Before the test, the teacher gave a review assign- 
ment for homework. The maximum score on 
the assignment was 10 points. The distribution 
of scores on this assignment had a mean of 9.2 
points and a standard deviation of 2.1 points. 
Would it be appropriate to use a Normal distri- 
bution to calculate the proportion of students 
who scored below 7 points on this assignment? 
Explain. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements 
would you suggest to the student who wrote it? Finally, your teach- 
er will provide you with a scoring rubric. Score your response and 
note what, if anything, you would do differently to improve your 
own score. 


Chapter Review 


Section 2.1: Describing Location in a Distribution 


In this section, you learned two different ways to describe 
the location of individuals in a distribution, percentiles 
and standardized scores (z-scores). Percentiles describe 
the location of an individual by measuring what percent 
of the observations in the distribution have a value less 
than the individual’s value. A cumulative relative fre- 
quency graph is a handy tool for identifying percentiles 
in a distribution. You can use it to estimate the percentile 
for a particular value of a variable or estimate the value of 
the variable at a particular percentile. 

Standardized scores (z-scores) describe the location of 
an individual in a distribution by measuring how many 
standard deviations the individual is above or below the 
mean. ‘To find the standardized score for a particular ob- 


servation, transform the value by subtracting the mean 
and dividing the difference by the standard deviation. Be- 
sides describing the location of an individual in a distri- 
bution, you can also use z-scores to compare observations 
from different distributions—standardizing the values 
puts them on a standard scale. 

You also learned to describe the effects on the shape, 
center, and spread of a distribution when transforming 
data from one scale to another. Adding a positive constant 
to (or subtracting it from) each value in a data set changes 
the measures of location but not the shape or spread of 
the distribution. Multiplying or dividing each value in a 
data set by a positive constant changes the measures of 
location and measures of spread but not the shape of the 
distribution. 


Section 2.2: Density Curves and Normal Distributions 


In this section, you learned how density curves are used to 
model distributions of data. An area under a density curve 
gives the proportion of observations that fall in a specified 
interval of values. ‘The total area under a density curve is 
1, or 100%. 

The most commonly used density curve is called a 
Normal curve. The Normal curve is symmetric, single- 
peaked, and bell-shaped with mean y and standard devia- 
tion o. For any distribution of data that is approximately 
Normal in shape, about 68% of the observations will be 
within | standard deviation of the mean, about 95% of 
the observations will be within 2 standard deviations of 
the mean, and about 99.7% of the observations will be 
within 3 standard deviations of the mean. Conveniently, 
this relationship is called the 68—95—99.7 rule. 

When observations do not fall exactly 1, 2, or 3 standard 
deviations from the mean, you learned how to use ‘Table A 
(or technology) to identify the proportion of values in any 


What Did You Learn? 


Learning Objective 


Section 


specified interval under a Normal curve. You also learned 
how to use ‘Table A (or technology) to determine the val- 
ue of an individual that falls at a specified percentile in 
a Normal distribution. On the AP® exam, it is extremely 
important that you clearly communicate your methods 
when answering questions that involve the Normal distri- 
bution. You must specify the shape (Normal), center (js), 
and spread (¢) of the distribution; identify the region under 
the Normal curve that you are working with; and correctly 
calculate the answer with work shown. Shading a Normal 
curve with the mean, standard deviation, and boundaries 
clearly identified is a great start. 

Finally, you learned how to determine whether a distri- 
bution of data is approximately Normal using graphs (dot- 
plots, stemplots, histograms) and the 68—95-99.7 rule. You 
also learned that a Normal probability plot is a great way to 
determine whether the shape of a distribution is approxi- 
mately Normal. ‘The more linear the Normal probability 
plot, the more Normal the distribution of the data. 


Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Find and interpret the percentile of an individual 
value within a distribution of data. 


86 R2.1 


Estimate percentiles and individual values using a 
cumulative relative frequency graph. 


87, 88 R2.2 


Find and interpret the standardized score (z-score) of 
an individual value within a distribution of data. 


90, 91 R2.1 


Describe the effect of adding, subtracting, multiplying by, 
or dividing by a constant on the shape, center, and spread 
of a distribution of data. 


93, 94, 95 


Estimate the relative locations of the median and 
mean on a density curve. 


Discussion on 
106-107 


Use the 68-95—99.7 rule to estimate areas 
(proportions of values) in a Normal distribution. 


WW 


Use Table A or technology to find (i) the proportion of 
z-values in a specified interval, or (ii) a Zscore from a 
percentile in the standard Normal distribution. 


IA, Ws 
Discussion on 116 


Use Table A or technology to find (i) the proportion of 
values in a specified interval, or (ii) the value that corresponds 
to a given percentile in any Normal distribution. 


R2.7, R2.8, R2.9 


118, 119, 120 


Determine whether a distribution of data is approximately Normal 
from graphical and numerical evidence. 


122, 123, 124 R2.10, R2.11 


136 
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Chapter 2 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R2.1 


RZ2Z 


(a) 


(b) 


R2.3 


Variable n 


Guess 


Is Paul tall? According to the National Center for 
Health Statistics, the distribution of heights for 
15-year-old males has a mean of 170 centimeters 
(cm) and a standard deviation of 7.5 cm. Paul is 15 
years old and 179 cm tall. 


Find the z-score corresponding to Paul’s height. Ex- 
plain what this value means. 


Paul’s height puts him at the 85th percentile among 
15-year-old males. Explain what this means to some- 
one who knows no statistics. 


Computer use Mrs. Causey asked her students how 
much time they had spent using a computer during the 
previous week. The following figure shows a cumula- 
tive relative frequency graph of her students’ responses. 


At what percentile does a student who used her com- 
puter for 7 hours last week fall? 


Estimate the interquartile range (IQR) from the 
graph. Show your work. 


100 + 
~ 0- 
3s 
—& eo 
i} 
iI 
2570 
is 
i") 
& 60-7 
Es 
= 504 
2 
e 40-4 
ae 
& 30-7 
| 
EF 204 
Ss) 

104 


Hours per week 


Aussie, Aussie, Aussie A group of Australian students 
were asked to estimate the width of their classroom in 
feet. Use the dotplot and summary statistics below to 
answer the following questions. 


Mean Stdev Minimum Q, 


wie 156 LEW} aA 0) 


Median Q3 


24.00 35.50 42.00 48.00 


Maximum 


94.00 


(a) Suppose we converted each student’s guess from feet 


= 


to meters (3.28 ft = 1 m). How would the shape of 
the distribution be affected? Find the mean, median, 
standard deviation, and JOR for the transformed data. 


The actual width of the room was 42.6 feet. Suppose 
we calculated the error in each student’s guess as fol- 
lows: guess — 42.6. Find the mean and standard de- 
viation of the errors. Justify your answers. 


R2.4 What the mean means The figure below is a 


(a 


) 


density curve. Trace the curve onto your paper. 


Mark the approximate location of the median. E:x- 
plain your choice of location. 


(b) Mark the approximate location of the mean. Explain 


your choice of location. 


R2.5 Horse pregnancies Bigger animals tend to carry 


their young longer before birth. The length of horse 
pregnancies from conception to birth varies accord- 
ing to a roughly Normal distribution with mean 336 
days and standard deviation 3 days. Use the 68-95— 
99.7 rule to answer the following questions. 


(a) Almost all (99.7%) horse pregnancies fall in what in- 


terval of lengths? 


(b) What percent of horse pregnancies are longer than 


339 days? Show your work. 


R2.6 Standard Normal distribution Use Table A (or 


technology) to find each of the following for a stan- 
dard Normal distribution. In each case, sketch a stan- 
dard Normal curve and shade the area of interest. 


(a) The proportion of observations with —2.25 <z< 1.77 


(b) The number z such that 35% of all observations are 


greater than z 


R2.7 Low-birth-weight babies Researchers in Norway an- 


alyzed data on the birth weights of 400,000 newborns 
over a 6-year period. The distribution of birth weights 
is Normal with a mean of 3668 grams and a standard 
deviation of 511 grams.!° Babies that weigh less than 
2500 grams at birth are classified as “low birth weight.” 


(a) What percent of babies will be identified as low birth 


weight? Show your work. 


(b) Find the quartiles of the birth weight distribution. 


R2.8 


Lee 


ao 


Show your work. 


Ketchup A fast-food restaurant has just installed a 
new automatic ketchup dispenser for use in prepar- 
ing its burgers. ‘The amount of ketchup dispensed by 
the machine follows a Normal distribution with mean 
1.05 ounces and standard deviation 0.08 ounce. 


If the restaurant’s goal is to put between | and 1.2 
ounces of ketchup on each burger, what percent of 
the time will this happen? Show your work. 


Suppose that the manager adjusts the machine’s set- 
tings so that the mean amount of ketchup dispensed 
is 1.1 ounces. How much does the machine’s stan- 
dard deviation have to be reduced to ensure that at 
least 99% of the restaurant’s burgers have between | 
and 1.2 ounces of ketchup on them? 


R2.9 Grading managers Many companies “grade on a 


bell curve” to compare the performance of their man- 
agers and professional workers. ‘This forces the use of 
some low performance ratings, so that not all workers 
are listed as “above average.” Ford Motor Compa- 
ny’s “performance management process” for a time 
assigned 10% A grades, 80% B grades, and 10% C 
grades to the company’s 18,000 managers. Suppose 
that Ford’s performance scores really are Normally 
distributed. ‘This year, managers with scores less than 
25 received C’s, and those with scores above 475 
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received A’s. What are the mean and standard de- 
viation of the scores? Show your work. 


R2.10 Fruit fly thorax lengths Here are the lengths in 
millimeters of the thorax for 49 male fruit flies:!” 


0.64 0.64 0.64 0.68 0.68 0.68 0.72 0.72 0.72 0.72 0.74 0.76 0.76 
0.76 0.76 0.76 0.76 0.76 0.76 0.78 0.80 0.80 0.80 0.80 0.80 0.82 
0.82 0.84 0.84 0.84 0.84 0.84 0.84 0.84 0.84 0.84 0.84 0.88 0.88 
0.88 0.88 0.88 0.88 0.88 0.88 0.92 0.92 0.92 0.94 


Are these data approximately Normally distributed? 
Give appropriate graphical and numerical evidence 
to support your answer. 

R2.11 Assessing Normality A Normal probability plot of a 


set of data is shown here. Would you say that these 
measurements are approximately Normally distrib- 


uted? Why or why not? 
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Data values 


Chapter 2 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


T2.1 Many professional schools require applicants to take 


a standardized test. Suppose that 1000 students take 
such a test. Several weeks after the test, Pete receives 
his score report: he got a 63, which placed him at the 
73rd percentile. This means that 


(a) Pete’s score was below the median. 


(b) Pete did worse than about 63% of the test takers. 
(c 
(d 


(e 


Pete did worse than about 73% of the test takers. 
Pete did better than about 63% of the test takers. 
Pete did better than about 73% of the test takers. 


ee ee 
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(a) 
123 


(a) 
12> 


CHAPTER 2 


For the Normal distribution shown, the standard de- 
viation is closest to 


0 (bl (2 (3. (e)5 


Rainwater was collected in water collectors at 30 differ- 
ent sites near an industrial complex, and the amount 
of acidity (pH level) was measured. ‘The mean and 
standard deviation of the values are 4.60 and 1.10, 
respectively. When the pH meter was recalibrated 
back at the laboratory, it was found to be in error. The 
error can be corrected by adding 0.1 pH units to all 
of the values and then multiplying the result by 1.2. 
The mean and standard deviation of the corrected pH 
measurements are 


DOr lA (eo); 5.40), 4 (e647 120) 
bioral 32 (a) S400 1632 
The figure shows a cumulative relative frequency 


graph of the number of ounces of alcohol consumed 
per week in a sample of 150 adults who report drink- 
ing alcohol occasionally. About what percent of these 
adults consume between 4 and 8 ounces per week? 
100 4 


80 + 


604 


Percent 


Consumption (0z) 


20% (b) 40% (c) 50% (d) 60% (e) 80% 

The average yearly snowfall in Chillyville is Normally 
distributed with a mean of 55 inches. If the snowfall 
in Chillyville exceeds 60 inches in 15% of the years, 


what is the standard deviation? 


(a) 4.83 inches 
(b) 5.18 inches 
(c) 6.04 inches 
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(d) 8.93 inches 


(e) The standard deviation 
cannot be computed from 
the given information. 


12.6 The figure shown is the density curve of a distribution. 


Seven values are marked on the density curve. Which 
of the following statements is true? 


nA 


A B Cc I) eh le 


(a) ‘The mean of the distribution is E. 


(b 
(c 
(d 
( 


e 


) 
) 
) 


) 
) 


The area between B and F is 0.50. 

The median of the distribution is C. 
The 3rd quartile of the distribution is D. 
The area between A and G is 1. 


12.7 Ifthe heights of a population of men follow a Normal 


(a) |? 


distribution, and 99.7% have heights between 5’0” 
and 7'(0”, what is your estimate of the standard devia- 
tion of the heights in this population? 


(b) 3” (c) 4” —() "—(e) 12” 


12.8 Which of the following is not correct about a standard 


Normal distribution? 


(a) ‘The proportion of scores that satisfy 0 < z < 1.5 is 


(b 
(c 
(d 
(e 


Se see 


G2332, 

The proportion of scores that satisfy z << —1.0 is 0.1587. 
The proportion of scores that satisfy z > 2.0 is 0.0228. 
The proportion of scores that satisfy z < 1.5 is 0.9332. 
The proportion of scores that satisfy z > —3.0 is 0.9938. 


Questions T2.9 and T2. 10 refer to the following setting. Until the 
scale was changed in 1995, SAT scores were based on a scale 
set many years ago. For Math scores, the mean under the old 
scale in the 1990s was 470 and the standard deviation was 110. 
In 2009, the mean was 515 and the standard deviation was 116. 


T2.9 What is the standardized score (z-score) for a student 
who scored 500 on the old SAT scale? 


G30) =0.27 ser =U wa) O11) fey 0127 


T2.10 Gina took the SAT in 1994 and scored 500. Her 
cousin Colleen took the SAT in 2013 and scored 
530. Who did better on the exam, and how can you 
tell? 
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(a) Colleen—she scored 30 points higher than Gina. 
(b) Colleen—her standardized score is higher than Gina’s. 


) 
) 
(c) Gina—her standardized score is higher than Colleen’s. 
(d) Gina—the standard deviation was bigger in 2013. 

) 


(e) The two cousins did equally well—their z-scores are 
the same. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T2.11 As part of the President’s Challenge, students can 
attempt to earn the Presidential Physical Fitness 
Award or the National Physical Fitness Award by 
meeting qualifying standards in five events: curl-ups, 
shuttle run, sit and reach, one-mile run, and pull- 
ups. The qualifying standards are based on the 1985 
School Population Fitness Survey. For the Presiden- 
tial Award, the standard for each event is the 85th 
percentile of the results for a specific age group and 
gender among students who participated in the 1985 
survey. For the National Award, the standard is the 
50th percentile. ‘To win either award, a student must 
meet the qualifying standard for all five events. 

Jane, who is 9 years old, did 40 curl-ups in one 
minute. Matt, who is 12 years old, also did 40 curl- 
ups in one minute. The qualifying standard for the 
Presidential Award is 39 curl-ups for Jane and 50 
curl-ups for Matt. For the National Award, the stan- 
dards are 30 and 40, respectively. 


— 
p 
a 


Compare Jane’s and Matt’s performances using 
percentiles. Explain in language simple enough for 
someone who knows little statistics to understand. 


(b) Who has the higher standardized score (score), 
Jane or Matt? Justify your answer. 


12.12 The army reports that the distribution of head cir- 


cumference among male soldiers is approximately 


Normal with mean 22.8 inches and standard devia- 
tion 1.1 inches. 


(a) A male soldier whose head circumference is 23.9 
inches would be at what percentile? Show your 
method clearly. 


(b 


7 


The army’s helmet supplier regularly stocks hel- 
mets that fit male soldiers with head circumfer- 
ences between 20 and 26 inches. Anyone with a 
head circumference outside that interval requires 
a customized helmet order. What percent of male 
soldiers require custom helmets? 


(c 


WH 


Find the interquartile range for the distribution of 
head circumference among male soldiers. 


12.13 A study recorded the amount of oil recovered from 
the 64 wells in an oil field. Here are descriptive sta- 
tistics for that set of data from Minitab. 


Descriptive Statistics: Oilprod 
Variable on 


64 48.25 37.80 40.24 2.00 204.90 21.40 60.75 


Mean Median StDev Min Max on Q3 


Oilprod 


Does the amount of oil recovered from all wells in this 
field seem to follow a Normal distribution? Give ap- 
propriate statistical evidence to support your answer. 
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Describing 
Relationships 


How Faithful Ils Old Faithful? 


The Starnes family visited Yellowstone National Park in hopes of seeing the Old Faithful geyser erupt. 
They had only about four hours to spend in the park. When they pulled into the parking lot near Old 
Faithful, a large crowd of people was headed back to their cars from the geyser. Old Faithful had just 
finished erupting. How long would the Starnes family have to wait until the next eruption? 


Let’s look at some data. Figure 3.1 shows a his- 
togram of times (in minutes) between consecutive 
eruptions of Old Faithful in the month before the 
Starnes family’s visit. he shortest interval was 
47 minutes, and the longest was 113 minutes. 
That’s a lot of variability! The distribution has two 
clear peaks— one at about 60 minutes and the other 
at about 90 minutes. 

If the Starnes family hopes for a 60-minute 
gap between eruptions, but the actual interval is 
closer to 90 minutes, the kids will get impatient. 
If they plan for a 90-minute interval and go some- 
where else in the park, they won’t get back in time 
to see the next eruption if the gap is only about 
60 minutes. What should the Starnes family do? 


505 


Frequency 


" 


40 50 60 70 80 90 100 110 120 
Interval (minutes) 


FIGURE 3.1 Histogram of the interval (in minutes) between 
eruptions of the Old Faithful geyser in the month prior to the 
Starnes family’s visit. 
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Pe Introduction 


Investigating relationships between variables is central to what we do in statistics. 
When we understand the relationship between two variables, we can use the value 
of one variable to help us make predictions about the other variable. In Section 1.1, 
we explored relationships between categorical variables, such as the gender of a 
young person and his or her opinion about future income. The association between 
these two variables suggests that males are generally more optimistic about their 
future income than females. 

In this chapter, we investigate relationships between two quantitative variables. 
Does knowing the number of points a football team scores per game tell us any- 
thing about how many wins it will have? What can we learn about the price of a 
used car from the number of miles it has been driven? Are there any variables that 
might help the Starnes family predict how long it will be until the next eruption 
of Old Faithful? 


ACTIVITY | CSI Stats: 


The case of the missing cookies 


MATERIALS: Mrs. Hagen keeps a large jar full of cookies on her desk for her students. Over 
Meterstick, handprint, and the past few days, a few cookies have disappeared. The only people with access 
math department roster to Mrs. Hagen’s desk are the other math teachers at her school. She asks her col- 
(from Teacher’s Resource leagues whether they have been making withdrawals from the cookie jar. No one 
Materials) for each group of confesses to the crime. 

three to four students; one But the next day, Mrs. Hagen catches a break—she finds a clear handprint on 
sheet of graph paper per the cookie jar. The careless culprit has left behind crucial evidence! At this point, 
aie Mrs. Hagen calls in the CSI Stats team (your class) to help her identify the prime 


suspect in “The Case of the Missing Cookies.” 


1. Measure the height and hand span of each member of your group to the 
nearest centimeter (cm). (Hand span is the maximum distance from the tip of 
the thumb to the tip of the pinkie finger on a person’s fully stretched-out hand.) 


2. Your teacher will make a data table on the board with two columns, labeled 
as follows: 


Hand span (cm) Height (cm) 


Send a representative to record the data for each member of your group in the table. 


3. Copy the data table onto your graph paper very near the left margin of the 
page. Next, you will make a graph of these data. Begin by constructing a set of 
coordinate axes. Allow plenty of space on the page for your graph. Label the 
horizontal axis “Hand span (cm)” and the vertical axis “Height (cm).” 


4. Since neither hand span nor height can be close to 0 cm, we want to start 
our horizontal and vertical scales at larger numbers. Scale the horizontal 
axis in 0.5-cm increments starting with 15 cm. Scale the vertical axis in 5-cm 
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‘ increments starting with 135 cm. Refer to the sketch in the margin 
for comparison. 


5. Plot each point from your class data table as accurately as you can 


€ on the graph. Compare your graph with those of your group members. 
= 1504 
= 6. As a group, discuss what the graph tells you about the relationship 
z MS between hand span and height. Summarize your observations in a 
140 sentence or two. 
135 4 7. Ask your teacher for a copy of the handprint found at the scene 
ie gis os a and the math department roster. Which math teacher does your group 


Hiand/spun:(em) believe is the “prime suspect”? Justify your answer with appropriate 
statistical evidence. 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e — |dentify explanatory and response variables in situations e Interpret the correlation. 
where one variable helps to explain or influences the other. Understand the basic properties of correlation, 
Make a scatterplot to display the relationship between including how the correlation is influenced by outliers. 
two quantitative variables. Use technology to calculate correlation. 


Describe the direction, form, and strengthofa = Explain why association does not imply causation. 
relationship displayed in a scatterplot and identify outli- 


ers in a scatterplot. 


Most statistical studies examine data on more than one variable. Fortunately, 
analysis of several-variable data builds on the tools we used to examine individual 
variables. The principles that guide our work also remain the same: 


e Plot the data, then add numerical summaries. 
e Look for overall patterns and departures from those patterns. 


e¢ When there’s a regular overall pattern, use a simplified model to describe it. 


Explanatory and Response Variables 


We think that car weight helps explain accident deaths and that smoking influ- 
ences life expectancy. In these relationships, the two variables play different roles. 
Accident death rate and life expectancy are the response variables of interest. Car 
weight and number of cigarettes smoked are the explanatory variables. 


DEFINITION: Response variable, explanatory variable 


A response variable measures an outcome of a study. An explanatory variable 
may help explain or predict changes in a response variable. 
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You will often see explanatory variables It is easiest to identify explanatory and response variables when we actually 
called independent variables and specify values of one variable to see how it affects another variable. For instance, 
respolise variables'ralled Garendeny =, study the effect of alcohol on body temperature, researchers gave several dif- 
Hasek rend els ferent amounts of alcohol to mice. Then the d the cl | 
“independent” and “dependent” have y measured the change in each 
other meanings in statistics, we won't. | Mouse’s body temperature 15 minutes later. In this 
use them here. case, amount of alcohol is the explanatory variable, 
and change in body temperature is the response 
variable. When we don’t specify the values of either 
variable but just observe both variables, there may 
or may not be explanatory and response vari- 
ables. Whether there are depends on how 
you plan to use the data. aie a 


Linking SAT Math and Critical 


Reading Scores 


Explanatory or response? 


Julie asks, “Can I predict a state’s mean SAT Math score if I know its mean SAT 
Critical Reading score?” Jim wants to know how the mean SAT Math and Critical 
Reading scores this year in the 50 states are related to each other. 


PROBLEM: For each student, identify the explanatory variable and the response variable if possible. 


SOLUTION: Julie is treating the mean SAT Critical Reading score as the explanatory variable and 
the mean SAT Math score as the response variable. Jim is simply interested in exploring the relation- 
ship between the two variables. For him, there is no clear explanatory or response variable. 


For Practice Try Exercise 


In many studies, the goal is to show that changes in one or more explanatory 
variables actually cause changes in a response variable. However, other explanatory- 
response relationships don’t involve direct causation. In the alcohol and mice study, 
alcohol actually causes a change in body temperature. But there is no cause-and-effect 
relationship between SAT Math and Critical Reading scores. Because the scores are 
closely related, we can still use a state’s mean SAT Critical Reading score to predict its 
mean Math score. We will learn how to make such predictions in Section 3.2. 


CHECK YOUR UNDERSTANDING 


Identify the explanatory and response variables in each setting. 


1. How does drinking beer affect the level of alcohol in people’s blood? The legal limit 
for driving in all states is 0.08%. In a study, adult volunteers drank different numbers of 
cans of beer. Thirty minutes later, a police officer measured their blood alcohol levels. 


2. ‘The National Student Loan Survey provides data on the amount of debt for recent 
college graduates, their current income, and how stressed they feel about college debt. A 
sociologist looks at the data with the goal of using amount of debt and income to explain 
the stress caused by college debt. 
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Displaying Relationships: Scatterplots 


625 5 The most useful graph for displaying the relationship between 

600 | 23° two quantitative variables is a scatterplot. Figure 3.2 shows a 

: .* scatterplot of the percent of high school graduates in each state 
5 ee =, who took the SAT and the state’s mean SAT Math score in a 
= 550-4 ° eis recent year. We think that “percent taking” will help explain 
2 we. ae a ee ee ee “mean score.” So “percent taking” is the explanatory variable 
é em ‘| vs Ree es and “mean score” is the response variable. We want to see 
| . how mean score changes when percent taking changes, so 

ae . we put percent taking (the explanatory variable) on the hori- 

450 >. + —1— + —_-~Ss zontal axis. Each point represents a single state. In Colorado, 


0 100 20 30 40 0 80 for example, 21% took the SAT, and their mean SAT Math 
Percenbeaanginne score was 570. Find 21 on the x (horizontal) axis and 570 on 
FIGURE 3.2 Scatterplot of the the y (vertical) axis. Colorado appears as the point (21, 570). 
mean SAT Math score in each state 
against the percent of that state’s 
high school graduates who took the = DEFINITION: Scatterplot 
i i aotieg nes Inteeeer at A scatterplot shows the relationship between two quantitative variables measured 
the point (21, 570), the values for peal : : ; 
on the same individuals. The values of one variable appear on the horizontal axis, and 


Colorado. 
the values of the other variable appear on the vertical axis. Each individual in the data 
appears as a point in the graph. 
Here's a helpful way to remember: the Always plot the explanatory variable, if there is one, on the horizontal axis (the 
Sry analy Sea en ee an aNe x axis) of a scatterplot. As a reminder, we usually call the explanatory variable x 
axis. 


and the response variable y. If there is no explanatory-response distinction, either 
variable can go on the horizontal axis. 

We used computer software to produce Figure 3.2. For some problems, you'll 
be expected to make scatterplots by hand. Here’s how to do it. 


HOW TO MAKE A SCATTERPLOT 


1. Decide which variable should go on each axis. 
2. Label and scale your axes. 


3. Plot individual data values. 


The following example illustrates the process of constructing a scatterplot. 


SEC Football 


Making a scatterplot 


At the end of the 2011 college football season, the University of Alabama 
defeated Louisiana State University for the national championship. In- 
terestingly, both of these teams were from the Southeastern Conference 
(SEC). Here are the average number of points scored per game and num- 
ber of wins for each of the twelve teams in the SEC that season. 
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Team Alabama Arkansas Auburn Florida Georgia Kentucky 
Points per game 34.8 36.8 25.7 25.5 32.0 15.8 
Wins 12 11 8 7 10 5 
Team Louisiana State Mississippi Mississippi State South Carolina Tennessee Vanderbilt 
Points per game Osh 16.1 25.3 30.1 20.3 26.7 
Wins 13 2 7 11 5 6 


PROBLEM: Make a scatterplot of the relationship between points per game and wins. 
SOLUTION: We follow the steps described earlier to make the scatterplot. 

1. Decide which variable should go on which axis. The number of wins a football 
team has depends on the number of points they score. So we'll use points 
per game as the explanatory variable (x axis) and wins as the response 
variable (yaxis). 


2. _Labeland scale your axes. We labeled the xaxis “Points per game” and the 
yaxis “Wins.” Because the teams’ points per game vary from 15.6 to 36.8, 
we chose a horizontal scale starting at 15 points, with tick marks every 

5 points. The teams’ wins vary from 2 to 13, so we chose a vertical scale 
starting at O with tick marks every 2 wins. 


3. Plot individual data values. The first team in the table, Alabama, scored 
1s 20 a5 30 35 #0 34.8 points per game and had 12 wins. We plot this point directly above 
Points per game 34.8 on the horizontal axis and to the right of 12 on the vertical axis, as 
shown in Figure 3.3. For the second team in the list, Arkansas, we add the 
FIGURE 3.3 Completed scatterplot of points per game anoint, (36.8, 11) to the graph. By adding the points for the remaining ten 


and Leu) ans Ph TEs oa San teams, we get the completed scatterplot in Figure 3.3. 
intersect at the point (34.8, 12), the values for Alabama. 


Wins 


For Practice Try Exercise 


Describing Scatterplots 


To describe a scatterplot, follow the basic strategy of data analysis from Chapters 
1 and 2: look for patterns and important departures from those patterns. Let’s 
take a closer look at the scatterplot from Figure 3.2. What do we see? 


e The graph shows a clear direction: the overall pattern moves from upper left 
to lower right. That is, states in which higher percents of high school grad- 
uates take the SAT tend to have lower mean SAT Math 


er scores. We call this a negative association between the two 

600 | 230 variables. 
E 5754 . a e The form of the relationship is slightly curved. More im- 
2 soy lore me portant, most states fall into one of two distinct clusters. 
E ° In about half of the states, 25% or fewer graduates took 
= 25- ° a ae A the SAT. In the other half, more than 40% took the SAT. 
= 5007 ° "ete ee ° e The strength of a relationship in a scatterplot is deter- 
475 4 mined by how closely the points follow a clear form. 
ist a ; The overall relationship in Figure 3.2 is moderately 
0 10 20 30 40 so 60 70 80 90 strong: states with similar percents taking the SAT tend 


Percent taking SAT to have roughly similar mean SAT Math scores. 


THINK 
ABOUT IT 
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e ‘Two states stand out in the scatterplot: West Virginia at (19, 501) and Maine at 
(87, 466). These points can be described as outliers because they fall outside 
the overall pattern. 


What explains the clusters? There are two widely used college entrance 
exams, the SAT and the American College Testing (ACT) exam. Each state usually 
favors one or the other. The ACT states cluster at the left of Figure 3.2 and the SAT 
states at the right. In ACT states, most students who take the SAT are applying to a 
selective college that prefers SAT scores. This select group of students has a higher 
mean score than the much larger group of students who take the SAT in SAT states. 


_ —————————X—X———~_J 


HOW TO EXAMINE A SCATTERPLOT 


As in any graph of data, look for the overall pattern and for striking depar- 
tures from that pattern. 


e You can describe the overall pattern of a scatterplot by the direction, 
form, and strength of the relationship. 


e An important kind of departure is an outlier, an individual value that 
falls outside the overall pattern of the relationship. 


Let’s practice examining scatterplots using the SEC football data from the 
previous example. 


SEC Football 


Describing a scatterplot 


In the last example, we constructed the scatterplot shown below that displays the 
average number of points scored per game and the number of wins for college 
football teams in the Southeastern Conference. 


Points per game 


PROBLEM: Describe what the scatterplot reveals about the relation- 
ship between points per game and wins. 


SOLUTION: Direction: In general, it appears that teams that score 
more points per game have more wins and teams that score fewer points per 
game have fewer wins. We say that there is a positive association between 
points per game and wins. 


Form: There seems to be a linear pattern in the graph (that is, the overall 
pattern follows a straight line). 


Strength: Because the points do not vary much from the linear pattern, 
the relationship is fairly strong. There do not appear to be any values that 
depart from the linear pattern, So there are no outliers. 


For Practice Try Exercise 
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Even when there is a clear association 
between two variables in a scatterplot, 
the direction of the relationship only 
describes the overall trend—not the 
relationship for each pair of points. For 
example, even though teams that score 
more points per game generally have 
more wins, Georgia and South Carolina 
are exceptions to the overall pattern. 
Georgia scored more points per game 
than South Carolina (32 versus 30.1) 
but had fewer wins (10 versus 11). 


So far, we’ve seen relationships with two different directions. The number of 
wins generally increases as the points scored per game increases (positive asso- 
ciation). The mean SAT score generally goes down as the percent of graduates 
taking the test increases (negative association). Let’s give a careful definition for 
these terms. 


DT 


DEFINITION: Positive association, negative association 


Two variables have a positive association when above-average values of one tend 
to accompany above-average values of the other and when below-average values 
also tend to occur together. 


Two variables have a negative association when above-average values of one tend 
to accompany below-average values of the other. 


Of course, not all relationships have a clear direction that we can describe 
as a positive association or a negative association. Exercise 9 involves a relation- 
ship that doesn’t have a single direction. This next example, however, illustrates a 
strong positive association with a simple and important form. 


The Endangered Manatee 
Pulling it all together 


Manatees are large, gentle, slow-moving creatures found along the coast of Florida. 
Many manatees are injured or killed by boats. The table below contains data on 
the number of boats registered in Florida (in thousands) and the number of mana- 
tees killed by boats for the years 1977 to 2010.’ 


YEAR BOATS MANATEES | YEAR BOATS MANATEES | YEAR BOATS MANATEES 
1977 = 447 13 1989 711 50 2001 944 81 
1978 460 21 1990 719 47 2002 962 95 
1979 = 481 24 1991 681 53 2003 978 73 
1980 498 16 1992 679 38 2004 983 69 
1981 513 24 1993 678 35 2005 1010 79 
1982 512 20 1994 696 49 2006 1024 92 
1983 526 15 1995 42 2007 = 1027 73 
1984 559 34 1996 60 2008 1010 90 
1985 585 33 1997 54 2009 982 97 
1986 614 33 1998 66 2010 942 83 
1987 645 39 1999 82 
1988 675 43 2000 78 


732 
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809 
830 
880 


Manatees killed 


400 500 600 700 = 800 900 1000 1100 
Boats registered in Florida (10008) 


FIGURE 3.4 Scatterplot of the number of Florida manatees 
killed by boats from 1977 to 2010 against the number of 
boats registered in Florida that year. 
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PROBLEM: Make a scatterplot to show the relationship between 
the number of manatees killed and the number of registered boats. 
Describe what you see. 


SOLUTION: For the scatterplot, we'll use “boats registered” 

as the explanatory variable and “manatees killed” as the response 
variable. Figure 3.4 is our completed scatterplot. There is a positive 
association—more boats registered goes with more manatees 
killed. The form of the relationship is linear. That is, the overall 
pattern follows a straight line from lower left to upper right. The 
relationship is strong because the points don't deviate greatly from 
a line, except for the 4 years that have a high number of boats regis- 
tered, but fewer deaths than expected based on the linear pattern. 


For Practice Try Exercise 


YEAH? 
WELL THIS 
WAS FROM 
A SUZUKI 

150, V-6. 


wWiGy= 


CHECK YOUR UNDERSTANDING 


In the chapter-opening Case Study (page 141), the Starnes family arrived at Old Faithful 
after it had erupted. ‘They wondered how long it would be until the next eruption. Here is 
a scatterplot that plots the interval between consecutive eruptions 
of Old Faithful against the duration of the previous eruption, for 
the month prior to their visit. 
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Duration (minutes) 


1. 


Describe the direction of the relationship. Explain why this 


makes sense. 


2. 


What form does the relationship take? Why are there two 


clusters of points? 


3. 
4. 
5. 


How strong is the relationship? Justify your answer. 
Are there any outliers? 


What information does the Starnes family need to predict 


when the next eruption will occur? 
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7.) TECHNOLOGY 


SCATTERPLOTS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


CORNER 


Making scatterplots with technology is much easier than constructing them by hand. We'll use the SEC football data 
from page 146 to show how to construct a scatterplot on a T1-83/84 or T1-89. 


e Enter the data values into your lists. Put the points per game in L1/list] and the number of wins in L2/list2. 
F2|on the T1-89). Specify the settings shown below. 


e Define a scatterplot in the statistics plot menu (press 
NORMAL FLOAT AUTO REAL RADIAN MP o 


Mark: Fl +: - 
Color: EUS 


e Use ZoomStat (ZoomData on the T1-89) to obtain a graph. The calculator will set the window dimensions automatically 
by looking at the values in L1Alistl and L2/list2. 


NORMAL FLOAT AUTO REAL RADIAN MP n 


revis|zbam|rracelnesyornlerath|oravo[ anf 
j Oo 
| es e. Oo a Oo 
o go 
| 5 5 *s o 4 ae 
a - 
MAIN RAD AUTO FUME 


Notice that there are no scales on the axes and that the axes are not labeled. If you copy a scatterplot from your calculator onto 
your paper, make sure that you scale and label the axes. 


AP® EXAM TIP If you are asked to make a scatterplot on a free-response question, be sure to 
label and scale both axes. Don’tjust copy an unlabeled calculator graph directly onto your paper. 


Some people refer to ras the 
“correlation coefficient.” 


Measuring Linear Association: Correlation 


A scatterplot displays the direction, form, and strength of the relationship between 
two quantitative variables. Linear relationships are particularly important because a 
straight line is a simple pattern that is quite common. A linear relationship is strong 
if the points lie close to a straight line and weak if they are widely scattered about a 
line. Unfortunately, our eyes are not good judges of how strong a linear relationship is. 
The two scatterplots in Figure 3.5 (on the facing page) show the same data, 
but the graph on the right is drawn smaller in a large field. The right-hand rr) 
graph seems to show a stronger linear relationship. 

Because it’s easy to be fooled by different scales or by the amount of space 
around the cloud of points in a scatterplot, we need to use a numerical measure 


to supplement the graph. Correlation is the measure we use. 


DEFINITION: Correlation r 
The correlation r measures the direction and strength of the linear relationship 
between two quantitative variables. 
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Points per game 


How good are you at estimating the 


correlation by eye from a scatterplot? 


To find out, try an online applet. Just 
search for “guess the correlation 
applets.” 


FIGURE 3.6 How correla- 
tion measures the strength 
of a linear relationship. 
Patterns closer to a 
straight line have correla- 


Points per game 


FIGURE 3.5 Two Minitab scatterplots of the same data. The straight-line pattern in the graph on 


the right appears stronger because of the surrounding space. 


The correlation r is always a number between —1 and 1. Correlation indicates the 


direction of a linear relationship by its sign: r > 0 for a positive association and r < 0 


for a negative association. Values of r near 0 indicate a very weak linear relationship. 
The strength of the linear relationship increases as r moves away from 0 toward either 
—1 or 1. The extreme values r = —1 and r = 1 occur only in the case of a perfect 
linear relationship, when the points lie exactly along a straight line. 

Figure 3.6 shows scatterplots that correspond to various values of r. To make 
the meaning of r clearer, the standard deviations of both variables in these plots 
are equal, and the horizontal and vertical scales are the same. The correlation 
describes the direction and strength of the linear relationship in each graph. 


tions closer to 1 or —1. 
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The following Activity lets you explore some important properties of the correlation. 
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ACTIVITY 


MATERIALS: 


Computer with Internet 


connection 


PPLE, 


& 


DESCRIBING RELATIONSHIPS 


Correlation and Regression applet 


Go to the book’s Web site, www.whfreeman.com/tps5e, and launch the Correla- 
tion and Regression applet. 


1. You are going to use the Correlation and Regression applet to make several 
scatterplots with 10 points that have correlation close to 0.7. 
(a) Start by putting two points on the graph. What's the value of the correlation? 
Why does this make sense? 
(b) Make a lower-left to upper-right pattern of 10 points with correla- 
tion about r = 0.7. (You can drag points up or down to adjust r after 
you have 10 points.) 
(c) Make another scatterplot: this one should have 9 points in a verti- 
cal stack at the left of the plot. Add 1 point far to the right and move it 
until the correlation is close to 0.7. 
(d) Make a third scatterplot: make this one with 10 points in a curved 
pattern that starts at the lower left, rises to the right, then falls again at 
the far right. Adjust the points up or down until you have a very smooth 
curve with correlation close to 0.7. 


Summarize: If you know that the correlation between two variables is r = 0.7, 
what can you say about the form of the relationship? 


2. Click on the scatterplot to create a group of 10 points in the lower-left corner 
of the scatterplot with a strong straight-line pattern (correlation about 0.9). 

(a) Add 1 point at the upper right that is in line with the first 10. How does the 
correlation change? 

(b) Drag this last point straight down. How small can you make the correlation? 
Can you make the correlation negative? 


Summarize: What did you learn from Step 2 about the effect of a single point on 
the correlation? 


Now that you know what information the correlation provides—and doesn’t 
provide —let’s look at an example that shows how to interpret it. 


SEC Football 


Interpreting correlation 


PROBLEM: Our earlier scatterplot of the average points per game and number of wins for college 
football teams in the SEC is repeated at top right. For these data, r = 0.936. 


(a) Interpret the value of r in context. 


(b) The point highlighted in red on the scatterplot is Mississippi. What effect does Mississippi have 
onthe correlation? Justify your answer. 
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SOLUTION: 
(a) The correlation of 0.936 confirms what we see in the scatterplot: there 
is a strong, positive linear relationship between points per game and wins in 


4.153 


(b) Mississippi makes the correlation closer to 1 (stronger). If Mississippi 
were not included, the remaining points wouldn't be as tightly clustered in a 


For Practice Try Exercise 


AP® EXAM TIP. If you’re asked to interpret a correlation, start by looking at a scatterplot of 


the data. Then be sure to address direction, form, strength, and outliers (sound familiar?) and 
put your answer in context. 


CHECK YOUR UNDERSTANDING 


The scatterplots below show four sets of real data: (a) repeats the manatee plot in Figure 
3.4 (page 149); (b) shows the number of named tropical storms and the number predicted 
before the start of hurricane season each year between 1984 and 2007 by William Gray of 
Colorado State University; (c) plots the healing rate in micrometers (millionths of a me- 
ter) per hour for the two front limbs of several newts in an experiment; and (d) shows stock 
market performance in consecutive years over a 56-year period. For each graph, estimate 
the correlation r. Then interpret the value of r in context. 
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Calculating Correlation Now that you have some idea of how to interpret 
the correlation, let’s look at how it’s calculated. 


HOW TO CALCULATE THE CORRELATION r 


The formula for the correlation r is a bit complex. It helps us see what cor- 
relation is, but in practice, you should use your calculator or software to find r. 
Exercises 19 and 20 ask you to calculate a correlation step-by-step from the defini- 
tion to solidify its meaning. 

The formula for r begins by standardizing the observations. Let’s use the famil- 
iar SEC football data to perform the required calculations. The table below shows 
the values of points per game x and number of wins y for the SEC college football 
teams. For these data, x = 27.07 and s, = 7.16. 


Team Alabama Arkansas Auburn Florida Georgia Kentucky 
Points per game 34.8 36.8 25.7 25.5 32.0 15.8 
Wins 12 11 8 7 10 5 
Team Louisiana State Mississippi Mississippi State South Carolina Tennessee Vanderbilt 
Points per game 35.7 16.1 25.3 30.1 20.3 26.7 
Wins 13 2 7 11 5 6 
The value 
Xj xX 
Sx 


in the correlation formula is the standardized points per game (z-score) of the ith 
team. For the first team in the table (Alabama), the corresponding z-score is 
Pe 2707 
re 7.16 


That is, Alabama’s points per game total (34.8) is a little more than 1 
standard deviation above the mean points per game for the SEC teams. 


= 1.08 


Some people like to write the 
correlation formula as 


1 
=F 1 > 2 


to emphasize the product of 


standardized scores in the calculation. 


THINK 
ABOUT IT 
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Standardized values have no units —in this example, they are no longer mea- 
sured in points. 
To standardize the number of wins, we use y = 8.08 and s, = 3.34. For 


IZ:— 8.068 
3.34 


dard deviations above the mean number of wins for SEC teams. When we mul- 
tiply this team’s two z-scores, we get a product of 1.2636. The correlation r is an 
“average” of the products of the standardized scores for all the teams. Just 
as in the case of the standard deviation s,, the average here divides by one fewer 
than the number of individuals. Finishing the calculation reveals that r = 0.936 
for the SEC teams. 


Alabama, z, = = 1.17. Alabama’s number of wins (12) is 1.17 stan- 


What does correlation measure? The Fathom screen shots below pro- 
vide more detail. At the left is a scatterplot of the SEC football data with two lines 
added —a vertical line at the group’s mean points per game and a horizontal line 
at the mean number of wins of the group. Most of the points fall in the upper-right 
or lower-left “quadrants” of the graph. That is, teams with above-average points 
per game tend to have above-average numbers of wins, and teams with below- 
average points per game tend to have numbers of wins that are below average. 
This confirms the positive association between the variables. 

Below on the right is a scatterplot of the standardized scores. To get this graph, 
we transformed both the x- and the y-values by subtracting their mean and divid- 
ing by their standard deviation. As we saw in Chapter 2, standardizing a data set 
converts the mean to 0 and the standard deviation to 1. That’s why the vertical and 
horizontal lines in the right-hand graph are both at 0. 
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Notice that all the products of the standardized values will be positive—not 
surprising, considering the strong positive association between the variables. What 
if there was a negative association between two variables? Most of the points would 
be in the upper-left and lower-right “quadrants” and their z-score products would 
be negative, resulting in a negative correlation. 
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Facts about Correlation 


How correlation behaves is more important than the details of the formula. Here’s 
what you need to know in order to interpret correlation correctly. 
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"He says we've ruined his positive correlation between height and weight." 


Mean Math score 


CHAPTER 3 


DESCRIBING RELATIONSHIPS 


1. Correlation makes no distinction between explanatory 
and response variables. It makes no difference which vari- 
able you call x and which you call y in calculating the cor- 
relation. Can you see why from the formula? 


l 5 x—-xX\fVi7¥ 
n-1 Sy Sy 
2. Because r uses the standardized values of the observa- 
tions, r does not change when we change the units of mea- 
surement of x, y, or both. Measuring height in centimeters 


rather than inches and weight in kilograms rather than 
pounds does not change the correlation between height 


and weight. 


¢ Correlation does not imply causation. Even when a scatterplot shows a strong 
linear relationship between two variables, we can’t conclude that changes in 
one variable cause changes in the other. For example, looking at data from 
the last 10 years, there is a strong positive relationship between the number of 
high school students who own a cell phone and the number of students who 
pass the AP® Statistics exam. Does this mean that buying a cell phone will 
help you pass the AP® exam? Not likely. Instead, the correlation is positive 
because both of these variables are increasing over time. 


r= 


3. The correlation r itself has no unit of measurement. It is just a number. 


Describing the relationship between two variables is more complex 
than describing the distribution of one variable. Here are some cautions 
to keep in mind when you use correlation. 


¢ Correlation requires that both variables be quantitative, so that it makes sense 
to do the arithmetic indicated by the formula for r. We cannot calculate a cor- 
relation between the incomes of a group of people and what city they live in 
because city is a categorical variable. 


¢ Correlation only measures the strength of a linear relationship between two 
variables. Correlation does not describe curved relationships between variables, 
no matter how strong the relationship is. A correlation of 0 doesn’t guarantee 
that there’s no relationship between two variables, just that there’s no linear 
relationship. 


e Avalue of r close to 1 or —1 does not guarantee a linear relationship between 


two variables. A scatterplot with a clear curved form can have a correlation 
that is close to 1 or —1. For example, the correlation be- 


om jl tween percent taking the SAT and mean Math score is close 
600 4 Soe to —1, but the association is clearly curved. Always plot your 
575 - oe — data! 
sso 1 °°* : e Like the mean and standard deviation, the correlation is 
pu . °, : not resistant: r is strongly affected by a few outlying obser- 
. : is. aes vations. Use r with caution when outliers appear in the 
one) : i 4 ' scatterplot. 
475 4 ¢ Correlation is not a complete summary of two-variable 
450 SS See data, even when the relationship between the variables 
0 10 20 30 40 S50 60 70 80 90 is linear. You should give the means and standard devia- 
Percent taking SAT tions of both x and y along with the correlation. 
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Of course, even giving means, standard deviations, and the correlation for “state 
SAT Math scores” and “percent taking” will not point out the clusters in Figure 3.2. 
Numerical summaries complement plots of data, but they do not replace them. 


Scoring Figure Skaters 
Why correlation doesn’t tell the whole story 


Until a scandal at the 2002 Olympics brought change, figure skating was scored 
by judges on a scale from 0.0 to 6.0. The scores were often controversial. We have 
the scores awarded by two judges, Pierre and Elena, for many skaters. How well do 
they agree? We calculate that the correlation between their scores is r = 0.9. But 


the mean of Pierre’s scores is 0.8 point lower than Elena’s mean. 


These facts don’t contradict each other. They simply give different kinds of infor- 
mation. The mean scores show that Pierre awards lower scores than Elena. But 
because Pierre gives every skater a score about 0.8 point lower than Elena does, 
the correlation remains high. Adding the same number to all values of either x 
or y does not change the correlation. If both judges score the same skaters, the 
competition is scored consistently because Pierre and Elena agree on which per- 
formances are better than others. The high r shows their agreement. But if Pierre 
scores some skaters and Elena others, we should add 0.8 point to Pierre’s scores to 
arrive at a fair comparison. 


DATA EXPLORATION The SAT essay: Is longer better? 


Following the debut of the new SAT Writing test in March 2005, Dr. Les Perelman 
from the Massachusetts Institute of ‘Technology stirred controversy by reporting, 
“Tt appeared to me that regardless of what a student wrote, the longer the essay, the 
higher the score.” He went on to say, “I have never found a quantifiable predictor 
in 25 years of grading that was anywhere as strong as this one. If you just graded 
them based on length without ever reading them, you’d be right over 90 percent 
of the time.”* The table below shows the data that Dr. Perelman used to draw his 
conclusions.’ 


Words: 460 422 402 365 357 278 236 201 168 156 8133 


Score: 6 6 5 5 6 5 4 4 4 3 2 
Words: 114 108 100 403 401 388 320 258 236 189 128 
Score: 2 1 1 5 6 6 5 4 4 3 2 
Words: 67 697 387 355 337 325 272 150 135 
Score: 1 6 6 5 5 4 4 2 3 


Does this mean that if students write a lot, they are guaranteed high scores? 
Carry out your own analysis of the data. How would you respond to each of 
Dr. Perelman’s claims? 
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Summary 


e A scatterplot displays the relationship between two quantitative variables 
measured on the same individuals. Mark values of one variable on the 
horizontal axis (x axis) and values of the other variable on the vertical axis 
(y axis). Plot each individual’s data as a point on the graph. 


e — If we think that a variable x may help explain, predict, or even cause changes 
in another variable y, we call x an explanatory variable and y a response vari- 
able. Always plot the explanatory variable, if there is one, on the x axis of a 
scatterplot. Plot the response variable on the y axis. 


e In examining a scatterplot, look for an overall pattern showing the direction, 
form, and strength of the relationship and then look for outliers or other 
departures from this pattern. 


e Direction: If the relationship has a clear direction, we speak of either posi- 
tive association (above-average values of the two variables tend to occur to- 
gether) or negative association (above-average values of one variable tend to 
occur with below-average values of the other variable). 


e Form: Linear relationships, where the points show a straight-line pattern, are 
an important form of relationship between two variables. Curved relation- 
ships and clusters are other forms to watch for. 


e Strength: The strength of a relationship is determined by how close the 
points in the scatterplot lie to a simple form such as a line. 


e The correlation r measures the strength and direction of the linear associa- 
tion between two quantitative variables x and y. Although you can calculate 
a correlation for any scatterplot, r measures strength for only straight-line 
relationships. 


e Correlation indicates the direction of a linear relationship by its sign: r > 0 for 
a positive association and r < 0 for a negative association. Correlation always 
satisfies —1 =r = | and indicates the strength of a linear relationship by how 
close it is to — 1 or 1. Perfect correlation, r= +1, occurs only when the points 
on a scatterplot lie exactly on a straight line. 


e Remember these important facts about r: Correlation does not imply causa- 
tion. Correlation ignores the distinction between explanatory and response 
variables. The value of r is not affected by changes in the unit of measure- 
ment of either variable. Correlation is not resistant, so outliers can greatly 
change the value of r. 


TECHNOLOGY 
CORNER 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


7. Scatterplots on the calculator 
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Exercises 


1. Coral reefs How sensitive to changes in water 
pgfze) temperature are coral reefs? To find out, measure 
the growth of corals in aquariums where the water 
temperature is controlled at different levels. Growth is 
measured by weighing the coral before and after the 
experiment. What are the explanatory and response 
variables? Are they categorical or quantitative? 


2. Treating breast cancer Early on, the most common 
treatment for breast cancer was removal of the breast. 
It is now usual to remove only the tumor and nearby 
lymph nodes, followed by radiation. The change in 
policy was due to a large medical experiment that com- 
pared the two treatments. Some breast cancer patients, 
chosen at random, were given one or the other treatment. 
‘The patients were closely followed to see how long they 
lived following surgery. What are the explanatory and 
response variables? Are they categorical or quantitative? 


3. IQand grades Do students with higher IQ test scores 
tend to do better in school? The figure below shows 
a scatterplot of IQ and school grade point average 
(GPA) for all 78 seventh-grade students in a rural 
midwestern school. (GPA was recorded on a 12-point 
scale with A+ = 12,A=11,A—=10,B+=9...., 
D==Landir = 


4. How much gas? Joan is concerned about the 
amount of energy she uses to heat her home. The 
graph below plots the mean number of cubic feet of 
gas per day that Joan used each month against the 
average temperature that month (in degrees Fahren- 
heit) for one heating season. 
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(a) Does the plot show a positive or negative association 
between the variables? Why does this make sense? 


(b) What is the form of the relationship? Is it very strong? 
Explain your answers. 


(c) Explain what the point at the bottom right of the plot 
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al represents. 

10 5 5. Heavy backpacks Ninth-grade students at the Webb 
» 9 c ee pgs) Schools go on a backpacking trip each fall. Students are 
oS °e Baad a one F 6 
£ 8- aie. “us 8 & divided into hiking groups of size 8 by selecting names 
eS - ee ar from a hat. Before leaving, students and their backpacks 
£67 . «Shae are weighed. The data here are from one hiking group 
» 54 cea ee in a recent year. Make a scatterplot by hand that shows 
E 45 = ee how backpack weight relates to body weight. 
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IQ score 


(a) Does the plot show a positive or negative association 
between the variables? Why does this make sense? 


(b) What is the form of the relationship? Is it very strong? 
Explain your answers. 


(c) At the bottom of the plot are several points that we 
might call outliers. One student in particular has a 
very low GPA despite an average IO score. What are 
the approximate IQ and GPA for this student? 


6. Bird colonies One of nature’s patterns connects the 
percent of adult birds in a colony that return from 
the previous year and the number of new adults that 
join the colony. Here are data for 13 colonies of 
sparrowhawks:° 


Percent return: 74 66 81 52 73 62 52 45 62 46 60 46 38 
New adults: 5 © 8 Wl i Ib We 7 i Ws 1 20 20 


Make a scatterplot by hand that shows how the number 
of new adults relates to the percent of returning birds. 
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Heavy backpacks Refer to your graph from Exercise 5. 


Describe the relationship between body weight and 
backpack weight for this group of hikers. 


One of the hikers is a possible outlier. Identify the 
body weight and backpack weight for this hiker. How 
does this hiker affect the form of the association? 


Bird colonies Refer to your graph from Exercise 6. 


Describe the relationship between number of new spar- 
rowhawks in a colony and percent of returning adults. 


For short-lived birds, the association between these 
variables is positive: changes in weather and food sup- 
ply drive the populations of new and returning birds 
up or down together. For long-lived territorial birds, 
on the other hand, the association is negative because 
returning birds claim their territories in the colony 
and don’t leave room for new recruits. Which type of 
species is the sparrowhawk? Explain. 


Does fast driving waste fuel? How does the fuel con- 
sumption of a car change as its speed increases? Here 
are data for a British Ford Escort. Speed is measured in 
kilometers per hour, and fuel consumption is measured 
in liters of gasoline used per 100 kilometers traveled.’ 


Speed Fuel used Speed Fuel used 
(km/h) —(liters/100 km) == (km/h) _—_—(liters/100 km) 

10 21.00 90 7.57 

20 13.00 100 8.27 

30 10.00 110 9.03 

40 8.00 120 9.87 

50 7.00 130 10.79 

60 5.90 140 We 

70 6.30 150 12.83 

80 6.95 


Mass: 
Rate: 


DESCRIBING RELATIONSHIPS 


36.1 54.6 48.5 42.0 50.6 42.0 40.3 33.1 42.4 34.5 51.1 41.2 
995 1425 1396 1418 1502 1256 1189 913 1124 1052 1347 1204 


Use your calculator to help sketch a scatterplot. 


Describe the form of the relationship. Why is it not linear? 
Explain why the form of the relationship makes sense. 


It does not make sense to describe the variables as either 
positively associated or negatively associated. Why? 


Is the relationship reasonably strong or quite weak? 
Explain your answer. 


. Do heavier people burn more energy? Metabolic 


rate, the rate at which the body consumes energy, is 
important in studies of weight gain, dieting, and exer- 
cise. We have data on the lean body mass and resting 
metabolic rate for 12 women who are subjects in a 
study of dieting. Lean body mass, given in kilograms, 
is a person’s weight leaving out all fat. Metabolic rate 
is measured in calories burned per 24 hours. ‘The 
researchers believe that lean body mass is an impor- 
tant influence on metabolic rate. 


(a) Use your calculator to help sketch a scatterplot to 
examine the researchers’ belief. 

(b) Describe the direction, form, and strength of the 
relationship. 


11. Southern education For a long time, the South has 
lagged behind the rest of the United States in the per- 
formance of its schools. Efforts to improve education 
have reduced the gap. We wonder if the South stands 
out in our study of state average SAT’ Math scores. ‘The 
figure below enhances the scatterplot in Figure 3.2 
(page 145) by plotting 12 southern states in red. 
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(a) What does the graph suggest about the southern states? 


(b) The point for West Virginia is labeled in the graph. 
Explain how this state is an outlier. 


12. Do heavier people burn more energy? ‘The study 
of dieting described in Exercise 10 collected data 
on the lean body mass (in kilograms) and metabolic 
rate (in calories) for 12 female and 7 male subjects. 
The figure below is a scatterplot of the data for all 19 
subjects, with separate symbols for males and females. 
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Does the same overall pattern hold for both women 
and men? What difference between the sexes do you 
see from the graph? 


13. Merlins breeding The percent of an animal species 


148] in the wild that survives to breed again is often lower 
& following a successful breeding season. A study of 


Ate 


5 


merlins (small falcons) in northern Sweden observed 
the number of breeding pairs in an isolated area and 
the percent of males (banded for identification) that 

returned the next breeding season. Here are data for 

seven years:® 


a 2) 2) 2 g) g sy 
82 83 70 61 69 58 43 


Breeding pairs: 
Percent return: 


Make a scatterplot to display the relationship be- 
tween breeding pairs and percent return. Describe 
what you see. 


Does social rejection hurt? We often describe our 
emotional reaction to social rejection as “pain.” Does 
social rejection cause activity in areas of the brain 
that are known to be activated by physical pain? If 

it does, we really do experience social and physical 
pain in similar ways. Psychologists first included and 
then deliberately excluded individuals from a social 
activity while they measured changes in brain activity. 
After each activity, the subjects filled out question- 
naires that assessed how excluded they felt. The table 
below shows data for 13 subjects.’ “Social distress” is 
measured by each subject’s questionnaire score after 
exclusion relative to the score after inclusion. (So val- 
ues greater than | show the degree of distress caused 
by exclusion.) “Brain activity” is the change in activity 
in a region of the brain that is activated by physical 
pain. (So positive values show more pain.) 


Subject Social distress Brain activity 
1 1.26 —0.055 
2 1.85 —0.040 
3 1.10 —0.026 
4 2.50 —0.017 
5 Dalit —0.017 
6 2.67 0.017 
7 2.01 0.021 
8 2.18 0.025 
9 2.58 0.027 

10 215 0.033 
| 25 0.064 
12 S688) 0.077 
13 3.65 0.124 


Make a scatterplot to display the relationship between 
social distress and brain activity. Describe what you see. 


Matching correlations Match each of the following 
scatterplots to the r below that best describes it. (Some 


r’s will be left over.) 


r=—09 r=—0.7 


r= 


r=03 r=07 +r 


0.3 
0.9 


r=0 
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Rank the correlations Consider each of the following 
relationships: the heights of fathers and the heights 

of their adult sons, the heights of husbands and the 
heights of their wives, and the heights of women at age 
4 and their heights at age 18. Rank the correlations be- 
tween these pairs of variables from largest to smallest. 
Explain your reasoning. 


Correlation blunders Each of the following statements 
contains an error. Explain what’s wrong in each case. 


“There is a high correlation between the gender of 
American workers and their income.” 


“We found a high correlation (r = 1.09) between 
students’ ratings of faculty teaching and ratings made 
by other faculty members.” 


“The correlation between planting rate and yield of 
corn was found to be r = 0.23 bushel.” 


Teaching and research A college newspaper in- 
terviews a psychologist about student ratings of the 
teaching of faculty members. The psychologist says, 
“The evidence indicates that the correlation between 
the research productivity and teaching rating of 
faculty members is close to zero.” The paper reports 
this as “Professor McDaniel said that good researchers 
tend to be poor teachers, and vice versa.” Explain why 
the paper’s report is wrong. Write a statement in plain 
language (don’t use the word “correlation”) to explain 
the psychologist’s meaning. 
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Dem bones Archaeopteryx is an extinct beast having 
feathers like a bird but teeth and a long bony tail 

like a reptile. Only six fossil specimens are known. 
Because these specimens differ greatly in size, some 
scientists think they are different species rather than 
individuals from the same species. We will examine 
some data. If the specimens belong to the same 
species and differ in size because some are younger 
than others, there should be a positive linear relation- 
ship between the lengths of a pair of bones from all 
individuals. An outlier from this relationship would 
suggest a different species. Here are data on the 
lengths in centimeters of the femur (a leg bone) and 
the humerus (a bone in the upper arm) for the five 
specimens that preserve both bones:!° 


Femur (x): 38 56 59 64 74 
Humerus (J): 4] 63 70 72 84 


Make a scatterplot. Do you think that all five speci- 
mens come from the same species? Explain. 


Find the correlation r step by step, using the formula 
on page 154. Explain how your value for r matches 
your graph in part (a). 

Data on dating A student wonders if tall women tend 
to date taller men than do short women. She measures 
herself, her dormitory roommate, and the women in the 
adjoining rooms. ‘Then she measures the next man each 
woman dates. Here are the data (heights in inches): 


Women(x): 66 64 66 65 70 65 
Men (y): 72 68 70 68 71 ~~ 65 


Make a scatterplot of these data. Based on the scat- 
terplot, do you expect the correlation to be positive or 
negative? Near +1 or not? 


Find the correlation r step by step, using the formula on 
page 154. Do the data show that taller women tend to 
date taller men? 


. Hot dogs Are hot dogs that are high in calories also 


high in salt? The figure below is a scatterplot of the 
calories and salt content (measured as milligrams of 
sodium) in 17 brands of meat hot dogs.'! 
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The correlation for these data is r = 0.87. Explain 
what this value means. 


(b) 


Die 
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What effect does the hot dog brand with the lowest calo- 
rie content have on the correlation? Justify your answer. 
All brawn? The figure below plots the average brain 
weight in grams versus average body weight in kilo- 

Vi 
grams for 96 species of mammals.’* There are many 
small mammals whose points overlap at the lower left. 
The correlation between body weight and brain weight 
is r = 0.86. Explain what this value means. 


What effect does the elephant have on the correla- 
tion? Justify your answer. 
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. Dem bones Refer to Exercise 19. 


How would r change if the bones had been measured 
in millimeters instead of centimeters? (‘here are 
10 millimeters in a centimeter.) 


If the x and y variables are reversed, how would the 
correlation change? Explain. 


. Data on dating Refer to Exercise 20. 


How would r change if all the men were 6 inches 
shorter than the heights given in the table? Does the 
correlation tell us if women tend to date men taller 
than themselves? 


If heights were measured in centimeters rather than 
inches, how would the correlation change? (‘There 
are 2.54 centimeters in an inch.) 


Strong association but no correlation The gas 
mileage of an automobile first increases and then 
decreases as the speed increases. Suppose that this 
relationship is very regular, as shown by the following 
data on speed (miles per hour) and mileage (miles 
per gallon). 


Speed: 20 30 40 50 60 
Mileage: 24 28 30 28 24 


Make a scatterplot to show the relationship between 
speed and mileage. 

Calculate the correlation for these data by hand or 
using technology. 

Explain why the correlation has the value found in part 
(b) even though there is a strong relationship between 
speed and mileage. 


26. What affects correlation? Here are some hypotheti- 


cal data: 
Xe 1 2 8} 4 10 10 
y: 1 3 6} by 1 11 


(a) Make a scatterplot to show the relationship between x 
and y. 

(b) Calculate the correlation for these data by hand or 
using technology. 

(c) What is responsible for reducing the correlation to the 
value in part (b) despite a strong straight-line relation- 
ship between x and y in most of the observations? 

Multiple choice: Select the best answer for Exercises 27 to 32. 

27. You have data for many years on the average price of 
a barrel of oil and the average retail price of a gallon 
of unleaded regular gasoline. If you want to see how 
well the price of oil predicts the price of gas, then you 
should make a scatterplot with as the explana- 
tory variable. 

(c) the year 


(a) the price of oil (e) time 


(b) the price of gas (d) either oil price or gas price 

28. Ina scatterplot of the average price of a barrel of oil 
and the average retail price of a gallon of gas, you 
expect to see 

(a) very little association. 

(b) a weak negative association. 

(c) astrong negative association. 

(d) a weak positive association. 

(e) astrong positive association. 

29. The following graph plots the gas mileage (miles per 
gallon) of various cars from the same model year ver- 
sus the weight of these cars in thousands of pounds. 
The points marked with red dots correspond to cars 
made in Japan. From this plot, we may conclude that 

(a) there is a positive association between weight and gas 
mileage for Japanese cars. 

(b)_ the correlation between weight and gas mileage for all 
the cars is close to 1. 

(c) there is little difference between Japanese cars and cars 
made in other countries. 

(d) Japanese cars tend to be lighter in weight than other cars. 

(e) Japanese cars tend to get worse gas mileage than other cars. 
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30. If women always married men who were 2 years older 
than themselves, what would the correlation between 
the ages of husband and wife be? 


(a) 2 
(b) 1 
(c) 0.5 
d 


(d) 0 
(e) Can’t tell without seeing the data 


31. The figure below is a scatterplot of reading test scores 
against IQ test scores for 14 fifth-grade children. 
There is one low outlier in the plot. What effect does 
this low outlier have on the correlation? 


a) It makes the correlation closer to 1. 


b) It makes the correlation closer to 0 but still positive. 


( 
( 
(c) It makes the correlation equal to 0. 
(d) It makes the correlation negative. 

( 


e) Ithas no effect on the correlation. 
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32. If we leave out the low outlier, the correlation for 
the remaining 13 points in the preceding figure is 


closest to 
(a) —0.95. (c) 0. (e) 0.95. 
—0.5. (d) 0.5. 


Big diamonds (1.2, 1.3) Here are the weights (in 
milligrams) of 58 diamonds from a nodule carried 
up to the earth’s surface in surrounding rock. These 
data represent a population of diamonds formed in a 
single event deep in the earth.’ 


A Adics 
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Make a graph that shows the distribution of weights 
of these diamonds. Describe what you see. Give ap- 
propriate numerical measures of center and spread. 
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34. College debt (2.2) A report published by the Federal (b) Assuming that the distribution of student loan balances 
_ Reserve Bank of New York in 2012 reported the is approximately Normal, use your answer to part (a) to 
results of a nationwide study of college student debt. estimate the proportion of borrowers who owe more than 
Researchers found that the average student loan $54,000. 
balance per borrower is $23,300. ‘They also reported (c) In fact, the report states that about 10% of borrowers 
that about one-quarter of borrowers owe more than owe more than $54,000. What does this fact indicate 
$28,000.!* about the shape of the distribution of student loan 
(a) Assuming that the distribution of student loan balances? 
balances is approximately Normal, estimate the (d) The report also states that the median student loan 
standard deviation of the distribution of student loan balance is $12,800. Does this fact support your conclu- 
balances. sion in part (c)? Explain. 


Least-Squares Regression 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


Interpret the slope and y intercept of a least-squares e Interpret the standard deviation of the residuals 
regression line. and r? and use these values to assess how well the 
Use the least-squares regression line to predict y for a least-squares regression line models the relationship 
given x. Explain the dangers of extrapolation. between two variables. 

Calculate and interpret residuals. Describe how the slope, y intercept, standard deviation 


of the residuals, and r* are influenced by outliers. 


Explain the concept of least squares. 

Determine the equation of a least-squares regression 
line using technology or computer output. 

Construct and interpret residual plots to assess whether 
a linear model is appropriate. 


Find the slope and y intercept of the least-squares regres- 
sion line from the means and standard deviations of x and y 
and their correlation. 


Linear (straight-line) relationships between two quantitative variables are fairly com- 
mon and easy to understand. In the previous section, we found linear relationships 
in settings as varied as sparrowhawk colonies, natural-gas consumption, and Florida 
manatee deaths. Correlation measures the direction and strength of these relation- 
ships. When a scatterplot shows a linear relationship, we’d like to summarize the 
overall pattern by drawing a line on the scatterplot. A regression line summarizes 
the relationship between two variables, but only in a specific setting: when one of 
the variables helps explain or predict the other. Regression, unlike correlation, re- 
quires that we have an explanatory variable and a response variable. 


DEFINITION: Regression line 


A regression line is a line that describes how a response variable y changes as an 
explanatory variable x changes. We often use a regression line to predict the value of 
y for a given value of x. 
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Let’s look at a situation where a regression line provides a useful model. 


HO Much Is That Truck Worth? 


Regression lines as models 


Everyone knows that cars and trucks lose value the more they are 
driven. Can we predict the price of a used Ford F-150 SuperCrew 
4 X 4 if we know how many miles it has on the odometer? A ran- 
dom sample of 16 used Ford F-150 SuperCrew 4 X 4s was selected 
from among those listed for sale at autotrader.com. The number 
of miles driven and price (in dollars) were recorded for each of the 
trucks.!> Here are the data: 


Miles driven 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 
Price (in dollars) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price (in dollars) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


Figure 3.7 is a scatterplot of these data. The plot shows a moderately strong, nega- 
tive linear association between miles driven and price with no outliers. The cor- 
relation is r = —0.815. The line on the plot is a regression line for predicting price 
from miles driven. 


This regression line predicts 
price from miles driven. 


Price (in dollars) 


FIGURE 3.7 Scatterplot 
showing the price and 
miles driven of used Ford : I 1 1 
F-150s, with a regression 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 
line added. Miles driven 


Interpreting a Regression Line 


A regression line is a model for the data, much like the density curves of 
Chapter 2. The equation of a regression line gives a compact mathematical de- 
scription of what this model tells us about the relationship between the response 
variable y and the explanatory variable x. 
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ll EE 


DEFINITION: Regression line, predicted value, slope, y intercept 


Suppose that yis a response variable (plotted on the vertical axis) and x is an 
explanatory variable (plotted on the horizontal axis). A regression line relating y to x 
has an equation of the form 


j=a+ bx 
In this equation, 


e (read “y hat”) is the predicted value of the response variable y for a given 
value of the explanatory variable x. 


e bis the slope, the amount by which yis predicted to change when x increases 
by one unit. 
e ais the yintercept, the predicted value of y when x = 0. 


Although you are probably accustomed to the form y = mx + b for the equa- 
tion of a line from algebra, statisticians have adopted a different form for the equa- 
tion of a regression line. Some use f = bo + b\x. We prefer § = a + bx for two 
reasons: (1) it’s simpler and (2) your calculator uses this form. Don’t get so caught 
up in the symbols that you lose sight of what they mean! The coefficient of x is 
always the slope, no matter what symbol is used. 

Many calculators and software programs will give you the equation of a regres- 
sion line from keyed-in data. Understanding and using the line are more impor- 
tant than the details of where the equation comes from. 


How Much Is That Truck Worth? 


Interpreting the slope and y intercept 


The equation of the regression line shown in 
ada) | Figure 3.7 is 


° ¢ | The y intercept of the 


sical regression line is a = 38,257. 


“price = 38,257 — 0.1629 (miles driven) 


35,000 + 


30,000 4 ° 
PROBLEM: Identify the slope and yintercept of the 
regression line. Interpret each value in context. 
SOLUTION: The slope b= —0.1629 tells us that the 
price of a used Ford F-150 is predicted to go down by 0.1629 


dollars (16.29 cents) for each additional mile that the truck 
ia has been driven. The yintercept a = 38,257 is the predicted 


0 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 A . ‘ 
price of a Ford F-150 that has been driven O miles. 


Miles driven 
For Practice Try Exercise BEQ@QYat{()) 


25,000 4 


20,000 + 


Price (in dollars) 


15,000 + 7 


10,000 + ° 


The slope of a regression line is an important numerical description of the rela- 
tionship between the two variables. Although we need the value of the y intercept 
to draw the line, it is statistically meaningful only when the explanatory variable 
can actually take values close to zero, as in this setting. 


Price (in dollars) 
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Does a small slope mean that there’s no relationship? For the 
me oe men ate 
miles driven and price regression line, the slope b = —0.1629 is a small number. 
ABOUT IT This does not mean that change in miles driven has little effect on price. The size 
of the slope depends on the units in which we measure the two variables. In this set- 
ting, the slope is the predicted change in price (in dollars) when the distance driven 
increases by | mile. There are 100 cents in a dollar. If we measured price in cents 
instead of dollars, the slope would be 100 times larger, b = 16.29. You can’t say how 
strong a relationship is by looking at the size of the slope of the regression line. 


?——— HT _|__ aoa oq_cwq_q_q—_—O 


Prediction 


We can use a regression line to predict the response 7 for a specific value of the 
explanatory variable x. Here’s how we do it. 


How Much Is That Truck Worth? 


Predicting with a regression line 
For the Ford F-150 data, the equation of the regres- 


sion line is 


price = 38,257 — 0.1629 (miles driven) 


If a used Ford F-150 has 100,000 miles driven, sub- 


stitute x = 100,000 in the equation. The predicted 
price is 


price = 38,257 — 0.1629(100,000) = 21,967 dollars 


0 


T T T T 
20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 
Miles driven 


This prediction is illustrated in Figure 3.8. 


FIGURE 3.8 Using the regression line to predict price for a Ford 
F-150 with 100,000 miles driven. 


The accuracy of predictions from a regression line depends on how much the 
data scatter about the line. In this case, prices for trucks with similar mileage show 
a spread of about $10,000. The regression line summarizes the pattern but gives 
only roughly accurate predictions. 

Can we predict the price of a Ford F-150 with 300,000 miles driven? We can 
certainly substitute 300,000 into the equation of the line. The prediction is 


“price = 38,257 — 0.1629(300,000) = —10,613 dollars 


That is, we predict that we would need to pay someone else $10,613 just to take 
the truck off our hands! 
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A negative price doesn’t make much sense in this context. Look again at Figure 
3.8. A truck with 300,000 miles driven is far outside the set of x values for our data. 
We can’t say whether the relationship between miles driven and price remains lin- 
ear at such extreme values. Predicting price for a truck with 300,000 miles driven 
is an extrapolation of the relationship beyond what the data show. 


Often, using the regression line DEFINITION: Extrapolation 


to make a prediction for x = 0 is aaa Peer sae : : 
an extrapolation. That's why the y Extrapolation is the use of a regression line for prediction far outside the interval 


intercept isn’t always statistically of values of the explanatory variable x used to obtain the line. Such predictions are 
meaningful. often not accurate. 


Don’t make predictions using values of x that are much larger or much 


Few relationships are linear for all values of the explanatory variable. @ 
smaller than those that actually appear in your data. 


CHECK YOUR UNDERSTANDING 


Some data were collected on the weight of a male white laboratory rat for the first 25 weeks 
after its birth. A scatterplot of the weight (in grams) and time since birth (in weeks) shows 
a fairly strong, positive linear relationship. The linear regression equation weight = 100+ 
40(time) models the data fairly well. 


1. What is the slope of the regression line? Explain what it 


= 
= 


ay 


means in context. A é 
2. What’s the y intercept? Explain what it means in context. b 

3. Predict the rat’s weight after 16 weeks. Show your work. a 

4. Should you use this line to predict the rat’s weight at 
age 2 years? Use the equation to make the prediction and 


think about the reasonableness of the result. (There are 454 
grams in a pound.) 


Residuals and the Least-Squares 
Regression Line 


In most cases, no line will pass exactly through all the 


a points in a scatterplot. Because we use the line to predict 

40,000 4 Cariedaeiaion temmdh ine y from x, the prediction errors we make are errors in y, the 

35,000 vertical direction in the scatterplot. A good regression line 
g rel Repression lite makes the vertical deviations of the points from the line as 
S 9 = 38,257 — 0.1629x small as possible. 
aro Figure 3.9 shows a scatterplot of the Ford F-150 data 
20,000 5 with a regression line added. The prediction errors are 
zs 15,000 4 

10,000 4 

cae | | , | | | | a FIGURE 3.9 Scatterplot of the Ford F-150 data with a regression 


Gi -BOnGa aban 68:hou anges: Agomin lan ie 446. Gu FeD DIN line added. A good regression line should make the prediction 
Miles driven errors (shown as bold vertical segments) as small as possible. 
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marked as bold segments in the graph. These vertical deviations represent “left- 
over” variation in the response variable after fitting the regression line. For that 
reason, they are called residuals. 


(I 


DEFINITION: Residual 


A residual is the difference between an observed value of the response variable and 
the value predicted by the regression line. That is, 


residual = observed y — predicted y 


=p) pant 


The following example shows you how to calculate and interpret a residual. 


How Much Is That Truck Worth? 


Finding a residual 


PROBLEM: Find and interpret the residual for the Ford F-150 that had 70,583 miles driven 
and a price of $21,994. 


SOLUTION: The regression line predicts a price of 
price = 38,257 — 0.1629(70,583) = 26,759 dollars 


for this truck, but its actual price was $21,994. This truck’s residual is 


residual = observed y — predicted y 
= y— y = 21,994 — 26,759 = —4765 dollars 
That is, the actual price of this truck is $4765 lower than expected, based on its mileage. The actual 


price might be lower than predicted as a result of other factors. For example, the truck may have been 
in an accident or may need a new paint job. 


For Practice Try Exercise 


The line shown in Figure 3.9 makes the residuals for the 16 trucks “as small 
as possible.” But what does that mean? Maybe this line minimizes the sum of 
the residuals. Actually, if we add up the prediction errors for all 16 trucks, the 
positive and negative residuals cancel out. That’s the same issue we faced when 
we tried to measure deviation around the mean in Chapter 1. We'll solve the 
current problem in much the same way: by squaring the residuals. The regres- 
sion line we want is the one that minimizes the sum of the squared residuals. 
That’s what the line shown in Figure 3.9 does for the Ford F-150 data, which is 
why we call it the least-squares regression line. 


(I 


DEFINITION: Least-squares regression line 


The least-squares regression line of y on xis the line that makes the sum of the 
squared residuals as small as possible. 
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45,000 + 
Sum of squared 
40,000 - residuals = 461,300,000 
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(b) 


FIGURE 3.10 The least-squares idea: make the errors in predicting y as small as possible by 
minimizing the sum of the squares of the residuals. 


Figure 3.10 gives a geometric interpretation of the least-squares idea for the truck 
data. Figure 3.10(a) shows the “squared” residual for the truck with 70,583 miles driven 
and a price of $21,994. The area of this square is (—4765)(—4765) = 22,705,225. 
Figure 3.10(b) shows the squared residuals for all the trucks. The sum of squared re- 
siduals is 461,300,000. No other regression line would give a smaller sum of squared 
residuals. 


ACTIVITY | Investigating properties of the 


least-squares regression line 


MATERIALS: 


Computer with 
Internet connection 


Ago cata pouty 
© (Dew your own tne 
Rotate SS: 1,28 [7] 


#) Show least squares tine 
Show mean XA Y ines 
Show mrsicusis 


wLe, In this Activity, you will use the Correlation and Regression applet at the book’s 
Web site, www.whfreeman.com/tps5e, to explore some properties of the least- 
squares regression line. 


1. Click on the scatterplot to create a group of 15 to 20 points from lower left to 
upper right with a clear positive straight-line pattern (correlation around 0.7). 


2. Click the “Draw your own line” button to select starting and 
ending points for your own line on the plot. Use the mouse to 
adjust the starting and ending points until you have a line that 
models the association well. 


3. Click the “Show least-squares line” button. How do the two 
lines compare? One way to measure this is to compare the “Rel- 
ative SS,” the ratio of the sum of squared residuals from your 
line and the least-squares regression line. If the two lines are ex- 
actly the same, the relative sum of squares will be 1. Otherwise, 
the relative sum of squares will be larger than 1. 


4. Press the “CLEAR” button and create another scatterplot as in Step 1. Then 
click on “Show least-squares line” and “Show mean X & Y lines.” What do you 
notice? Move or add points, one at a time, in your scatterplot to see if this result 
continues to hold true. 
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5. Now click the “Show residuals” button. How does an outlier affect the slope 
and y intercept of the least-squares regression line? Move or add points, one at a 
time, to investigate. Does it depend on whether the outlier has an x-value close 

to the center of the plot or toward the far edges of the plot? 


Your calculator or statistical software will give the equation of the least-squares 
line from data that you enter. Then you can concentrate on understanding and 
using the regression line. 


LEAST-SQUARES REGRESSION LINES 


TECHNOLOGY OW THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s use the Ford F-150 data to show how to find the equation of the least-squares regression line on the 'T1-83/84 
and T1-89. Here are the data again: 


Miles driven 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 
Price (in dollars) 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
Miles driven 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price (in dollars) 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


1. Enter the miles driven data into LI/listl and the price data into L2/list2. Then make a scatterplot. Refer to the 
Technology Corner on page 150. 


2. To determine the least-squares regression line: 


TI-83/84 TL-89 
e Press| STAT} choose CALC and then e In the Statistics/List Editor, press | F4 | (CALC); 
LinReg (a+bx) . OS 2.55 or later: In the dialog choose Regressions and then LinReg (a+bx). 
box, ene) the following: Xlist :L1, Ylist:L2, e Enter list] for the Xlist, list2 for the Ylist; choose to 
Freghist (leave blank), Store RegEQ:Y1, store the RegEqn to yl(x); and press [ENTER | 


and choose Calculate. Older OS: Finish the 
command to read LinReg (a+bx) L1,L2,Y1 and 
press | ENTER |. (Y1 is found under VARS/Y-VARS/ 
Function.) 


NORMAL FLOAT AUTO REAL RADIAN MP o 


=atbx 


a=38257.13507 
=-, 1629185531 

r2=, 664247901 

r=-, 8150140496 


2fi7l= 
PRIM RAD AUTO FUME 276 


Note: If you do not want to store the equation to Y1, then leave the StoreRegEq prompt blank (OS 2.55 or later) or 
use the following command (older OS): LinReg (a+bx) L1,L2. 
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3. Graph the regression line. Turn off all other equations in the Y= screen and use ZoomStat/ZoomData to add the 
least-squares line to the scatterplot. 


NORMAL FLOAT AUTO REAL RADIAN MP a] 


reviclevamfreace|neavarnfaatn|oral ran 
mo 


4. Save these lists for later use. On the home screen, use the |STO> | key to help execute the command 
L1>MILES:L2—PRICE (list1—MILES:1list2—>PRICE on the TI-89). 


Note: If r? and r do not appear on the TI-83/84 screen, do this one-time series of keystrokes: OS 2.55 or later: 
Press MODE and set STAT DIAGNOSTICS to ON. Older OS: Press [2nd] |0] (CATALOG), scroll down to 
DiagnosticoOn, and press |ENTER |. Press [ENTER | again to execute the command. ‘The screen should say “Done.” 
Then redo Step 2 to calculate the least-squares line. The r’ and r values should now appear. 


AP® EXAM TIP When displaying the equation of a least-squares regression line, the 
calculator will report the slope and intercept with much more precision than we need. 
However, there is no firm rule for how many decimal places to show for answers on the 


AP® exam. Our advice: Decide how much to round based on the context of the problem 
you are working on. 


CHECK YOUR UNDERSTANDING 


It’s time to practice your calculator regression skills. Using the familiar SEC football data 
in the table below, repeat the steps in the previous Technology Corner. You should get 
y = —3.7506 + 0.4372x as the equation of the regression line. 


Team Alabama Arkansas Auburn Florida Georgia Kentucky 
Points per game 34.8 36.8 25.7 25.5 32.0 15.8 
Wins 12 11 8 7 10 5 
Team Louisiana State Mississippi Mississippi State South Carolina Tennessee Vanderbilt 
Points per game 35.7 16.1 25.3 30.1 20.3 26.7 
Wins 13 2 7 11 5 6 


Determining Whether a Linear Model Is 
Appropriate: Residual Plots 


One of the first principles of data analysis is to look for an overall pattern and for 
striking departures from the pattern. A regression line describes the overall pattern 
of a linear relationship between an explanatory variable and a response variable. 
We see departures from this pattern by looking at the residuals. 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Plotl:La,sL2 


X=68474 Y=33961 


Most graphing calculators and 
statistical software will calculate and 
store residuals for you. 


Some software packages prefer to plot 
the residuals against the predicted 
values y instead of against the values 
of the explanatory variable. The basic 
shape of the two plots is the same 
because Y is linearly related to x. 
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How Much Is That Truck Worth? 


Examining residuals 


Let’s return to the Ford F-150 data about the number of miles driven 
and price for a random sample of 16 used trucks. In general, trucks 
with more miles driven have lower prices. In the Technology Corner, 
we confirmed that the equation of the least-squares regression line for 
these data is price = 38,257 — 0.1629 (miles driven). The calculator 
screen shot in the margin shows a scatterplot of the data with the least- 
squares line added. 


One truck had 68,474 miles driven and a price of $33,961. This truck 
is marked on the scatterplot with an X. Because the point is above the 
line on the scatterplot, we know that its actual price is higher than the pre- 
dicted price. To find out exactly how much higher, we calculate the residual 
for this truck. The predicted price for a Ford F-150 with 68,474 miles driven is 


% = 38,257 — 0.1629(68,474) = $27,103 
The residual for this truck is therefore 
residual = observed y — predicted y = y — $ = 33,961 — 27,103 = $6858 
This truck costs $6858 more than expected, based on its mileage. 


The 16 points used in calculating the equation of the least-squares regression line 
produce 16 residuals. Rounded to the nearest dollar, they are 


—=4/6> (64 = 3506" 8617 7728 3057 5004 458 
2207 WSS =S1 2) 6858" ZNO) 55720 99732 = 2857 


Although residuals can be calculated from any model that is fitted to the data, the 
residuals from the least-squares line have a special property: the mean of the least- 
squares residuals is always zero. You can check that the sum of the residuals in the 
above example is —$18. The sum is not exactly 0 because of rounding errors. 

You can see the residuals in the scatterplot of Figure 3.11(a) on the next page 
by looking at the vertical deviations of the points from the line. The residual plot 
in Figure 3.11(b) makes it easier to study the residuals by plotting them against 
the explanatory variable, miles driven. Because the mean of the residuals is always 
zero, the horizontal line at zero in Figure 3.11(b) helps orient us. This “residual 
= 0” line corresponds to the regression line in Figure 3.1 1(a). 


a 


DEFINITION: Residual plot 


A residual plot is a scatterplot of the residuals against the explanatory variable. 
Residual plots help us assess whether a linear model is appropriate. 
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(a) (b) 


10,000 4 
e 
e 
e 
e 
5000 

—_ 
Z e 
Ss — e e 
: : 
S = o+° 
i=} 72) 
7 ) 
vo ms e 
oe e 
‘= 
al 5000 4° Z 

e 

e 
e 
5000 1 1 1 1 1 1 1 r ~ 10,000 > T 
0 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 0 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 
Miles driven Miles driven 


FIGURE 3.11 (a) Scatterplot of price versus miles driven, with the least-squares line. (b) Residual 
plot for the regression line displayed in Figure 3.11(a). The line at y= O marks the sum (and 
mean) of the residuals. 


CHECK YOUR UNDERSTANDING 
Refer to the Ford F-150 miles driven and price data. 


1. Find the residual for the truck that had 8359 miles driven and a price of $31,891. 
Show your work. 


2. Interpret the value of this truck’s residual in context. 


3. For which truck did the regression line overpredict price by the most? Justify your 
answer. 


Examining residual plots A residual plot in effect turns the regression 
line horizontal. It magnifies the deviations of the points from the line, making it 
easier to see unusual observations and patterns. Because it is easier to see an un- 
usual pattern in a residual plot than a scatterplot of the original data, we often use 
residual plots to determine if the model we are using is appropriate. 

Figure 3.12(a) shows a nonlinear association between two variables and the 
least-squares regression line for these data. Figure 3.12(b) shows the residual plot 
for these data. 
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FIGURE 3.12 (a)A straight line is not a good model for these data. (b) The residual plot has a 
curved pattern. 


FIGURE 3.13 The random scatter 
of points indicates that the regres- 
sion line has the same form as 
the association, so the line is an 
appropriate model. 


THINK 
ABOUT IT 


TECHNOLOGY 
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Because the form of our model (linear) is not the same as the form of the as- 
sociation (curved), there is an obvious leftover pattern in the residual plot. When 
an obyious curved pattern exists in a residual plot, the model we are using is not 
appropriate. We'll look at how to deal with curved relationships in Chapter 12. 

When we use a line to model a linear association, there will be no leftover pat- 
terns in the residual plot, only random scatter. Figure 3.13 shows the residual plot 
for the Ford F-150 data. Because there is only random scatter in the residual plot, 
we know the linear model we used is appropriate. 


10,000 + 


5000 4 


Residual 
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Why do we look for patterns in residual plots? The word residual 
comes from the Latin word residuum, meaning “left over.” When we calculate a 
residual, we are calculating what is left over after subtracting the predicted value 
from the observed value: 


residual = observed y— predicted y 


Likewise, when we look at the form of a residual plot, we are looking at the form that 
is left over after subtracting the form of the model from the form of the association: 


form of residual plot = form of association — form of model 


When there is a leftover form in the residual plot, the form of the association and 
form of the model are not the same. However, if the form of the association and 
form of the model are the same, the residual plot should have no form, other than 
random scatter. 


OR 


RESIDUAL PLOTS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s continue the analysis of the Ford F-150 miles driven and price data from the previous ‘Technology Corner 
(page 171). You should have already made a scatterplot, calculated the equation of the least-squares regression line, and 
graphed the line on your plot. Now, we want to calculate residuals and make a residual plot. Fortunately, your calculator 
has already done most of the work. Each time the calculator computes a regression line, it also computes the residu- 
als and stores them in a list named RESID. Make sure to calculate the equation of the regression line before using the 


RESID list! 
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TI-83/84 TI-89 
1. Display the residuals in L3(list3). 


e With L3 highlighted, press |2nd||sTaT|(LIST) and ¢ With list3 highlighted, press |2nd 


(VAR-LINK), ar- 


select the RESID list. row down to STATVARS, and select the RESID list. 


NORMAL FLOAT AUTO REAL RADIAN MP 


cane|nce| rats ina 


robro |28956 
11st301 J=-4765. 8548548529 
MAIN RAD AUTO FUN zee 


Lotv= -4763. 854834852 


2. Turn off Plot] and the regression equation. Specify Plot2 with L1/listl as the x variable and L3/list3 as the y variable. 
Use ZoomStat (ZoomData) to see the residual plot. 


NORMAL FLOAT AUTO REAL RADIAN MP o 


The x axis in the residual plot serves as a reference line: points above this line correspond to positive residuals and points 
below the line correspond to negative residuals. 


Note: If you don’t want to see the residuals in L3/list3, you can make a residual plot in one step by using the RESID list 
as the y variable in the scatterplot. 


CHECK YOUR UNDERSTANDING 


In Exercises 5 and 7, we asked you to make and describe a scatterplot for the hiker data 
shown in the table below. Here is a residual plot for the least-squares regression of pack 
weight on body weight for the 8 hikers. 


4 
3 
2 
E 1 
0 
z 
2 
3 
4 
100 110 120 130 140 150 160 170 180 190 
Body weight 
Body weight (Ib): 120 187 109 103 131 165 158 116 
Backpack weight (Ib): 26 30 26 24 29 35 31 28 


1. One of the hikers had a residual of nearly 4 pounds. Interpret this value. 


2. Based on the residual plot, is a linear model appropriate for these data? 
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How Well the Line Fits the Data: 
The Role of s and r in Regression 


A residual plot is a graphical tool for determining if a least-squares regression 
line is an appropriate model for a relationship between two variables. Once we 
determine that a least-squares regression line is appropriate, it makes sense to ask 
a follow-up question: How well does the line work? That is, if we use the least- 
squares regression line to make predictions, how good will these predictions be? 


The Standard Deviation of the Residuals We already know that a 
residual measures how far an observed y-value is from its corresponding predicted 
value ¥. In an earlier example, we calculated the residual for the Ford F-150 with 
68,474 miles driven and price $33,961. The residual was $6858, meaning that the 
actual price was $6858 higher than we predicted. 

To assess how well the line fits all the data, we need to consider 


10,0004 : the residuals for each of the 16 trucks, not just one. Using these 
' : residuals, we can estimate the “typical” prediction error when using 
oai| . the least-squares regression line. To do this, we calculate the stan- 
7 * 2” ° dard deviation of the residuals. 
got 
é 


P | “residuals” 
e 5= SS 
-5000) @ 7 e i 2 


For the Ford F-150 data, the sum of squared residuals is 461,300,000. 


T T T T T T T T 
0 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 


Miles driven So, the standard deviation of the residuals is 
/461,300,000 
$s =,/——— = 5740 dollars 
14 
Did you recognize the number When we use the least-squares regression line to predict the price of a Ford 


461,300,000? We first encountered this F'_1 50) using the number of miles it has been driven, our predictions will typically 
Rumbar oh Pave 17 anenevenny beat by about $5740. Looking at the residual plot, this seems like a reasonable 


that the least-squares regression : 
line minimized sie sinh squared value. Although some of the residuals are close to 0, others are close to $10,000 


residuals. We'll see it again shortly. or —$1 0,000. 


THINK Does the formula for s look slightly familiar? It should. In Chapter 1, 
we defined the standard deviation of a set of quantitative data as 
ABOUT IT 


Sige 
8, = 4{———— 
n—-| 
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We interpreted the resulting value as the “typical” distance of the data points from 
the mean. In the case of two-variable data, we’re interested in the typical (vertical) 
distance of the data points from the regression line. We find this value in much 
the same way: by adding up the squared deviations, then averaging (again in a 
funny way), and taking the square root to get back to the original units of measure- 
ment. Why do we divide by n — 2 this time instead of n — 1? You'll have to wait 
until Chapter 12 to find out. 


$A 


The Coefficient of Determination There is another numerical quantity 
that tells us how well the least-squares line predicts values of the response variable 
y. Itis7’, the coefficient of determination. Some computer packages call it “R-sq.” 
You may have noticed this value in some of the calculator and computer regres- 
sion output that we showed earlier. Although it’s true that 7” is equal to the square 
of r, there is much more to this story. 


How Much Is That Truck Worth? 


How can we predict y if we don’t know x? 


Suppose that we randomly selected an additional used Ford F-150 that was on 
sale. What should we predict for its price? Figure 
3.14 shows a scatterplot of the truck data that we have 
studied throughout this section, including the least- 
squares regression line. Another horizontal line has 
been added at the mean y-value, y = $27,834. If we 


don’t know the number of miles driven for the addi- 
tional truck, we can’t use the regression line to make 
a prediction. What should we do? Our best strategy 
is to use the mean price of the other 16 trucks as our 
prediction. 


Price (in dollars) 


FIGURE 3.14 Scatterplot and least-squares regression line for 
Miles daven the Ford F-150 data with a horizontal line added at the mean 
price, $27,834. 


Figure 3.15(a) on the facing page shows the prediction errors if we use the 
average price y as our prediction for the original group of 16 trucks. We can see 
that the sum of the squared residuals for this line is }(y; — ¥)’ = 1,374,000,000. 
This quantity measures the total variation in the y-values from their mean. This is 
also the same quantity we use to calculate the standard deviation of the prices, s). 

If we learn the number of miles driven on the additional truck, then we could 
use the least-squares line to predict its price. How much better does the regression 
line do at predicting prices than simply using the average price 7 of all 16 trucks? 
Figure 3.15(b) reminds us that the sum of squared residuals for the least-squares 
line is Sresiduals” = 461,300,000. This is the same quantity we used to calculate 
the standard deviation of the residuals. 


(a) 


Price (thousands) 
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Price (thousands) 


— Price = 27834 


50 


Miles Driven (thousands) 


100 150 200 50 0 50. 100. 150 200 
Miles Driven (thousands) 
— Price = 3.83e+04 - 0.163Miles Driven; r2 = 0.66 


Sum of squares = 1374000000 (b) Sum of squares = 461300000 


FIGURE 3.15 (a) The sum of squared residuals is 1,374,000,000 if we use the mean price as 
our prediction for all 16 trucks. (b) The sum of squares from the least-squares regression line is 
461,300,000. 


The ratio of these two quantities tells us what proportion of the total variation in 
y still remains after using the regression line to predict the values of the response 
variable. In this case, 


461,300,000 
1,374,000,000 


This means that 33.6% of the variation in price is unaccounted for by the least- 
squares regression line using x = miles driven. This unaccounted-for variation is 
likely due to other factors, including the age of the truck or its condition. ‘Taking 
this one step further, the proportion of the total variation in y that is accounted for 
by the regression line is 


= 0.336 


1 — 0.336 = 0.664 


We interpret this by saying that “66.4% of the variation in price is accounted for by 
the linear model relating price to miles driven.” 


If all the points fall directly on the least-squares line, the sum of squared re- 
siduals is 0 and r? = 1. Then all the variation in y is accounted for by the linear 
relationship with x. Because the least-squares line yields the smallest possible sum 
of squared prediction errors, the sum of squared residuals can never be more than 
the sum of squared deviations from the mean of y. In the worst-case scenario, the 
least-squares line does no better at predicting y than y = y does. Then the two 
sums of squares are the same and r’ = 0. 
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It seems fairly remarkable that the coefficient of determination is actually the 
correlation squared. This fact provides an important connection between correla- 
tion and regression. When you see a correlation, square it to get a better feel for 
how well the least-squares line fits the data. 


THINK What’s the relationship between the standard deviation of 
the residuals s and the coefficient of determination r2? They 
ABOUT IT are both calculated from the sum of squared residuals. They also both attempt to 
answer the question, “How well does the line fit the data?” The standard devia- 
tion of the residuals reports the size of a typical prediction error, in the same units 
as the response variable. In the truck example, s = 5740 dollars. The value of r’, 
however, does not have units and is usually expressed as a percentage between 0% 
and 100%, such as 1? = 66.4%. Because these values assess how well the line fits 
the data in different ways, we recommend you follow the example of most statisti- 
cal software and report them both. 


Oo __—_§|_ oAeo>js 
Let’s revisit the SEC football data to practice what we have learned. 


SEC Football 


Residual plots, s, and r? 


In Section 3.1, we looked at the relationship between the average num- 
ber of points scored per game x and the number of wins y for the 12 
college football teams in the Southeastern Conference. A scatterplot 
with the least-squares regression line and a residual plot are shown. The 
equation of the least-squares regression line is y = —3.75 + 0.437x. 


Also, s = 1.24 andr’ = 0.88. 


PROBLEM: 
(a) Calculate and interpret the residual for South Carolina, which scored 30.1 points 
per game and had 11 wins. 


(b) Isa linear model appropriate for these data? Explain. 
(c) Interpret the value of 5. 


(d) Interpret the value of r?. 


Residual 


Points per game Points per game 


AP® EXAM TIP Students 
often have a hard time 
interpreting the value of r? 
on AP® exam questions. 
They frequently leave out key 
words in the definition. Our 


advice: Treat this as a fill- 


in-the-blank exercise. Write 
“—__% of the variation 

in [response variable 

name] is accounted for by 
the linear model relating 
[response variable name] to 
[explanatory variable name].” 
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SOLUTION: 
(a) The predicted amount of wins for South Carolina is 


y = —3.75 + 0.437(30.1) = 9.40 wins 
The residual for South Carolina is 
residual = y— y = 11 — 9.40 = 1.60 wins 
South Carolina won 1.60 more games than expected, based on the number of points they scored per 
game. 
(b) Because there is no obvious pattern left over in the residual plot, the linear model is appropriate. 


(c) When using the least-squares regression line with x = points per game to predict y = the num- 
ber of wins, we will typically be off by about 1.24 wins. 


(4) About 88% of the variation in wins is accounted for by the linear model relating wins to points 
per game. 


For Practice Try Exercise 


Interpreting Computer Regression Output 


Figure 3.16 displays the basic regression output for the Ford F-150 data from two 
statistical software packages: Minitab and JMP. Other software produces very simi- 
lar output. Each output records the slope and y intercept of the least-squares line. 
The software also provides information that we don’t yet need (or understand!), 
although we will use much of it later. Be sure that you can locate the slope, the y 
intercept, and the values of s and r? on both computer outputs. Once you under- 
stand the statistical ideas, you can read and work with almost any software output. 


Minitab oMP 
Summary of Fit ag 
Slope y intercept RSquare (oe) sai 
Predictor Coef SE Coef T 2 RSquare Adj 0.640266 deviation 
Constant % 2446 15.64 0.000 Root Mean Square Error (5740.13 J of the residuals 
Miles Driven 0.03096 -5.26 0.000 Mean of Response 27833 .69 
r? Observations (or Sum Wgts) 16 
(s = 5740.13) [R-Sq = 66.4%] R-Sq(adj) = 64.0% 
Parameter Estimates 
Standard deviation of the residuals Term Estimate Std Error t Ratio Prob>|t| 
Intercept 2445.813 15.64 <.0001 
Miles Driven [oes 0.030956 -5.26 0.0001 
y intercept Slope 


FIGURE 3.16 Least-squares regression results for the Ford F-150 data from two statistical soft- 
ware packages. Other software produces similar output. 


Using Feet to Predict Height 


Interpreting regression output 
A random sample of 15 high school students was selected from the U.S. 


CensusAtSchool database. The foot length (in centimeters) and height (in 
centimeters) of each student in the sample were recorded. Least-squares 
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regression was performed on the data. A scatterplot with the regression line 
added, a residual plot, and some computer output from the regression are 
shown below. 


Predictor Coef SE Coef aE P 
Constant 103.41 ALS) 5/530) 530) 0.000 
Foot length 2.7469 0.7833 Sia 0.004 
S = 7.95126 R-Sq = 48.6% R-Sq(adj) = 44.7% 
15 
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e 
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=15 
T T T T T T T T T T T T T T T T T T T T T 
22 24 26 28 30 22 24 26 28 30 
Foot length (cm) Foot length (cm) 


PROBLEM: 


(a) What is the equation of the least-squares regression line that describes the relationship 
between foot length and height? Define any variables that you use. 


(b) Interpret the slope of the regression line in context. 
(c) Find the correlation. 
(4) Isa line an appropriate model to use for these data? Explain how you know. 


SOLUTION: 


(a) The equationis y = 103.41 + 2.7469x, where y = predicted height (in centimeters) and xis 
foot length (in centimeters). We could also write 


predicted height = 103.41 + 2.7469 (foot length) 


(b) For each additional centimeter of foot length, the least-squares regression line predicts an 
increase of 2.7469 cm in height. 

(c) Tofind the correlation, we take the square root of r?: r = +\/0.486 = +0.697. Because 
the scatterplot shows a positive association, r = 0.697. 


(a) Because the scatterplot shows a linear association and the residual plot has no obvious leftover 
patterns, a line is an appropriate model to use for these data. 


For Practice Try Exercise 


Regression to the Mean 


Using technology is often the most convenient way to find the equation of a least- 
squares regression line. It is also possible to calculate the equation of the least- 
squares regression line using only the means and standard deviations of the two 


AP® EXAM TIP The formula 
sheet for the AP® exam uses 
different notation for these 


: Sy 

equations: b, = r— and 
Sy 

bo = Y — b,X. That’s because 
the least-squares line is written 
as ¥ = by + b,x. We prefer our 
simpler versions without the 
subscripts! 
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variables and their correlation. Exploring this method will highlight an important 
relationship between the correlation and the slope of a least-squares regression 
line—and reveal why we include the word “regression” in the expression “least- 
squares regression line.” 


HOW TO CALCULATE THE LEAST-SQUARES REGRESSION LINE 


We have data on an explanatory variable x and a response variable y for n 
individuals. From the data, calculate the means x and ¥ and the standard de- 
viations s, and s, of the two variables and their correlation r. The least-squares 
regression line is the line y = a + bx with slope 
b= Bez 
Sx 
and y intercept 


a=y-—bx 


The formula for the y intercept comes from the fact that the least-squares regres- 
sion line always passes through the point (x, ¥). You discovered this in Step 4 of the 
Activity on page 170. Substituting (x, y) into the equation » = a + bx produces 
the equation ¥ = a + bx. Solving this equation for a gives the equation shown in 
the definition box, a = y — bx. 

To see how these formulas work in practice, let’s look at an example. 


Using Feet to Predict Height 


Calculating the least-squares regression line 


In the previous example, we used data from a random sample of 15 high school 
students to investigate the relationship between foot length (in centimeters) and 
height (in centimeters). The mean and standard deviation of the foot lengths 
are x = 24.76 cm and s, = 2.71 cm. The mean and standard deviation of the 
heights are y = 171.43 cm and s, = 10.69 cm. The correlation between foot 
length and height is r = 0.697. 


PROBLEM: Find the equation of the least-squares regression line for predicting height from foot 
length. Show your work. 


SOLUTION: The least-squares regression line of height yon foot length xhas slope 


5y 


10.69 
B= = 009 eae eo 
, 271 
The least-squares regression line has y intercept 

a= y — bx = 17145 — 2,75(24.76) = 103.34 


So, the equation of the least-squares regression line is y = 103.34 + 2.75x. 


For Practice Try Exercise 
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mean height 


y = 103.34 + 2.75x 


DESCRIBING RELATIONSHIPS 


There is a close connection between the correlation and the slope of the least- 
squares regression line. The slope is 


This equation says that along the regression line, a change of | standard deviation in x 
corresponds to a change ofr standard deviations in y. When the variables are perfectly 
correlated (r = 1 or r = —1), the change in the predicted response ¥ is the same (in 
standard deviation units) as the change in x. For example, ifr = | and x is 2 standard 
deviations above its mean, then the corresponding value of f will be 2 standard devia- 
tions above the mean of y. 

However, if the variables are not perfectly correlated (—1 <r < 1), the change 
in f is less than the change in x, when measured in standard deviation units. To il- 
lustrate this property, let’s return to the foot length and height data from the previous 
example. 

The figure at left shows the regression line = 103.34 + 2.75x. We 
have added four more lines to the graph: a vertical line at the mean 


95 5 foot length x, a vertical line at ¥ + s, (1 standard deviation above the 
aS] mean foot length), a horizontal line at the mean height ¥, and a hori- 
Pe zontal line at y + s, (1 standard deviation above the mean height). 
E 180 5 When a student’s foot length is 1 standard deviation above the 
a1 7 mean foot length x, the predicted height 7 is above the mean height 
2 ae y, but not an entire standard deviation above the mean. How far 
. 4 i above the mean is the value of ¥? 
<< ae , % = mean foot length From the graph, we can see that 
ae eS ee change in y ?? 
22 24 26 28 30 b = slope = wae 
Foot length (cm) change inx 8x 
From earlier, we know that 
ae PR, 
Sy 


Sir Francis Galton (1822-1911) looked 
at data on the heights of children 
versus the heights of their parents. 

He found that taller-than-average 
parents tended to have children who 
were taller than average but not quite 
as tall as their parents. Likewise, 
shorter-than-average parents tended 
to have children who were shorter 
than average but not quite as short as 
their parents. Galton called this fact 
“regression to the mean” and used the 
symbol r because of the correlation’s 
important relationship to regression. 


THINK 
ABOUT IT 


Setting these two equations equal to each other, we have 


Thus, 9 must be r-s, above the mean y. 

In other words, for an increase of | standard deviation in the value of the 
explanatory variable x, the least-squares regression line predicts an increase of 
only r standard deviations in the response variable y. When the correlation isn’t 
r= 1 or —1, the predicted value of y is closer to its mean y than the value of x 
is to its mean x. This is called regression to the mean, because the values of y 
“regress” to their mean. 


What happens if we standardize both variables? Standardizing 
a variable converts its mean to 0 and its standard deviation to 1. Doing this to both 
x and y will transform the point (x, ) to (0, 0). So the least-squares line for the 
standardized values will pass through (0, 0). What about the slope of this line? 
From the formula, it’s b = rs,/s,. Because we standardized, s, = s, = 1. That 
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means b = r. In other words, the slope is equal to the correlation. The Fathom 
screen shot confirms these results. It shows that 7? = 0.49, so r= V0.49 = 0.7, 
approximately the same value as the slope of 0.697. 
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zFootlength 
— zHeight = -8.2e-17 + 0.697zFootlength; r? = 0.49 
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Putting It All Together: Correlation 
and Regression 


In Chapter 1, we introduced a four-step process for organizing a statistics problem. 
Here is another example of the four-step process in action. 


Gesell Scores 
Putting it all together 


Does the age at which a child begins to talk predict a later score on a test of mental 
ability? A study of the development of young children recorded the age in months 
at which each of 21 children spoke their first word and their Gesell Adaptive Score, 
the result of an aptitude test taken much later.’° The data appear in the table be- 
low, along with a scatterplot, residual plot, and computer output. Should we use a 
linear model to predict a child’s Gesell score from his or her age at first word? If so, 
how accurate will our predictions be? 
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Predictor Coef SE Coef T P 
Constant 109.874 5.068 21.68 0.000 
Age =1,-1270 0.3102 =3.63 0.002 
S = 11.0229 R-Sq = 41.0% R-Sq(adj) = 37.9% 


STATE: Isa linear model appropriate for these data? If so, how well does the least-squares 


regression line fit the data? 


PLAN: To determine whether a linear model is appropriate, we will look at the scatterplot and 
residual plot to see if the association is linear or nonlinear. Then, if a linear model is appropriate, 
we will use the standard deviation of the residuals and r* to measure how well the least-squares 


line fits the data. 


DO: The scatterplot shows a moderately strong, negative linear association between age at 
first word and Gesell score. There are a couple of outliers in the scatterplot. Child 19 has a very 
high Gesell score for his or her age at first word. Also, child 18 didn’t speak his or her first word 
until much later than the other children in the study and has a much lower Gesell score. The residu- 
al plot does not have any obvious patterns, confirming what we saw in the scatterplot—a linear 


model is appropriate for these data. 


From the computer output, the equation of the least-squares regression line is y = 109.874 — 
1.1270x. The standard deviation of the residuals is s = 11.0229. This means that our predictions 
will typically be off by 11.0229 points when we use the linear model to predict Gesell scores from 
age at first word. Finally, 41% of the variation in Gesell score is accounted for by the linear model 


relating Gesell score to age at first word. 


CONCLUDE: Although a linear model is appropriate for these data, our predictions might not be 
very accurate. Our typical prediction error is about 11 points, and more than half of the variation in 
Gesell score is still unaccounted for. Furthermore, we should be hesitant to use this model to make 
predictions until we understand the effect of the two outliers on the regression results. 


For Practice Try Exercise 


Correlation and Regression Wisdom 


Correlation and regression are powerful tools for describing the relationship 
between two variables. When you use these tools, you should be aware of their 


limitations. 
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1. The distinction between explanatory and response variables is impor- Cg 


tant in regression. Least-squares regression makes the distances of 

the data points from the line small only in the y direction. If we re- 

verse the roles of the two variables, we get a different least-squares regression 
line. This isn’t true for correlation: switching x and y doesn’t affect the value 
of r. 


Predicting Price, Predicting Miles Driven 


Two different regression lines 


Figure 3.17(a) repeats the scatterplot of the Ford F-150 data with the least-squares 
regression line for predicting price from miles driven. We might also use the data 
on these 16 trucks to predict the number of miles driven from the price of the 
truck. Now the roles of the variables are reversed: price is the explanatory variable 
and miles driven is the response variable. Figure 3.17(b) shows a scatterplot of 
these data with the least-squares regression line for predicting miles driven from 
price. The two regression lines are very different. The standard deviations of the 
residuals are different as well. In (a), the standard deviation is s = 5740 dollars, but 
in (b) the standard deviation is s = 28,716 miles. However, no matter which vari- 
able we put on the x axis, the value of r? is 66.4% and the correlation is r = —0.815. 
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FIGURE 3.17 (a) Scatterplot with least-squares regression line for predicting price from miles 
driven. (b) Scatterplot with least-squares regression line for predicting miles driven from price. 


2. Correlation and regression lines describe only linear relationships. You 
can calculate the correlation and the least-squares line for any rela- 
tionship between two quantitative variables, but the results are useful 
only if the scatterplot shows a linear pattern. Always plot your data! 


The following four scatterplots show very different associations. Which do you 
think has the highest correlation? 
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Answer: All four have the same correlation, r = 0.816. Furthermore, the least- 
squares regression line for each association is exactly the same, ¥ = 3 + 0.5x. 
These four data sets, developed by statistician Frank Anscombe, illustrate the im- 
portance of graphing data before doing calculations.'” 
3. Correlation and least-squares regression lines are not resistant. You al- 4% 
ready know that the correlation r is not resistant. One unusual point 
in a scatterplot can greatly change the value of r. Is the least-squares 
line resistant? Not surprisingly, the answer is no. 
Let’s revisit the age at first word and Gesell score data to shed some light on this 
issue. The scatterplot and residual plot for these data are shown in Figure 3.18. 
The two outliers, child 18 and child 19, are indicated on each plot. 
140 30 ° 
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40 
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FIGURE 3.18 (a) Scatterplot of Gesell Adaptive Scores versus the age at first word for 21 children. The line is the 
least-squares regression line for predicting Gesell score from age at first word. (b) Residual plot for the regression. 
Child 18 and Child 19 are outliers. Each blue point in the graphs stands for two individuals. 


FIGURE 3.19 Three least-squares 
regression lines of Gesell score 
on age at first word. The green 


line is calculated from all the data. 


The dark blue line is calculated 
leaving out Child 18. Child 18 is 
an influential observation because 
leaving out this point moves the 
regression line quite a bit. The red 
line is calculated leaving out only 
Child 19. 
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Child 19 has a very large residual because this point lies far from the regres- 
sion line. However, Child 18 has a fairly small residual. That’s because Child 18’s 
point is close to the line. How do these two outliers affect the regression? 

Figure 3.19 shows the results of removing each of these points on the correla- 
tion and the regression line. The graph adds two more regression lines, one cal- 
culated after leaving out Child 18 and the other after leaving out Child 19. You 
can see that removing the point for Child 18 moves the line quite a bit. (In fact, 
the equation of the new least-squares line is ¥ = 105.630 — 0.779x.) Because of 
Child 18’s extreme position on the age scale, this point has a strong influence on 
the position of the regression line. However, removing Child 19 has little effect 
on the regression line. 
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e= Two children With all 19 children: 
Child 19 = —0.64 
120 4 0 y = 109.874 — 1.127x 
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Least-squares lines make the sum of the squares of the vertical distances to the 
points as small as possible. A point that is extreme in the x direction with no other 
points near it pulls the line toward itself. We call such points influential. 


(I 


DEFINITION: Outliers and influential observations in regression 


An outlier is an observation that lies outside the overall pattern of the other observa- 
tions. Points that are outliers in the y direction but not the x direction of a scatterplot 
have large residuals. Other outliers may not have large residuals. 


An observation is influential for a statistical calculation if removing it would mark- 
edly change the result of the calculation. Points that are outliers in the x direction of 
a scatterplot are often influential for the least-squares regression line. 


We did not need the distinction between outliers and influential observations 
in Chapter 1. A single large salary that pulls up the mean salary X for a group of 
workers is an outlier because it lies far above the other salaries. It is also influ- 
ential, because the mean changes when it is removed. In the regression setting, 
however, not all outliers are influential. The least-squares line is most likely to 
be heavily influenced by observations that are outliers in the x direction. The 
scatterplot will alert you to such observations. Influential points often have small 
residuals, because they pull the regression line toward themselves. If you look at 
just a residual plot, you may miss influential points. 

The best way to verify that a point is influential is to find the regression line 
both with and without the unusual point, as in Figure 3.19. If the line moves more 
than a small amount when the point is deleted, the point is influential. 
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How much difference can one point make? The strong influence 
THINK Fok: jes é 
of Child 18 makes the original regression of Gesell score on age at first word 
ABOUT IT misleading. The original data have r? = 0.41. That is, the least-squares line relat- 
ing age at which a child begins to talk with Gesell score explains 41% of the varia- 
tion on this later test of mental ability. This relationship is strong enough to be 
interesting to parents. If we leave out Child 18, r? drops to only 11%. The apparent 
strength of the association was largely due to a single influential observation. 
What should the child development researcher do? She must decide whether 
Child 18 is so slow to speak that this individual should not be allowed to influence 
the analysis. If she excludes Child 18, much of the evidence for a connection 
between the age at which a child begins to talk and later ability score vanishes. If 
she keeps Child 18, she needs data on other children who were also slow to begin 
talking, so that the analysis no longer depends so heavily on just one child. 


Oh 


We finish with our most important caution about correlation and regression. 


4. Association does not imply causation. When we study the relationship 
between two variables, we often hope to show that changes in the ex- 
planatory variable cause changes in the response variable. A strong as- 
sociation between two variables is not enough to draw conclusions about cause 
and effect. Sometimes an observed association really does reflect cause and 
effect. A household that heats with natural gas uses more gas in colder months 
because cold weather requires burning more gas to stay warm. In other cases, 
an association is explained by other variables, and the conclusion that x causes 
y is not valid. 


Does Having More Cars Make You 
Live Longer? 
Association, not causation 


A serious study once found that people with two cars live longer than people 
who own only one car.'* Owning three cars is even better, and so on. There 


is a substantial positive association between number of cars x and length of 
life y. 


The basic meaning of causation is that by changing x, we can bring about 
a change in y. Could we lengthen our lives by buying more cars? No. The 
study used number of cars as a quick indicator of wealth. Well-off people 
tend to have more cars. They also tend to live longer, probably because they 
are better educated, take better care of themselves, and get better medical 
care. The cars have nothing to do with it. There is no cause-and-effect link 
between number of cars and length of life. 


Associations such as those in the previous example are sometimes called “non- 
sense associations.” The association is real. What is nonsense is the conclusion 
that changing one of the variables causes changes in the other. Another variable — 
such as personal wealth in this example—that influences both x and y can create 
a strong association even though there is no direct connection between x and y. 


Remember: It only makes sense to 
talk about the correlation between two 
quantitative variables. If one or both 
variables are categorical, you should 
refer to the association between the 
two variables. To be safe, you can use 
the more general term “association” 
when describing the relationship 
between any two variables. 


How Faithful Is Old Faithful? 
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ASSOCIATION DOES NOT IMPLY CAUSATION 


An association between an explanatory variable x and a response variable y, 
even if it is very strong, is not by itself good evidence that changes in x actu- 
ally cause changes in y. 


Here is a chance to use the skills you have gained to address the question posed 
at the beginning of the chapter. 


= 


In the chapter-opening Case Study (page 141), the Starnes family had 
just missed seeing Old Faithful erupt. They wondered how long it would 
be until the next eruption. The scatterplot below shows data on the dura- 
tion (in minutes) and the interval of time until the next eruption (also in 
minutes) for each Old Faithful eruption in the month before their visit. 


Interval (minutes) 
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Describe the nature of the relationship between interval and 
duration. 


Here is some computer output from a least-squares regression 
analysis on these data. 


Residual 


Regression Analysis: Interval versus Duration 
Predictor Coef£ SE Coef T. P 
Constant 33.347 1.201 27.76 0.000 
Duration 13.2854 0.3404 39.03 0.000 


S = 6.49336 R-Sq = 85.4% R-Sq(adj) = 85.3% 


Is a linear model appropriate? Justify your answer. 


3 
Duration (minutes) 


Give the equation of the least-squares regression line. Be sure to 
define any variables you use. 


Park rangers indicated that the eruption of Old Faithful that just 
finished lasted 3.9 minutes. How long do you predict the Starnes 
family will have to wait for the next eruption? Show how you ar- 
rived at your answer. 

The actual time that the Starnes family has to wait is probably 
not exactly equal to your prediction in Question +. Based on the 
computer output, about how far off do you expect the prediction 
to be? Explain. 
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Summary 


e  Aregression line is a straight line that describes how a response variable y changes 
as an explanatory variable x changes. You can use a regression line to predict the 
value of y for any value of x by substituting this x into the equation of the line. 


e The slope b of a regression line ¥ = a + bx is the rate at which the predicted 
response changes along the line as the explanatory variable x changes. Spe- 
cifically, b is the predicted change in y when x increases by | unit. 


e The y intercept a of a regression line ? = a + bx is the predicted response 
when the explanatory variable x equals 0. This prediction is of no statistical 
use unless x can actually take values near 0. 


e Avoid extrapolation, the use of a regression line for prediction using values 
of the explanatory variable outside the range of the data from which the line 
was calculated. 


e ‘The most common method of fitting a line to a scatterplot is least squares. ‘The 
least-squares regression line is the straight line = a + bx that minimizes the 
sum of the squares of the vertical distances of the observed points from the line. 


e You can examine the fit of a regression line by studying the residuals, which 
are the differences between the observed and predicted values of y. Be on the 
lookout for patterns in the residual plot, which indicate that a linear model 
may not be appropriate. 


e The standard deviation of the residuals s measures the typical size of the 
prediction errors (residuals) when using the regression line. 


e The coefficient of determination r’ is the fraction of the variation in the 
response variable that is accounted for by least-squares regression on the ex- 
planatory variable. 


e The least-squares regression line of y on x is the line with slope b = 1(s,/s,) 
and intercept a = y — bx. This line always passes through the point (x, jy). 


¢ Correlation and regression must be interpreted with caution. Plot the data to be 
sure that the relationship is roughly linear and to detect outliers. Also look for in- 
fluential observations, individual points that substantially change the correlation 
or the regression line. Outliers in x are often influential for the regression line. 


e = Most of all, be careful not to conclude that there is a cause-and-effect rela- 
tionship between two variables just because they are strongly associated. 


3.2| TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


8. Least-squares regression lines on the calculator 


9. Residual plots on the calculator 


35. 


36. 


Be 


Exercises 


What’s my line? You use the same bar of soap to 
shower each morning. The bar weighs 80 grams when 
it is new. Its weight goes down by 6 grams per day on 
average. What is the equation of the regression line 
for predicting weight from days of use? 


What’s my line? An eccentric professor believes that 
a child with IQ 100 should have a reading test score 
of 50 and predicts that reading score should increase 
by | point for every additional point of IQ. What 

is the equation of the professor’s regression line for 
predicting reading score from IQ? 


Gas mileage We expect a car’s highway gas mileage 
to be related to its city gas mileage. Data for all 1198 
vehicles in the government’s recent Fuel Economy 
Guide give the regression line: predicted highway 
mpg = 4.62 + 1.109 (city mpg). 


What's the slope of this line? Interpret this value in context. 


What's the y intercept? Explain why the value of the 
intercept is not statistically meaningful. 


Find the predicted highway mileage for a car that gets 
16 miles per gallon in the city. 


. IQ and reading scores Data on the IQ test scores 


and reading test scores for a group of fifth-grade 
children give the following regression line: predicted 
reading score = —33.4 + 0.882(IQ score). 


What’s the slope of this line? Interpret this value in 
context. 


What's the y intercept? Explain why the value of the 
intercept is not statistically meaningful. 


Find the predicted reading score for a child with an 
IO score of 90. 


. Acid rain Researchers studying acid rain measured 


the acidity of precipitation in a Colorado wilderness 
area for 150 consecutive weeks. Acidity is measured 
by pH. Lower pH values show higher acidity. The 
researchers observed a linear pattern over time. 
They reported that the regression line pH = 5.43 — 
0.0053(weeks) fit the data well.!” 


Identify the slope of the line and explain what it 
means in this setting. 


Identify the y intercept of the line and explain what it 
means in this setting. 


According to the regression line, what was the pH at 
the end of this study? 


. How much gas? In Exercise 4 (page 159), we exam- 


ined the relationship between the average monthly 
temperature and the amount of natural gas consumed 
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a1. 


413% 


44, 


in Joan’s midwestern home. The figure below shows 
the original scatterplot with the least-squares line 
added. The equation of the least-squares line is 


§ = 1425 — 19.87x. 
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Identify the slope of the line and explain what it 
means in this setting. 


Identify the y intercept of the line. Explain why it’s 
risky to use this value as a prediction. 


Use the regression line to predict the amount of 
natural gas Joan will use in a month with an average 
temperature of 30°F. 


Acid rain Refer to Exercise 39. Would it be appropri- 
ate to use the regression line to predict pH after 1000 
months? Justify your answer. 


How much gas? Refer to Exercise 40. Would it be 
appropriate to use the regression line to predict Joan’s 
natural-gas consumption in a future month with an 
average temperature of 65°F? Justify your answer. 


Least-squares idea ‘The table below gives a small 

set of data. Which of the following two lines fits the 
data better: § = 1 — x or ¥ = 3 — 2x? Use the least- 
squares criterion to justify your answer. (Note: Neither 
of these two lines is the least-squares regression line 
for these data.) 


Least-squares idea In Exercise 40, the line drawn 
on the scatterplot is the least-squares regression line. 
Explain the meaning of the phrase “least-squares” to 
Joan, who knows very little about statistics. 


. Acid rain In the acid rain study of Exercise 39, the 


actual pH measurement for Week 50 was 5.08. Find 
and interpret the residual for this week. 


. How much gas? Refer to Exercise 40. During March, 


the average temperature was 46.4°F and Joan used 490 
cubic feet of gas per day. Find and interpret the residual 
for this month. 
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Bird colonies Exercise 6 (page 159) examined the 
relationship between the number of new birds y and 
percent of returning birds x for 13 sparrowhawk colo- 
nies. Here are the data once again. 


Percent return: 
New adults: 


74 66 81 52 73 62 52 45 62 46 60 46 38 
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(a) 


Use your calculator to help make a scatterplot. 


(b) Use your calculator’s regression function to find the 


(c) 


(d) 


48. 


equation of the least-squares regression line. Add this 
line to your scatterplot from (a). 


Explain in words what the slope of the regression line 
tells us. 


Calculate and interpret the residual for the colony 
that had 52% of the sparrowhawks return and 11 new 
adults. 


Do heavier people burn more energy? Exercise 

10 (page 160) presented data on the lean body mass 
and resting metabolic rate for 12 women who were 
subjects in a study of dieting. Lean body mass, given 
in kilograms, is a person’s weight leaving out all fat. 
Metabolic rate, in calories burned per 24 hours, is the 
rate at which the body consumes energy. Here are the 
data again. 
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Egyptian village of Nahya. Here are the mean weights 
(in kilograms) for 170 infants in Nahya who were 
weighed each month during their first year of life: 


Age (months): 1 2 9 4&4 © © 7 #8 
Weight (kg): 
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Mass: 36.1 54.6 48.5 42.0 50.6 42.0 40.3 33.1 42.4 34.5 51.1 41.2 


Rate: 995 1425 1396 1418 1502 1256 1189 913 1124 1052 1347 1204 


Use your calculator to help make a scatterplot. 


Use your calculator’s regression function to find the 
equation of the least-squares regression line. Add this 
line to your scatterplot from part (a). 


Explain in words what the slope of the regression line 
tells us. 


Calculate and interpret the residual for the woman 
who had a lean body mass of 50.6 kg and a metabolic 
rate of 1502. 


. Bird colonies Refer to Exercise 47. 


Use your calculator to make a residual plot. Describe 
what this graph tells you about the appropriateness of 
using a linear model. 


(b) Which point has the largest residual? Explain what 


50. 


(a) 


this residual means in context. 


Do heavier people burn more energy? Refer to 
Exercise 48. 


Use your calculator to make a residual plot. Describe 
what this graph tells you about the appropriateness of 
using a linear model. 


(b) Which point has the largest residual? Explain what 
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the value of that residual means in context. 


Nahya infant weights A study of nutrition in 
developing countries collected data from the 
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A hasty user of statistics enters the data into software and 
computes the least-squares line without plotting the data. 
The result is weight = 4.88 + 0.267 (age). A residual 
plot is shown below. Would it be appropriate to use this 
regression line to predict y from x? Justify your answer. 


Residual 


0 2 4 6 8 
Age (months) 


Driving speed and fuel consumption Exercise 9 
(page 160) gives data on the fuel consumption y 

of a car at various speeds x. Fuel consumption is 
measured in liters of gasoline per 100 kilometers 
driven and speed is measured in kilometers per hour. 
A statistical software package gives the least-squares 
regression line and the residual plot shown below. 
The regression line is ¥ = 11.058 — 0.01466x. Would 
it be appropriate to use the regression line to predict y 
from x? Justify your answer. 
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Oil and residuals ‘The ‘Trans-Alaska Oil Pipeline is 

a tube that is formed from 1/2-inch-thick steel and 

that carries oil across 800 miles of sensitive arctic and 
subarctic terrain. The pipe segments and the welds that 
join them were carefully examined before installation. 
How accurate are field measurements of the depth of 
small defects? The figure below compares the results 
of measurements on 100 defects made in the field with 
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measurements of the same defects made in the labora- 
tory.” The line y = x is drawn on the scatterplot. 


Field measurement 


0 20 40 60 80 


Laboratory measurement 


Describe the overall pattern you see in the scatterplot, 
as well as any deviations from that pattern. 


If field and laboratory measurements all agree, then 
the points should fall on the y = x line drawn on the 
plot, except for small variations in the measurements. 
Is this the case? Explain. 


The line drawn on the scatterplot (y = x) is not the 
least-squares regression line. How would the slope 
and y intercept of the least-squares line compare? 
Justify your answer. 


Oil and residuals Refer to Exercise 53. The following 
figure shows a residual plot for the least-squares regres- 
sion line. Discuss what the residual plot tells you about 
the appropriateness of using a linear model. 
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Laboratory measurement 


Olympic figure skating For many people, the wom- 
en’s figure skating competition is the highlight of the 
Olympic Winter Games. Scores in the short program 
x and scores in the free skate y were recorded for each 
of the 24 skaters who competed in both rounds during 
the 2010 Winter Olympics in Vancouver, Canada.”! 
A regression analysis was performed using these data. 
The scatterplot and residual plot follow. ‘The equation 
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of the least-squares regression line is 
j = —16.2 + 2.07x. Also, s = 10.2 and r’ = 0.736. 


160 4 
140 
120 


100 


Free skate score 


80 


Residual 
i 


Short program score 


(a) Calculate and interpret the residual for the gold 


medal winner, Yu-Na Kim, who scored 78.50 in the 
short program and 150.06 in the free skate. 


Is a linear model appropriate for these data? Explain. 
Interpret the value of s. 


Interpret the value of r’. 


. Age and height A random sample of 195 students 


was selected from the United Kingdom using the 
CensusAtSchool data selector. The age (in years) x 
and height (in centimeters) y was recorded for each 
of the students. A regression analysis was performed 
using these data. The scatterplot and residual plot 
are shown below. The equation of the least-squares 
regression line is ¥ = 106.1 + 4.21x. Also, s = 8.61 
and 7? = 0.274. 


Height 


196 


58. 


(a) 


(b) 


60. 
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20 4 < y ¢ brain that responds to physical pain goes up as distress 
a co er ae from social exclusion goes up. A scatterplot shows 
nai i qa eet a moderately strong, linear relationship. The figure 
B o* 4-4 i, below shows Minitab regression output for these data. 
“an i iv 
2-104 ° - 4 : i ' aor 
~20 ] e : : = _ Regression Analysis: Brain versus Distress 
4 : Predictor Coot SE Coef T Pp 
Constant =0.12608 0.02465 -5.12 0.000 
Ape ge ae Ee ae eee pee distress 0.060782 0.009979 6.09 0.000 


Age 


Calculate and interpret the residual for the student 
who was 141 cm tall at age 10. 


Is a linear model appropriate for these data? Explain. 
Interpret the value of s. 


Interpret the value of 7’. 


. Bird colonies Refer to Exercises 47 and 49. For the 


regression you performed earlier, 7? = 0.56 and s = 3.67. 
Explain what each of these values means in this setting. 


Do heavier people burn more energy? Refer to Exer- 
cises 48 and 50. For the regression you performed earlier, 
? = 0.768 and s = 95.08. Explain what each of these 


values means in this setting. 


Merlins breeding Exercise 13 (page 160) gives data on 
the number of breeding pairs of merlins in an isolated 
area in each of seven years and the percent of males who 
returned the next year. The data show that the percent 
returning is lower after successful breeding seasons and 
that the relationship is roughly linear. The figure below 
shows Minitab regression output for these data. 


(0 Session im | 


Regression Analysis: Percent return versus Breeding pairs ~ 


Coef SE Coef Tr P 
266.07 52.15 5.10 0.004 


Predictor 
Constant 


Breeding pairs -6.650 1.736 -3.83 0.012 


S = 7.76227 R-Sq = 74.6%  R-Sq(adj) = 69.5% 


What is the equation of the least-squares regression 
line for predicting the percent of males that return 
from the number of breeding pairs? Use the equa- 
tion to predict the percent of returning males after a 
season with 30 breeding pairs. 


What percent of the year-to-year variation in percent 
of returning males is accounted for by the straight- 
line relationship with number of breeding pairs the 
previous year? 

Use the information in the figure to find the cor- 
relation r between percent of males that return and 
number of breeding pairs. How do you know whether 
the sign of ris + or —? 


Interpret the value of s in this setting. 


Does social rejection hurt? Exercise 14 (page 161) 
gives data from a study that shows that social exclusion 
causes “real pain.” That is, activity in an area of the 
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S = 0.0250896 R-Sq = 77.1% R-Sq (adj) = 75.1% 


vb 


What is the equation of the least-squares regression 
line for predicting brain activity from social distress 
score? Use the equation to predict brain activity for 
social distress score 2.0. 


What percent of the variation in brain activity among 
these subjects is accounted for by the straight-line 
relationship with social distress score? 


Use the information in the figure to find the correla- 
tion r between social distress score and brain activity. 
How do you know whether the sign of ris + or —? 


Interpret the value of s in this setting. 


Husbands and wives ‘The mean height of married 
American women in their early twenties is 64.5 inches 
and the standard deviation is 2.5 inches. The mean 
height of married men the same age is 68.5 inches, with 
standard deviation 2.7 inches. ‘The correlation between 
the heights of husbands and wives is about r = 0.5. 


Find the equation of the least-squares regression line 
for predicting a husband’s height from his wife’s height 
for married couples in their early 20s. Show your work. 


Suppose that the height of a randomly selected wife 
was | standard deviation below average. Predict the 
height of her husband without using the least-squares 
line. Show your work. 


The stock market Some people think that the behay- 
ior of the stock market in January predicts its behavior 
for the rest of the year. ‘Take the explanatory variable 
x to be the percent change in a stock market index in 
January and the response variable y to be the change 
in the index for the entire year. We expect a positive 
correlation between x and y because the change 
during January contributes to the full year’s change. 
Calculation from data for an 18-year period gives 


E=1.75% s,=536% y = 9.07% 
s, = 15.35% r= 0.596 


Find the equation of the least-squares line for predicting 
full-year change from January change. Show your work. 


Suppose that the percent change in a particular Janu- 
ary was 2 standard deviations above average. Predict 
the percent change for the entire year, without using 
the least-squares line. Show your work. 
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. Husbands and wives Refer to Exercise 61. 


Find r? and interpret this value in context. 


For these data, s = 1.2. Interpret this value. 


. The stock market Refer to Exercise 62. 


Find 7? and interpret this value in context. 


For these data, s = 8.3. Interpret this value. 


. Will I bomb the final? We expect that students 


who do well on the midterm exam in a course will 
usually also do well on the final exam. Gary Smith 
of Pomona College looked at the exam scores of 

all 346 students who took his statistics class over a 
10-year period.” Assume that both the midterm and 
final exam were scored out of 100 points. 


State the equation of the least-squares regression line 
if each student scored the same on the midterm and 
the final. 


The actual least-squares line for predicting final- 
exam score y from midterm-exam score x was 

¥ = 46.6 + 0.41x. Predict the score of a student who 
scored 50 on the midterm and a student who scored 
100 on the midterm. 


Explain how your answers to part (b) illustrate regres- 
sion to the mean. 


It’s still early We expect that a baseball player who 
has a high batting average in the first month of the 
season will also have a high batting average the rest of 
the season. Using 66 Major League Baseball players 
from the 2010 season,”* a least-squares regression 

line was calculated to predict rest-of-season batting 
average y from first-month batting average x. Note: A 
player’s batting average is the proportion of times at 
bat that he gets a hit. A batting average over 0.300 is 
considered very good in Major League Baseball. 


State the equation of the least-squares regression line 
if each player had the same batting average the rest of 
the season as he did in the first month of the season. 


The actual equation of the least-squares regression line 
is ¥ = 0.245 + 0.109x. Predict the rest-of-season batting 
average for a player who had a 0.200 batting average 
the first month of the season and for a player who had a 
0.400 batting average the first month of the season. 


Explain how your answers to part (b) illustrate regres- 
sion to the mean. 


. Beavers and beetles Do beavers benefit beetles? 


Researchers laid out 23 circular plots, each + meters 

in diameter, in an area where beavers were cutting 
down cottonwood trees. In each plot, they counted the 
number of stumps from trees cut by beavers and the 
number of clusters of beetle larvae. Ecologists think 
that the new sprouts from stumps are more tender than 
other cottonwood growth, so that beetles prefer them. 
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Ifso, more stumps should produce more beetle larvae. 
Here are the data:”* 


Stumps: 2 2 3 8 4 8 |] 2 8 i s 
Beetle larvae: 10 30 12 24 36 40 43 11 27 56 18 40 
Stumps: BW ee a ll ee ae 
Beetle larvae: 25 8 21 14 16 6 54 9 13 14 50 


Can we use a linear model to predict the number of 
beetle larvae from the number of stumps? If so, how 
accurate will our predictions be? Follow the four-step 
process. 


. Fat and calories The number of calories in a food 


item depends on many factors, including the amount 
of fat in the item. The data below show the amount 
of fat (in grams) and the number of calories in 7 beef 
sandwiches at McDonalds.” 


Sandwich Fat Calories 
Big Mac® 29 550 
Quarter Pounder® with Cheese 26 520 
Double Quarter Pounder® with Cheese 42 750 
Hamburger 9 250 
Cheeseburger 12 300 
Double Cheeseburger 23 440 
McDouble 19 390 
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Can we use a linear model to predict the number of 
calories from the amount of fat? If so, how accurate will 
our predictions be? Follow the four-step process. 


Managing diabetes People with diabetes measure 
their fasting plasma glucose (FPG; measured in units 
of milligrams per milliliter) after fasting for at least 

8 hours. Another measurement, made at regular 
medical checkups, is called HbA. This is roughly 

the percent of red blood cells that have a glucose 
molecule attached. It measures average exposure to 
glucose over a period of several months. The table 
below gives data on both HbA and FPG for 18 diabet- 
ics five months after they had completed a diabetes 
education class.”” 


HbA FPG HbA FPG 
Subject (%) (mg/mL) Subject (%) (mg/mL) 

1 6.1 141 10 8.7 We 
2 6.3 158 | 9.4 200 
3 6.4 TZ 12 10.4 271 
4 6.8 158) 13 10.6 103 
5 7.0 134 14 10.7 172 
6 th 95 15 10.7 359 
i fe5 96 16 ane 145 
8 Toll 78 We Si 147 
9 7.9 148 18 19.3 255 
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(a) Make a scatterplot with HbA as the explanatory vari- 
able. Describe what you see. 


(b 


m= 


Subject 18 is an outlier in the x direction. What effect 
do you think this subject has on the correlation? What 
effect do you think this subject has on the equation of 
the least-squares regression line? Calculate the correla- 
tion and equation of the least-squares regression line 
with and without this subject to confirm your answer. 


(c) Subject 15 is an outlier in the y direction. What effect 
do you think this subject has on the correlation? What 
effect do you think this subject has on the equation of 
the least-squares regression line? Calculate the correla- 
tion and equation of the least-squares regression line 
with and without this subject to confirm your answer. 


70. Rushing for points What is the relationship between 
rushing yards and points scored in the 2011 National 
Football League? The table below gives the number 
of rushing yards and the number of points scored for 
each of the 16 games played by the 2011 Jacksonville 
Jaguars.”° 


Game Rushing yards Points scored 
1 163 16 
2 112 3 
3 128 10 
4 104 10 
5 96 20 
6 188) 13 
7 132 12 
8 84 14 
9 141 17 

10 108 10 
11 105 ils 
12 129 14 
18 116 za] 
14 116 14 
15 113 We 
16 190 19 


(a) Make a scatterplot with rushing yards as the explana- 
tory variable. Describe what you see. 


(b) ‘The number of rushing yards in Game 16 is an outlier 
in the x direction. What effect do you think this game 
has on the correlation? On the equation of the least- 
squares regression line? Calculate the correlation and 
equation of the least-squares regression line with and 
without this game to confirm your answers. 


(c) The number of points scored in Game 13 is an out- 
lier in the y direction. What effect do you think this 
game has on the correlation? On the equation of the 
least-squares regression line? Calculate the correla- 
tion and equation of the least-squares regression line 
with and without this game to confirm your answers. 
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Multiple choice: Select the best answer for Exercises 71 to 78. 


71. Which of the following is not a characteristic of the 
least-squares regression line? 


(a) The slope of the least-squares regression line is always 
between —1 and 1. 


(b) The least-squares regression line always goes through 
the point (x, y). 

(c) The least-squares regression line minimizes the sum 
of squared residuals. 


(d) The slope of the least-squares regression line will 
always have the same sign as the correlation. 


(e) The least-squares regression line is not resistant to 
outliers. 


72. Each year, students in an elementary school take a 
standardized math test at the end of the school year. 
For a class of fourth-graders, the average score was 55.1 
with a standard deviation of 12.3. In the third grade, 
these same students had an average score of 61.7 with 
a standard deviation of 14.0. The correlation between 
the two sets of scores is r = 0.95. Calculate the equa- 
tion of the least-squares regression line for predicting a 
fourth-grade score from a third-grade score. 


$ = 3.60 +0.835x  (d) $= —11.54 + 1.08x 


¥ = 15.69 + 0.835x (e) Cannot be calculated 
(c) $= 2.19 + 1.08x without the data. 


73. Using data from the 2009 LPGA tour, a regression 
analysis was performed using x = average driving 
distance and y = scoring average. Using the output 
from the regression analysis shown below, determine 
the equation of the least-squares regression line. 


Predictor Coef SE Coef ui P 
Constant 87.974 Ayasisl, 36.78 0.000 
Driving Distance -—0.060934 O.0G9536 —6.39 0-000 


Sy al toal 2s; R=Sq = 22.1% R-Sq(adj) = 21.6% 


(a) ~ = 87.947 + 2.391x 

(b) § = 87.947 + 1.01216x 

(c) § = 87.947 — 0.060934x 
(d) § = —0.060934 + 1.01216x 
(e) § = —0.060934 + 87.947x 


Exercises 74 to 78 refer to the following setting. 
Measurements on young children in Mumbai, India, 
found this least-squares line for predicting height y from 
arm span x:78 

7 — 6.4 + 0193x 


Measurements are in centimeters (cm). 


74. By looking at the equation of the least-squares regres- 
sion line, you can see that the correlation between 
height and arm span is 

(a) greater than zero. 

(b) less than zero. 


(c) 0.93. 

(d) 6.4. 

(e) Can’t tell without seeing the data. 

75. In addition to the regression line, the report on the 
Mumbai measurements says that r? = 0.95. This 
suggests that 

(a) although arm span and height are correlated, arm 
span does not predict height very accurately. 

(b) height increases by V0.95 = 0.97 cm for each ad- 
ditional centimeter of arm span. 

(c) 95% of the relationship between height and arm span 
is accounted for by the regression line. 

(d) 95% of the variation in height is accounted for by the 
regression line. 

(e) 95% of the height measurements are accounted for by 
the regression line. 

76. One child in the Mumbai study had height 59 cm 
and arm span 60 cm. This child’s residual is 

(a) =—3.2 Gm. (@) —1.3 eam. (e)| 62.2 cm. 

(b) —2.2 cm. (d) 3.2 cm. 

77. Suppose that a tall child with arm span 120 cm and 
height 118 cm was added to the sample used in this 
study. What effect will adding this child have on the 
correlation and the slope of the least-squares regres- 
sion line? 

(a) Correlation will increase, slope will increase. 

(b) Correlation will increase, slope will stay the same. 

(c) Correlation will increase, slope will decrease. 

(d) Correlation will stay the same, slope will stay the same. 

(e) Correlation will stay the same, slope will increase. 


78. Suppose that the measurements of arm span and 
height were converted from centimeters to meters 
by dividing each measurement by 100. How will this 
conversion affect the values of r? and s? 


5) era 8 oan o 
(a) r° will increase, s will increase. 
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(b) 7’ will increase, s will stay the same. 

(c) r’ will increase, s will decrease. 

(d) 7’ will stay the same, s will stay the same. 
( 


e) 1’ will stay the same, s will decrease. 


Exercises 79 and 80 refer to the following setting. 


In its recent Fuel Economy Guide, the Environmental 
Protection Agency gives data on 1152 vehicles. There are 
a number of outliers, mainly vehicles with very poor gas 
mileage. If we ignore the outliers, however, the combined 
city and highway gas mileage of the other 1120 or so ve- 
hicles is approximately Normal with mean 18.7 miles per 
gallon (mpg) and standard deviation 4.3 mpg. 


79. In my Chevrolet (2.2) ‘The Chevrolet Malibu with 
=> a four-cylinder engine has a combined gas mileage of 


© 25 mpg. What percent of all vehicles have worse gas 


mileage than the Malibu? 
. The top 10% (2.2) How high must a vehicle’s gas 


80 
a, mileage be in order to fall in the top 10% of all 


vehicles? (The distribution omits a few high outliers, 
mainly hybrid gas-electric vehicles.) 
81. Marijuana and traffic accidents (1.1) Researchers 
_ in New Zealand interviewed 907 drivers at age 21. 


€ They had data on traffic accidents and they asked the 


drivers about marijuana use. Here are data on the 

numbers of accidents caused by these drivers at age 

19, broken d b ij h ee 
, broken down by marijuana use at the same age: 


Marijuana use per year 
Never 1-10times 11-50times 51 + times 
Drivers 452 229 70 156 


Accidents caused 59 36 5) 50 


(a) Make a graph that displays the accident rate for each 
class. Is there evidence of an association between 
marijuana use and traffic accidents? 


(b) Explain why we can’t conclude that marijuana use 
causes accidents. 


Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


‘Two statistics students went to a flower shop and ran- 
domly selected 12 carnations. When they got home, the 
students prepared 12 identical vases with exactly the same 
amount of water in each vase. ‘They put one tablespoon of 
sugar in 3 vases, two tablespoons of sugar in 3 vases, and 
three tablespoons of sugar in 3 vases. In the remaining 
3 vases, they put no sugar. After the vases were prepared, 
the students randomly assigned | carnation to each vase 
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and observed how many hours each flower continued to 
look fresh. A scatterplot of the data is shown below. 


240 4 
230 4 
220 4 
210 


Freshness (h) 


T 
2 


Sugar (tbsp) 


(a) Briefly describe the association shown in the 
scatterplot. 

(b) ‘The equation of the least-squares regression line 
for these data is y = 180.8 + 15.8x. Interpret the 
slope of the line in the context of the study. 


Section 3.1: Scatterplots and Correlation 


In this section, you learned how to explore the relationship 
between two quantitative variables. As with distributions of 
a single variable, the first step is always to make a graph. A 
scatterplot is the appropriate type of graph to investigate as- 
sociations between two quantitative variables. ‘To describe a 
scatterplot, be sure to discuss four characteristics: direction, 
form, strength, and outliers. The direction of an associa- 
tion might be positive, negative, or neither. The form of 
an association can be linear or nonlinear. An association is 
strong if it closely follows a specific form. Finally, outliers 
are any points that clearly fall outside the pattern of the rest 
of the data. 

The correlationrisa numerical summary that describes 
the direction and strength of a linear association. When 
r > 0, the association is positive, and when r < 0, the 
association is negative. The correlation will always take 
values between —1 and 1, with r = —1 andr = 1 indicat- 
ing a perfectly linear relationship. Strong linear associa- 
tions have correlations near 1 or —1, while weak linear 
relationships have correlations near 0. However, it isn’t 
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(c) Calculate and interpret the residual for the flower 
that had 2 tablespoons of sugar and looked fresh for 
204 hours. 
Suppose that another group of students conducted 
a similar experiment using 12 flowers, but in- 
cluded different varieties in addition to carnations. 
Would you expect the value of r” for the second 
group’s data to be greater than, less than, or about 
the same as the value of r’ for the first group’s data? 
Explain. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 


possible to determine the form of an association from 
only the correlation. Strong nonlinear relationships can 
have a correlation close to | or a correlation close to 0, 
depending on the association. You also learned that out- 
liers can greatly affect the value of the correlation and 
that correlation does not imply causation. That is, we 
can’t assume that changes in one variable cause changes 
in the other variable, just because they have a correla- 
tion close to | or—l. 


Section 3.2: Least-Squares Regression 


In this section, you learned how to use least-squares re- 
gression lines as models for relationships between vari- 
ables that have a linear association. It is important to 
understand the difference between the actual data and 
the model used to describe the data. For example, when 
you are interpreting the slope of a least-squares regression 
line, describe the predicted change in the y variable. ‘To 
emphasize that the model only provides predicted val- 
ues, least-squares regression lines are always expressed in 
terms of f instead of y. 


The difference between the observed value of y and the 
predicted value of y is called a residual. Residuals are the key 
to understanding almost everything in this section. To find 
the equation of the least-squares regression line, find the line 
that minimizes the sum of the squared residuals. 'To see if a 
linear model is appropriate, make a residual plot. If there is 
no leftover pattern in the residual plot, you know the model 
is appropriate. ‘To assess how well a line fits the data, calculate 
the standard deviation of the residuals s to estimate the size 
of a typical prediction error. You can also calculate 7”, which 


What Did You Learn? 


Learning Objective 


measures the fraction of the variation in the y variable that is 
accounted for by its linear relationship with the x variable. 
You also learned how to obtain the equation of a least- 
squares regression line from computer output and from 
summary statistics (the means and standard deviations of 
two variables and their correlation). As with the correlation, 
the equation of the least-squares regression line and the val- 
ues of s and r’ can be greatly influenced by outliers, so be 
sure to plot the data and note any unusual values before 


making any calculations. 


Section Related Example 


on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Identify explanatory and response variables in situations where one 
variable helps to explain or influences the other. 


144 


R3.4 


Make a scatterplot to display the relationship between two 
quantitative variables. 


145, 148 


R3.4 


Describe the direction, form, and strength of a relationship 
displayed in a scatterplot and recognize outliers in a scatterplot. 


147, 148 


R3.1 


Interpret the correlation. 


iloZ 


R3.3, R3.4 


Understand the basic properties of correlation, including how the 
correlation is influenced by outliers. 


182, 11S), WS 


R3.1, R3.2 


Use technology to calculate correlation. 


Activity on 152, 171 


R3.4 


Explain why association does not imply causation. 


Discussion on 156, 190 


R3.6 


Interpret the slope and y intercept of a least-squares regression line. 


166 


R3.2, R3.4 


Use the least-squares regression line to predict y for a given x. 
Explain the dangers of extrapolation. 


167, Discussion on 168 
(for extrapolation) 


R3.2, R3.4, R3.5 


Calculate and interpret residuals. 


169 


R3.3, R3.4 


Explain the concept of least squares. 


Discussion on 169 


R3.5 


Determine the equation of a least-squares regression line using 
technology or computer output. 


Technology Corner on 
171, 181 


R3.3, R3.4 


Construct and interpret residual plots to assess whether a linear 
model is appropriate. 


Interpret the standard deviation of the residuals and r? and use 
these values to assess how well the least-squares regression line 
models the relationship between two variables. 


Discussion on 175, 180 


180 


R3.3, R3.4 


R3.3, R3.5 


Describe how the slope, y intercept, standard deviation of the 
residuals, and r? are influenced by outliers. 


Discussion on 188 


R3.1 


Find the slope and y intercept of the least-squares regression 
line from the means and standard deviations of x and y and their 
correlation. 


201 


202 


CHAPTER 3 


DESCRIBING RELATIONSHIPS 


Chapter 3 Chapter Review Exercises 


These exercises are designed to help you review the important 
ideas and methods of the chapter. 


R3.1 


R3.2 


(a) 
(b) 
(c) 
(d) 


Born to be old? Is there a relationship between the 
gestational period (time from conception to birth) 
of an animal and its average life span? The figure 
shows a scatterplot of the gestational period and av- 
erage life span for 43 species of animals. *” 
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Describe the association shown in the scatterplot. 


Point A is the hippopotamus. What effect does this 
point have on the correlation, the equation of the 
least-squares regression line, and the standard de- 
viation of the residuals? 


Point B is the Asian elephant. What effect does this 
point have on the correlation, the equation of the 
least-squares regression line, and the standard de- 
viation of the residuals? 


Penguins diving A study of king penguins looked 
for a relationship between how deep the penguins 
dive to seek food and how long they stay under 
water.*! For all but the shallowest dives, there is a 
linear relationship that is different for different pen- 
guins. The study gives a scatterplot for one penguin 
titled “The Relation of Dive Duration (y) to Depth 
(x).” Duration y is measured in minutes and depth 
x is in meters. ‘he report then says, “The regression 
equation for this bird is: ¥ = 2.69 + 0.0138x.” 


What is the slope of the regression line? Interpret 
this value. 


Does the y intercept of the regression line make any 
sense? If so, interpret it. If not, explain why not. 


According to the regression line, how long does a 
typical dive to a depth of 200 meters last? 


Suppose that the researchers reversed the variables, 
using x = dive duration and y = depth. What effect 
will this have on the correlation? On the equation 
of the least-squares regression line? 


R3.3 Stats teachers’ cars A random sample of AP® Sta- 
tistics teachers was asked to report the age (in years) 
and mileage of their primary vehicles. A scatterplot 
of the data, a least-squares regression printout, and 
a residual plot are provided below. 

Predictor Coef SE eCoer un P 
Constant 3704 8268 0.45 0.662 
Age 2188 1492 8.17 0.000 
S = 20870.5 R-Sq = 83.7% R-Sq(adj) = 82.4% 
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(a) Give the equation of the least-squares regression 
line for these data. Identify any variables you use. 
(b) One teacher reported that her 6-year-old car had 
65,000 miles on it. Find and interpret its residual. 
(c) What’s the correlation between car age and mile- 
age? Interpret this value in context. 
(d) Is a linear model appropriate for these data? Ex- 
plain how you know. 
(e) Interpret the values of s and 7’. 
R3.4 Late bloomers? Japanese cherry trees tend to blos- 


som early when spring weather is warm and later 
when spring weather is cool. Here are some data 
on the average March temperature (in °C) and the 
day in April when the first cherry blossom appeared 
over a 24-year period: *” 


Temperature (°C): 


4.0 5.4 3.2 2.6 4.2 4.7 4.9 4.0 4.9 3.8 4.0 5.1 


Days in April 


to first bloom: 


14 8 11:19 14 14 14 21 9 14 13 11 


Temperature (°C): 


4.3 1.5 3.7 3.8 4.5 4.1 6.1 6.2 5.1 5.0 4.6 4.0 


Days in April 


to first bloom: 


ie 43) Ue 1) WO av ss il & © iil 


(a) Make a well-labeled scatterplot that’s suitable for pre- 
dicting when the cherry trees will bloom from the 
temperature. Which variable did you choose as the 
explanatory variable? Explain. 


(b) Use technology to calculate the correlation and the 
equation of the least-squares regression line. Inter- 
pret the correlation, slope, and y intercept of the line 
in this setting. 

(c) Suppose that the average March temperature this year 
was 8.2°C. Would you be willing to use the equation 
in part (b) to predict the date of first bloom? Explain. 


(d) Calculate and interpretthe residual for the year when the 
average March temperature was 4.5°C. Show your work. 


(e) Use technology to help construct a residual plot. De- 
scribe what you see. 


R3.5 What’s my grade? In Professor Friedman’s econom- 
ics course, the correlation between the students’ total 
scores prior to the final examination and their final- 
examination scores is r = 0.6. The pre-exam totals for 
all students in the course have mean 280 and stan- 
dard deviation 30. The final-exam scores have mean 
75 and standard deviation 8. Professor Friedman has 
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lost Julie’s final exam but knows that her total before 
the exam was 300. He decides to predict her final- 
exam score from her pre-exam total. 


(a) Find the equation for the appropriate least-squares 
regression line for Professor Friedman’s prediction. 

(b) Use the least-squares regression line to predict Julie’s 
final-exam score. 


(c) Explain the meaning of the phrase “least squares” in 
the context of this question. 


(d) Julie doesn’t think this method accurately predicts 
how well she did on the final exam. Determine ’. 
Use this result to argue that her actual score could 
have been much higher (or much lower) than the 
predicted value. 


R3.6 Calculating achievement The principal of a high 
school read a study that reported a high correlation 
between the number of calculators owned by high 
school students and their math achievement. Based 
on this study, he decides to buy each student at his 
school two calculators, hoping to improve their 
math achievement. Explain the flaw in the princi- 
pal’s reasoning. 
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Section I: Multiple Choice Select the best answer for each question. 


T3.1 A school guidance counselor examines the number 
of extracurricular activities that students do and their 
grade point average. ‘The guidance counselor says, 
“The evidence indicates that the correlation between 
the number of extracurricular activities a student par- 
ticipates in and his or her grade point average is close 
to zero.” A correct interpretation of this statement 
would be that 

(a) active students tend to be students with poor grades, 
and vice versa. 

(b) students with good grades tend to be students who 
are not involved in many extracurricular activities, 
and vice versa. 

(c) students involved in many extracurricular activities are 
just as likely to get good grades as bad grades; the same is 
true for students involved in few extracurricular activities. 

(d) there is no linear relationship between number of activ- 
ities and grade point average for students at this school. 


— 
oO 
a 


involvement in many extracurricular activities and 


good grades go hand in hand. 


13.2 The British government conducts regular surveys 
of household spending. The average weekly house- 
hold spending (in pounds) on tobacco products and 


alcoholic beverages for each of 11] regions in Great 
Britain was recorded. A scatterplot of spending on 
alcohol versus spending on tobacco is shown below. 
Which of the following statements is true? 


6.5 4 


T 
3.0 3.5 4.0 45 
Tobacco 


(a) The observation (4.5, 6.0) is an outlier. 

(b) There is clear evidence of a negative association be- 
tween spending on alcohol and tobacco. 

(c) The equation of the least-squares line for this plot 
would be approximately ¥ = 10 — 2x. 

(d) The correlation for these data is r = 0.99. 

(e) The observation in the lower-right corner of the plot is 
influential for the least-squares line. 
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13.3 The fraction of the variation in the values of y that is 
explained by the least-squares regression of y on x is 
the correlation. 


) 

) the slope of the least-squares regression line. 
) the square of the correlation coefficient. 
) 

) 


a eS 
Le oes 


the intercept of the least-squares regression line. 
(e) the residual. 


T3.4 An AP® Statistics student designs an experiment to see 
whether today’s high school students are becoming too 
calculator-dependent. She prepares two quizzes, both 
of which contain 40 questions that are best done using 
paper-and-pencil methods. A random sample of 30 stu- 
dents participates in the experiment. Each student takes 
both quizzes—one with a calculator and one without— 
in a random order. To analyze the data, the student con- 
structs a scatterplot that displays the number of correct 
answers with and without a calculator for each of the 30 
students. A least-squares regression yields the equation 


ee 
Calculator = —1.2 + 0.865(Pencil) r= 0.79 


Which of the following statements is/are true? 
I. Ifthe student had used Calculator as the explanatory 
variable, the correlation would remain the same. 

Il. Ifthe student had used Calculator as the explanato- 
ty variable, the slope of the least-squares line would 
remain the same. 

Ill. ‘The standard deviation of the number of correct an- 
swers on the paper-and-pencil quizzes was larger than 
the standard deviation on the calculator quizzes. 

(a) I only (c) I only (e) I, Il, and Ill 

(b) II only (d) [and III only 


Questions T3.5 and T3.6 refer to the following setting. Scien- 
tists examined the activity level of 7 fish at different tempera- 
tures. Fish activity was rated on a scale of 0 (no activity) to 100 
(maximal activity). The temperature was measured in degrees 
Celsius. A computer regression printout and a residual plot 
are given below. Notice that the horizontal axis on the residual 
plot is labeled “Fitted value.” 


Predictor Coef SE Coef T P 
Constant 148.62 AOS T/L ALS) i833) 0.000 
Temperature =3.2167) 0.4533 =7. 10 (0) (Oio)aL 
S = 4.78505 R-Sq = 91.0% R-Sq(adj) = 89.2% 
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5 60 65 70 163 80 85 90 95 
Fitted value 


13.5 What was the activity level rating for the fish at a 
temperature of 20°C? 
(ayes7  ib)es4. a(c)) sie widiro6 (Ee) 3 
13.6 Which of the following gives a correct interpreta- 
tion of s in this setting? 
(a) For every 1°C increase in temperature, fish activity 
is predicted to increase by 4.785 units. 
(b) The typical distance of the temperature readings 
from their mean is about 4.785°C. 
(c) The typical distance of the activity level ratings 
from the least-squares line is about 4.785 units. 
(d) The typical distance of the activity level readings 
from their mean is about 4.785. 
(e) Ata temperature of 0°C, this model predicts an ac- 
tivity level of 4.785. 


T3.7 Which of the following statements is not true of 
the correlation r between the lengths in inches and 
weights in pounds of a sample of brook trout? 


(a) rmust take a value between —] and 1. 

(b) ris measured in inches. 

(c) If longer trout tend to also be heavier, then r > 0. 
(d) r would not change if we measured the lengths of 


the trout in centimeters instead of inches. 


(e) r would not change if we measured the weights of 
the trout in kilograms instead of pounds. 


T3.8 When we standardize the values of a variable, the 
distribution of standardized values has mean 0 and 
standard deviation 1. Suppose we measure two 
variables X and Y on each of several subjects. We 
standardize both variables and then compute the 
least-squares regression line. Suppose the slope of 
the least-squares regression line is —0.44. We may 
conclude that 


(a) the intercept will also be —0.44. 
(b) the intercept will be 1.0. 

(c) the correlation will be 1/—0.44. 
(d) the correlation will be 1.0. 

(e) 


e) the correlation will also be —0.44. 


13.9 There is a linear relationship between the number of 
chirps made by the striped ground cricket and the air 
temperature. A least-squares fit of some data collect- 
ed by a biologist gives the model f = 25.2 + 3.3x, 
where x is the number of chirps per minute and 
is the estimated temperature in degrees Fahrenheit. 
What is the predicted increase in temperature for an 
increase of 5 chirps per minute? 


(a) 3.3°F (co) 25.28 (ec) 41.7°F 
(b) 16.5°F (d) 28.5°F 
13.10 Adatasetincluded the number of people pertelevision 


set and the number of people per physician for 
40 countries. The Fathom screen shot below displays 


= 


a scatterplot of the data with the least-squares regres- 
sion line added. In Ethiopia, there were 503 people 
per ‘I'V and 36,660 people per doctor. What effect 
would removing this point have on the regression line? 


) Slope would increase; y intercept would increase. 
) Slope would increase; y intercept would decrease. 
) Slope would decrease; y intercept would increase. 
) Slope would decrease; y intercept would decrease. 
) Slope and y intercept would stay the same. 


AP® Statistics Practice Test be 205 


{ Scatter Pot ie 


televisions 


0 100 200 300 400 S00 600 
PpiperTV 
plperDoc = 1400 + 29.6PpiperTV; = = 0.38 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T3.11 Sarah’s parents are concerned that she seems short 
for her age. Their doctor has the following record of 
Sarah’s height: 


Age (months): 36 48 51 54 57 60 
Height (cm): 86 90 91 93 94 95 


(a) Make a scatterplot of these data. 

(b) Using your calculator, find the equation of the least- 
squares regression line of height on age. 

(c) Use your regression line to predict Sarah’s height at 
age 40 years (480 months). Convert your prediction 
to inches (2.54 cm = | inch). 

(d) The prediction is impossibly large. Explain why this 
happened. 

13.12 Drilling down beneath a lake in Alaska yields chemi- 
cal evidence of past changes in climate. Biological 
silicon, left by the skeletons of single-celled creatures 
called diatoms, is a measure of the abundance of life 
in the lake. A rather complex variable based on the 
ratio of certain isotopes relative to ocean water gives 
an indirect measure of moisture, mostly from snow. 
As we drill down, we look further into the past. Here 
is a scatterplot of data from 2300 to 12,000 years ago: 


Silicon 


214 210 -206 -202 -108 -194 
Isotope 
(a) Identify the unusual point in the scatterplot. Explain 
what’s unusual about this point. 
(b) If this point was removed, describe the effect on 
i. the correlation. 
ii. the slope and y intercept of the least-squares line. 
iii. the standard deviation of the residuals. 
13.13 Long-term records from the Serengeti National Park 


in Tanzania show interesting ecological relationships. 
When wildebeest are more abundant, they graze the 


grass more heavily, so there are fewer fires and more 
trees grow. Lions feed more successfully when there are 
more trees, so the lion population increases. Research- 
ers collected data on one part of this cycle, wildebeest 
abundance (in thousands of animals) and the percent 
of the grass area burned in the same year. The results of 
a least-squares regression on the data are shown here.*® 
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Constant C48) 10.06 Ona 0.000 
Wildebeest (1000s) -0.05762 0.01035 =6256 0-000 
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(a) Give the equation of the least-squares regression 
line. Be sure to define any variables you use. 

(b) Explain what the slope of the regression line means 
in this setting. 

(c) Find the correlation. Interpret this value in context. 

(d) Is a linear model appropriate for describing the rela- 
tionship between wildebeest abundance and percent 
of grass area burned? Support your answer with appro- 
priate evidence. 
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Designing 
Studies 


Can Magnets Help Reduce Pain? 


Early research showed that magnetic fields affected living tissue in humans. Some doctors have begun to 
use magnets to treat patients with chronic pain. Scientists wondered whether this type of therapy really 
worked. They designed a study to find out. 

Fifty patients with chronic pain were recruited for the study. A doctor identified a painful site on each 
patient and asked him or her to rate the pain on a scale from 0 (mild pain) to 10 (severe pain). Then, the 
doctor selected a sealed envelope containing a magnet at random from a box with a mixture of active and 
inactive magnets. That way, neither the doctor nor the patient knew which type of magnet was being used. 
The chosen magnet was applied to the site of the pain for +5 minutes. After “treatment,” each patient was 
again asked to rate the level of pain from 0 to 10. 

In all, 29 patients were given active magnets and 21 patients received inactive magnets. Scientists de- 
cided to focus on the improvement in patients’ pain ratings. Here they are, grouped by the type of magnet 
used:! 


Active: VO 6. 110) G8) 5S 6°87 8 6 a a I 6-10 G5 > 0 Oe 
Inactive: --h- 3 a oe ae A TO eS dl oe UE 
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Introduction 


You can hardly go a day without hearing the results of a statistical study. Here are 

some examples: 

e The National Highway Traffic Safety Administration (NHTSA) reports 
that seat belt use in passenger vehicles increased from 84% in 2011 to 86% 


in 20127 
i mu 8 According to a recent survey, U.S. teens aged 13 to 18 spend 
| an average of 26.8 hours per week online. Although 59% 
Ne] . ' of the teens said that posting personal information or pho- 
> tos online is unsafe, 62% said they had posted photos of 


themselves.’ 


e Arecent study suggests that lack of sleep increases the risk of 
catching a cold.* 


¢ For their final project, two AP® Statistics students showed 
that listening to music while studying decreased subjects’ 
La : : performance on a memory task.’ 


Can we trust these results? As you'll learn in this chapter, that depends on how the 
data were produced. Let’s take a closer look at where the data came from in each 
of these studies. 

Each year, the NHTSA conducts an observational study of seat belt use in 
vehicles. The NHTSA sends trained observers to record the actual behavior of 
people in vehicles at randomly selected locations across the country. The idea 
of an observational study is simple: you can learn a lot just by watching. Or by 
asking a few questions, as in the survey of teens’ online habits. Harris Interactive 
conducted this survey using a “representative sample” of 655 U.S. 13- to 18-year- 
olds. Both of these studies use information from a sample to draw conclusions 
about some larger population. Section 4.1 examines the issues involved in sam- 
pling and surveys. 

In the sleep and catching a cold study, 153 volunteers took part. They answered 
questions about their sleep habits over a two-week period. Then, researchers gave 
them a virus and waited to see who developed a cold. This was a complicated 
observational study. Compare this with the experiment performed by the AP® 
Statistics students. They recruited 30 students and divided them into two groups 
of 15 by drawing names from a hat. Students in one group tried to memorize a list 
of words while listening to music. Students in the other group tried to memorize 
the same list of words while sitting in silence. Section 4.2 focuses on designing 
experiments. 

The goal of many statistical studies is to show that changes in one vari- 
able cause changes in another variable. In Section 4.3, we'll look at why 
establishing causation is so difficult, especially in observational studies. We'll 
also consider some of the ethical issues involved in planning and conducting 
a study. 

Here’s an Activity that gives you a preview of what lies ahead. 
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ACTIVITY | See no evil, hear no evil? 


MATERIALS: Confucius said, “I hear and I forget. I see and I remember. I do and I understand.” 
Two index cards, each Do people really remember what they see better than what they hear?® In this 
with 10 distinct numbers Activity, you will perform an experiment to try to find out. 

from 00 to 99 written on it 1. Divide the class into pairs of students by drawing names from a hat. 

(prepared by your teacher); 
clock, watch, or stopwatch 
to measure 30 seconds; 


and a coin for each pair of 
students 3. Flip a coin to decide which of you is Student | and which is Student 2. 


Shuffle the index cards and deal one face down to each partner. 
4. Student | will be the first to attempt a memory task while Student 2 keeps time. 


2. Your teacher will give each pair two index cards with 10 distinct numbers 
from 00 to 99 on them. Do not look at the numbers until it is time for you to do 
the experiment. 


Directions: Study the numbers on the index card for 30 seconds. Then turn the 
card over. Recite the alphabet aloud (A, B, C, and so on). Then tell your partner 
what you think the numbers on the card are. You may not say more than 10 num- 
bers! Student 2 will record how many numbers you recalled correctly. 

5. Now it’s Student 2’s turn to do a memory task while Student | records the data. 
Directions: Your partner will read the numbers on your index card aloud three 
times slowly. Next, you will recite the alphabet aloud (A, B, C, and so on) and 
then tell your partner what you think the numbers on the card are. You may not 
say more than 10 numbers! Student | will record how many numbers you recalled 
correctly. 

6. Your teacher will scale and label axes on the board for parallel dotplots of the re- 
sults. Plot how many numbers you remembered correctly on the appropriate graph. 
7. Did students in your class remember numbers better when they saw them or 
when they heard them? Give appropriate evidence to support your answer. 

8. Based on the results of this experiment, can we conclude that people in 
general remember better when they see than when they hear? Why or why not? 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


Identify the population and sample in a statistical study. e Distinguish a simple random sample from a stratified 
Identify voluntary response samples and convenience random sample or cluster sample. Give the advantages 
samples. Explain how these sampling methods can lead and disadvantages of each sampling method. 

to bias. Explain how undercoverage, nonresponse, question 
Describe how to obtain a random sample using slips of wording, and other aspects of a sample survey can lead 
paper, technology, or a table of random digits. to bias. 
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Suppose we want to find out what percent of young drivers in the United States 
text while driving. To answer the question, we will survey 16- to 20-year-olds 
who live in the United States and drive. Ideally, we would ask them all (take a 
census). But contacting every driver in this age group wouldn’t be practical: it 
would take too much time and cost too much money. Instead, we put the question 
to a sample chosen to represent the entire population of young drivers. 


DEFINITION: Population, census, and sample 


The population in a statistical study is the entire group of individuals we want 
information about. A census collects data from every individual in the population. 


A sample is a subset of individuals in the population from which we actually 
collect data. 


The distinction between population and sample is basic to statistics. To make 
sense of any sample result, you must know what population the sample represents. 
Here’s an example that illustrates this distinction and also introduces some major 
uses of sampling. 


Sampling Hardwood and Humans 


i Populations and samples 
PROBLEM: Identify the population and the sample in each of the following settings. 
(a) A furniture maker buys hardwood in large batches. The supplier is supposed to dry the wood 
before shipping (wood that isn’t dry won't hold its size and shape). The furniture maker chooses five 
pieces of wood from each batch and tests their moisture content. If any piece exceeds 12% moisture 
content, the entire batch is sent back. 
(b) Each week, the Gallup Poll questions a sample of about 1500 adult U.S. residents to determine 
national opinion on a wide variety of issues. 
. SOLUTION: 

ns st (a) The population is all the pieces of hardwood in a batch. The sample is the five pieces of wood that 

1 are selected from that batch and tested for moisture content. 
(b) Gallup’s population is all adult U.S. residents. Their sample is the 1500 adults who actually 
respond to the survey questions. 


For Practice Try Exercise 


The Idea of a Sample Survey 


We often draw conclusions about a whole population on the basis of a sample. 
Have you ever tasted a sample of ice cream and ordered a cone if the sample tastes 
good? Because ice cream is fairly uniform, the single taste represents the whole. 
Choosing a representative sample from a large and varied population (like all 
young U.S. drivers) is not so easy. The first step in planning a sample survey is to 
say exactly what population we want to describe. The second step is to say exactly 
what we want to measure, that is, to give exact definitions of our variables. 


The sampling method that yields 

a convenience sample is called 
convenience sampling. Other sampling 
methods are named in similarly 
obvious ways! 
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We reserve the term “sample survey” for studies that use an organized plan to 
choose a sample that represents some specific population, like the pieces of hard- 
wood and the U.S. adults in the previous example. By our definition, the popula- 
tion in a sample survey can consist of people, animals, or things. Some people use 
the terms “survey” or “sample survey” to refer only to studies in which people are 
asked one or more questions, like the Gallup Poll of the last example. We'll avoid 
this more restrictive terminology. 


How Does the Current Population 
Survey Work? 


A sample survey 


One of the most important government sample surveys in the United States is 
the monthly Current Population Survey (CPS). The CPS contacts about 60,000 
households each month. It produces the monthly unemployment rate and lots of 
other economic and social information. To measure unemployment, we must first 
specify the population we want to describe. The CPS defines its population as all 
US. residents (legal or not) 16 years of age and over who are civilians and are not 
in an institution such as a prison. The unemployment rate announced in the news 
refers to this specific population. 


What does it mean to be “unemployed”? Someone who is not looking for work— 
for example, a full-time student—should not be called unemployed just because 
she is not working for pay. If you are chosen for the CPS sample, the interviewer 
first asks whether you are available to work and whether you actually looked for 
work in the past four weeks. If not, you are neither employed nor unemployed— 
you are not in the labor force. 


If you are in the labor force, the interviewer goes on to ask about employment. If you 
did any work for pay or in your own business during the week of the survey, you are 
employed. If you worked at least 15 hours in a family business without pay, you are 
employed. You are also employed if you have a job but didn’t work because of vaca- 
tion, being on strike, or other good reason. An unemployment rate of 9.7% means 
that 9.7% of the sample was unemployed, using the exact CPS definitions of both 
“labor force” and “unemployed.” 


The final step in planning a sample survey is to decide how to choose a sample 
from the population. Let’s take a closer look at some good and not-so-good sam- 
pling methods. 


How to Sample Badly 


Suppose we want to know how long students at a large high school spent do- 
ing homework last week. We might go to the school library and ask the first 
30 students we see about their homework time. The sample we get is known 
as a convenience sample. 
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Voluntary response samples are also 
known as self-selected samples. 


The Internet brings voluntary response 
samples to the computer nearest you. 
Visit www.misterpoll.com to become 
part of the sample in any of dozens of 
online polls. As the site says, “None 
of these polls are ‘scientific,’ but 

do represent the collective opinion 

of everyone who participates.” 
Unfortunately, such polls don’t tell 
you anything about the views of the 
population. 


DESIGNING STUDIES 


Convenience sampling often produces unrepresentative data. Consider 
our sample of 30 students from the school library. It’s unlikely that this 
convenience sample accurately represents the homework habits of all stu- 


dents at the high school. In fact, if we were to repeat this sampling process again 
and again, we would almost always overestimate the average homework time in 
the population. Why? Because students who hang out in the library tend to be 
more studious. This is bias: using a method that favors some outcomes over others. 


AP® EXAM TIP If you’re asked to describe how the design of a study leads to bias, you’re 
expected to do two things: (1) identify a problem with the design, and (2) explain how this 
problem would lead to an underestimate or overestimate. Suppose you were asked, “Explain 
how using your statistics class as a sample to estimate the proportion of all high school 
students who own a graphing calculator could result in bias.” You might respond, “This is a 
convenience sample. It would probably include a much higher proportion of students with a 
graphing calculator than in the population at large because a graphing calculator is required 
for the statistics class. So this method would probably lead to an overestimate of the actual 
population proportion.” 


Bias is not just bad luck in one sample. It’s the result of a bad study 
design that will consistently miss the truth about the population in the 
same way. Convenience samples are almost guaranteed to show bias. So 
are voluntary response samples. 


Call-in, text-in, write-in, and many Internet polls rely on voluntary 
response samples. People who choose to participate in such surveys are 
usually not representative of some larger population of interest. Voluntary 
response samples attract people who feel strongly about an issue, and who 
often share the same opinion. That leads to bias. 
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Illegal Immigration 
Online polls 


Former CNN commentator Lou Dobbs doesn’t like illegal immigration. One 
of his shows was largely devoted to attacking a proposal to offer driver’s licenses 
to illegal immigrants. During the show, Mr. Dobbs invited his viewers to go 
to loudobbs.com to vote on the question “Would you be more or less likely to 
vote for a presidential candidate who supports giving driver’s licenses to illegal 
aliens? The result: 97% of the 7350 people who voted by the end of the show 
said, “Less likely.” 


PROBLEM: What type of sample did Mr. Dobbs use in his poll? Explain how this sampling method 
could lead to bias in the poll results. 


SOLUTION: Mr. Dobbs used a voluntary response sample: people chose to go online and 
respond. Those who voted were viewers of Mr. Dobbs’s program, which means that they are likely 
to support his views. The 97% poll result is probably an extreme overestimate of the percent of 
people in the population who would be less likely to support a presidential candidate with this 
position. 


For Practice Try Exercise 9 | 


CHECK YOUR UNDERSTANDING 


For each of the following situations, identify the sampling method used. Then explain 
how the sampling method could lead to bias. 

1. A farmer brings a juice company several crates of oranges each week. A company 
inspector looks at 10 oranges from the top of each crate before deciding whether to buy 
all the oranges. 

2. The ABC program Nightline once asked whether the United Nations should 
continue to have its headquarters in the United States. Viewers were invited to call one 


telephone number to respond “Yes” and another for “No.” There was a charge for calling 
either number. More than 186,000 callers responded, and 67% said “No.” 


How to Sample Well: 
Simple Random Sampling 


In convenience sampling, the researcher chooses easy-to-reach members of the 
population. In voluntary response sampling, people decide whether to join the 
sample. Both sampling methods suffer from bias due to personal choice. The best 
way to avoid this problem is to let chance choose the sample. That’s the idea of 
random sampling. 
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DEFINITION: Random Sampling 


Random sampling involves using a chance process to determine which members 
of a population are included in the sample. 


In everyday life, some people use the The easiest way to choose a random sample of n people is to write their names 
SI tah i te oa nara, on identical slips of paper, put the slips in a hat, mix them well, and pull out slips 


as in “that’s so random.” In statistics ; . i : 
‘aie . An alternati HI i h mem- 
andbin manne “dua inches: one ata time until you have n of them. An alternative would be to give each me 


Don't say that a sample was chosen ber of the population a distinct number and to use the “hat method” with these 
at random if a chance process wasn’t © numbers instead of people’s names. Note that this version would work just as well 
used to select the individuals. if the population consisted of animals or things. The resulting sample is called a 


simple random sample, or SRS for short. 


(MM 


DEFINITION: Simple Random Sample (SRS) 


A simple random sample (SRS) of size nis chosen in such a way that every group 
of n individuals in the population has an equal chance to be selected as the sample. 


An SRS gives every possible sample of the desired size an equal chance to 
be chosen. It also gives each member of the population an equal chance to 
be included in the sample. Picture drawing 20 slips (the sample) from a hat 
containing 200 identical slips (the population). Any 20 slips have the same 
chance as any other 20 to be chosen. Also, each slip has a 1-in-10 chance 
(20/200) of being selected. 

Some other random sampling methods give each member of the popu- 
lation, but not each sample, an equal chance. We'll look at some of these 
later. 


How to Choose a Simple Random Sample The hat method won’t 
work well if the population is large. Imagine trying to take a simple random 
sample of 1000 U.S. adults! In practice, most people use random numbers gen- 
erated by technology to choose samples. 


Statisticians Fall asleep Paster by 
taking a random sample of sheep. 


Teens on the Internet 


Choosing an SRS with technology 


The principal at Canyon del Oro High School in Arizona wants student input 
about limiting access to certain Internet sites on the school’s computers. He asks 
the AP® Statistics teacher, Mr. Tabor, to select a “representative sample” of 10 stu- 
dents. Mr. Tabor decides to take an SRS from the 1750 students enrolled this year. 
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He gets an alphabetical roster from the registrar’s office, and numbers the stu- 
dents from 1 to 1750. Then Mr. Tabor uses the random number generator at 
www.randomizer.org to choose 10 distinct numbers between | and 1750: 


Print =! Download in Excel [X) Close [x] 


ANDOMIZER 


Research Randomizer Results 


1 Set of 10 Unique Numbers Per Sct 
Range: From 1 to 1750 -- Sorted from Least to Greatest 


Job Status: Finished 


Set #1; 


117, 311, 461, 724, 843, 854, 1073, 1131, 1713, 1720 


The 10 students on the roster that correspond to the chosen numbers will be on 
the principal’s committee. 


This example highlights the steps in choosing a simple random sample with 
technology. 


CHOOSING AN SRS WITH TECHNOLOGY 


It is standard practice to use n for the 
sample size and WV for the population 
size. 


You can also use a graphing calculator to choose an SRS. 


TECHNOLOGY 


CORNER CHOOSING AN SRS 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s use a graphing calculator to select an SRS of 10 students from the Canyon del Oro High School roster. 


1. Check that your calculator’s random number generator is working properly. 


TI-83/84 TI-89 
e Press [MATH], then select PRB and randInt (. e Press| CATALOG |, then|F3 | (Flash Apps) and choose 
Complete the command randInt (1,1750)and randInt (. Complete the command TIStat. 
press [ENTER |, randInt (1, 1750) and press |ENTER |. 


Compare your results with those of your classmates. If several students got the same number, you'll need to seed your 
calculator’s random integer generator with different numbers before you proceed. Directions for doing this are given in the 
Annotated Teacher's Edition. 
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2. Randomly generate 10 distinct numbers from | to 1750. 
Do randInt (1, 1750) again. Keep pressing | ENTER | until you have chosen 10 different labels. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


randiInt(1,1750) 


Note: If you have a ‘T1-83/84 with OS 2.55 or later, you can use the command RandIntNoRep (1, 1750) to sort the 
numbers from | to 1750 in random order. The first 10 numbers listed give the labels of the chosen students. 


If you don’t have technology handy, you can use a table of random digits to 
choose an SRS. We have provided a table of random digits at the back of the book 
(Table D). Here is an excerpt. 


LINE 

101 19223 95034 05756 28713 96409 = =12531 42544 = 82853 
102 73676 =947150 »=s 99400) (01927) 27754 = 42648 = 82425 = 36290 
103 45467 71709 77558 00095 32863 29485 82226 90056 


You can think of this table as the result of someone putting the digits 0 to 9 ina 
hat, mixing, drawing one, replacing it, mixing again, drawing another, and so on. 
The digits have been arranged in groups of five within numbered rows to make 
the table easier to read. The groups and rows have no special meaning—‘Table D 
is just a long list of randomly chosen digits. As with technology, there are two steps 
in using Table D to choose a random sample. 


HOW TO CHOOSE AN SRS USING TABLE D 


Always use the shortest labels that will cover your population. For instance, 
you can label up to 100 individuals with two digits: 01, 02, ..., 99, 00. As stan- 
dard practice, we recommend that you begin with label | (or 01 or 001 or 0001, 
as needed). Reading groups of digits from the table gives all individuals the same 
chance to be chosen because all labels of the same length have the same chance 
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to be found in the table. For example, any pair of digits in the table is equally 
likely to be any of the 100 possible labels 01, 02,..., 99, 00. Here’s an example 
that shows how this process works. 


Spring Break! 
Choosing an SRS with Table D 


The school newspaper is planning an article on family-friendly 
places to stay over spring break at a nearby beach town. The 
editors intend to call 4 randomly chosen hotels to ask about their 
amenities for families with children. They have an alphabetized 
list of all 28 hotels in the town. 


PROBLEM: Use Table D at line 130 to choose an SRS of 4 hotels for the 
editors to call. 


SOLUTION: We'll use the two-step process for selecting an SRS using 
Table D. 


Step 1: Label. Two digits are needed to label the 28 hotels. We have added 
labels 01 to 28 to the alphabetized list of hotels below. 


01 Aloha Kai 08 Captiva 15 Palm Tree 22 Sea Shell 

02 Anchor Down O09 Casa del Mar 16 Radisson 23 Silver Beach 
03 Banana Bay 10 Coconuts 17 Ramada 24 Sunset Beach 
04 Banyan Tree 11 Diplomat 18 Sandpiper 25 Tradewinds 
05 Beach Castle 12 Holiday Inn 19 SeaCastle 26 Tropical Breeze 
06 Best Western 13 Lime Tree 20 Sea Club 27 Tropical Shores 
07 Cabana 14 Outrigger 21 Sea Grape 28 Veranda 


Step 2: Randomize. To use Table D, start at the left-hand side of line 130 and read two-digit 
groups. Skip any groups that aren't between 01 and 28, as well as any repeated groups. Continue 
until you have chosen four hotels. Here is the beginning of line 130: 


69051 64817 87174 09517 84534 06489 87201 97245 
The first 10 two-digit groups are 
69 05 16 48 17 87 AY/ 40 95 17 


Skip of af Skip Se Skip Skip Skip Skip Skip 
Too big Too big Toobig Repeat Toobig Toobig Repeat 


We skip 5 of these 10 groups because they are too high (over 28) and 2 because they are repeats 
(both 17s). The hotels labeled 05, 16, and 17 go into the sample. We need one more hotel to com- 
plete the sample. Continuing along line 130: 


84 aye) 40 64 89 8&7 20 
Skip Skip Skip Skip Skip Skip v 
Too big Too big Too big Too big Too big Too big 


Our SRS of 4 hotels for the editors to contact is 05 Beach Castle, 16 Radisson, 17 Ramada, and 
20 Sea Club. 


For Practice Try Exercise 
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We can trust results from an SRS, as well as from other types of random samples 
that we will meet later, because the use of impersonal chance avoids bias. The 
following activity shows why random sampling is so important. 


ACTIVITY | Who Wrote the Federalist Papers? 


The Federalist Papers are a series of 85 essays supporting the ratification of the U.S. 
Constitution. At the time they were published, the identity of the authors was a 
secret known to just a few people. Over time, however, the authors were identi- 
fied as Alexander Hamilton, James Madison, and John Jay. The authorship 
of 73 of the essays is fairly certain, leaving 12 in dispute. However, thanks 
in some part to statistical analysis,’ most scholars now believe that the 12 
disputed essays were written by Madison alone or in collaboration with 
Hamilton.® 
There are several ways to use statistics to help determine the authorship 
of a disputed text. One method is to estimate the average word length in 
a disputed text and compare it to the average word lengths of works where 
the authorship is not in dispute. 
The following passage is the opening paragraph of Federalist Paper #51,” 
one of the disputed essays. ‘The theme of this essay is the separation of powers 
between the three branches of government. 


To what expedient, then, shall we finally resort, for maintaining in practice the 
necessary partition of power among the several departments, as laid down in 
the Constitution? The only answer that can be given is, that as all these exterior 
provisions are found to be inadequate, the defect must be supplied, by so 
contriving the interior structure of the government as that its several constituent 
parts may, by their mutual relations, be the means of keeping each other in 
their proper places. Without presuming to undertake a full development of this 
important idea, I will hazard a few general observations, which may perhaps 
place it in a clearer light, and enable us to form a more correct judgment of the 
principles and structure of the government planned by the convention. 


1. Choose 5 words from this passage. Count the number of letters in each of the 
words you selected, and find the average word length. 


2. Your teacher will draw and label a horizontal axis for a class dotplot. Plot the 
average word length you obtained in Step | on the graph. 


3. Use a table of random digits or a random number generator to select a simple 
random sample of 5 words from the 130 words in the opening passage. Count 
the number of letters in each of the words you selected, and find the average 
word length. 


4. Your teacher will draw and label another horizontal axis with the same scale 
for a comparative class dotplot. Plot the average word length you obtained in 
Step 3 on the graph. 


5. How do the dotplots compare? Can you think of any reasons why they might 
be different? Discuss with your classmates. 


Stratum is singular. Strata are plural. 


ACTIVITY 


MATERIALS: 


Calculator for each student 
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Other Random Sampling Methods 


The basic idea of sampling is straightforward: take an SRS from the population 
and use your sample results to gain information about the population. Unfortu- 
nately, it’s usually difficult to get an SRS from the population of interest. Imagine 
trying to get a simple random sample of all the batteries produced in one day at a 
factory. Or an SRS of all U.S. high school students. In either case, it would be dif- 
ficult to obtain an accurate list of the population from which to draw the sample. 
It would also be very time-consuming to collect data from each individual that’s 
randomly selected. Sometimes, there are also statistical advantages to using more 
complex sampling methods. 

One of the most common alternatives to an SRS involves sampling groups 
(strata) of similar individuals within the population separately. Then these sepa- 
rate “subsamples” are combined to form one stratified random sample. 


DEFINITION: Stratified random sample and strata 


To get a stratified random sample, start by classifying the population into groups 
of similar individuals, called strata. Then choose a separate SRS in each stratum 
and combine these SRSs to form the sample. 


Choose the strata based on facts known before the sample is taken. For example, 
in a study of sleep habits on school nights, the population of students in a large high 
school might be divided into freshman, sophomore, junior, and senior strata. In a pre- 
election poll, a population of election districts might be divided into urban, suburban, 
and rural strata. Stratified random sampling works best when the individuals within 
each stratum are similar with respect to what is being measured and when there are 
large differences between strata. The following Activity makes this point clear. 


Sampling sunflowers 


A British farmer grows sunflowers for making sunflower oil. Her field is arranged 

in a grid pattern, with 10 rows and 10 columns as shown in the figure on the next 

page. Irrigation ditches run along the top and bottom of the field. The farmer 

would like to estimate the number of healthy plants in the field so she can project 
how much money she’ll make from selling them. It would take too much time 
to count the plants in all 100 squares, so she’ll accept an estimate based on a 
sample of 10 squares. 


1. Use Table D or technology to take a simple random sample of 10 grid 
squares. Record the location (for example, B6) of each square you select. 


2. This time, you'll take a stratified random sample using the rows as strata. 
Use Table D or technology to randomly select one square from each (hori- 
zontal) row. Record the location of each square — for example, Row I: G, 
Row 2: B, and so on. 


DESIGNING STUDIES 


3. Now, take a stratified random sample using the columns as strata. 
Use Table D or technology to randomly select one square from each 


(vertical) column. Record the location of each square —for example, 


J Column A: 4, Column B: 1, and so on. 


4. The table on page N/DS-5 in the back of the book gives the actual 


number of sunflowers in each grid square. Use the information pro- 


vided to calculate your estimate of the mean number of sunflowers per 


square for each of your samples in Steps 1, 2, and 3. 


5. Make comparative dotplots showing the mean number of sunflow- 


ers obtained using the three different sampling methods for all mem- 


bers of the class. Describe any similarities and differences you see. 
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6. Your teacher will provide you with the mean number of sunflowers 


50 @'0 S0a000@ DO® WO a 
100 101 102 103 104 105 


in the population of all 100 grid squares in the field. How did the three 
sampling methods do? 


The dotplots below show the mean number of healthy plants in 100 samples us- 
ing each of the three sampling methods in the Activity: simple random sampling, 
stratified random sampling with rows of the field as strata, and stratified random 
sampling with columns of the field as strata. Notice that all three distributions are 
centered at about 102.5, the true mean number of healthy plants in all squares of 
the field. That makes sense because random sampling yields accurate estimates of 
unknown population values. 

One other detail stands out in the graphs. There is much less variability in the 
estimates using stratified random sampling with the rows as strata. The table on 
page N/DS-5 shows the actual number of healthy sunflowers in each grid square. 
Notice that the squares within each row contain a similar number of healthy 
plants but there are big differences between rows. When we can choose strata that 
are “similar within but different between,” stratified random samples give more pre- 
cise estimates than simple random samples of the same size. 

Why didn’t using the columns as strata reduce the variability of the estimates 
in a similar way? Because the squares within each column have very different 
numbers of healthy plants. 
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Both simple random sampling and stratified random sampling are hard to use 
when populations are large and spread out over a wide area. In that situation, we'd 


In a cluster sample, some people take 
an SRS from each cluster rather than 
including all members of the cluster. 


Remember: strata are ideally “similar 
within, but different between,” while 
clusters are ideally “different within, 
but similar between.” 
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prefer a method that selects groups (clusters) of individuals that are “near” one 
another. That’s the idea of a cluster sample. 


= 


DEFINITION: Cluster sample and clusters 


To get a cluster sample, start by classifying the population into groups of individuals 
that are located near each other, called clusters. Then choose an SRS of the clus- 
ters. All individuals in the chosen clusters are included in the sample. 


Cluster samples are often used for practical reasons, like saving time and 
money. Cluster sampling works best when the clusters look just like the popula- 
tion but on a smaller scale. Imagine a large high school that assigns its students to 
homerooms alphabetically by last name. The school administration is consider- 
ing a new schedule and would like student input. Administrators decide to survey 
200 randomly selected students. It would be difficult to track down an SRS of 200 
students, so the administration opts for a cluster sample of homerooms. The prin- 
cipal (who knows some statistics) takes a simple random sample of 8 homerooms 
and gives the survey to all 25 students in each homeroom. 

Cluster samples don’t offer the statistical advantage of better information 
about the population that stratified random samples do. That’s because clusters 
are often chosen for ease so they may have as much variability as the population 
itself. 

Be sure you understand the difference between strata and clusters. We 4 
want each stratum to contain similar individuals and for there to be large @ 
differences between strata. For a cluster sample, we’d like each cluster to 
look just like the population, but on a smaller scale. Here’s an example that com- 
pares the random sampling methods we have discussed so far. 


Sampling at a School Assembly 


Strata or clusters? 


ieee =| be student council wants to conduct a survey during the first 


five minutes of an all-school assembly in the auditorium about 
use of the school library. They would like to announce the re- 
sults of the survey at the end of the assembly. The student coun- 
cil president asks your statistics class to help carry out the survey. 


PROBLEM: There are 800 students present at the assembly. A map of 
the auditorium is shown on the next page. Note that students are seated by 
grade level and that the seats are numbered from 1 to 800. 
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9th grade: Seats 601-800 10th grade: Seats 401-600 
11th grade: Seats 201-400 12th grade:Seats 1-200 


Describe how you would use your calculator to select 50 students to complete the survey with each 
of the following: 


(a) Simple random sample 
(b) Stratified random sample 
(c) Cluster sample 


SOLUTION: 
(a) Totake an SRS, we need to choose 80 of the seat numbers at random. Use randlnt(1,600) on your 
calculator until 80 different seats are selected. Then give the survey to the students in those seats. 


(b) The students in the assembly are seated by grade level. Because students library use might be 
similar within grade levels but different across grade levels, we'll use the grade level seating areas as 
our strata. Within each grade’s seating area, we'll select 20 seats at random. For the 9th grade, use 
randInt(601,800) to select 20 different seats. Use randlnt(401,600) to pick 20 different sopho- 
more seats, randInt(201,400) to get 20 different junior seats, and randInt(1,200) to choose 20 
different senior seats. Give the survey to the students in the selected seats. 


(c) Withthe way students are seated, each column of seats from the stage to the back of the audito- 
more aficient than finding 80:seats rium could be used as a cluster. Note that each cluster contains students from all four grade levels, so 
scattered about the auditorium, each should represent the population well. Because there are 20 clusters, each with 40 seats, we need 
as required by both of the other to choose 2 clusters at random to get &0 students for the survey. Use randlnt(1,20) to select two 
sampling methods. clusters, and then give the survey to all 40 students in each column of seats. 


Note that cluster sampling is much 


For Practice Try Exercise 


Most large-scale sample surveys use multistage samples that combine two or more 
sampling methods. For example, the U.S. Census Bureau carries out a monthly 
Current Population Survey (CPS) of about 60,000 households. Researchers start by 
choosing a stratified random sample of neighborhoods in 756 of the 2007 geograph- 
ical areas in the United States. Then they divide each neighborhood into clusters of 
four nearby households and select a cluster sample to interview. 

Analyzing data from sampling methods more complex than an SRS takes us 
beyond basic statistics. But the SRS is the building block of more elaborate meth- 
ods, and the principles of analysis remain much the same for these other methods. 
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CHECK YOUR UNDERSTANDING 


The manager of a sports arena wants to learn more about the financial status of the people 
who are attending an NBA basketball game. He would like to give a survey to a representa- 
tive sample of the more than 20,000 fans in attendance. ‘Ticket prices for the game 
srnaptTtON vary a great deal: seats near the court cost over $100 each, while seats in the top rows 
_ ae B of the arena cost $25 each. The arena is divided into 30 numbered sections, from 
tool 101 to 130. Each section has rows of seats labeled with letters from A (nearest the 
What ka w §=court) to ZZ (top row of the arena). 
1. Explain why it might be difficult to give the survey to an SRS of 200 fans. 


2. Which would be a better way to take a stratified random sample of fans: using 
the lettered rows or the numbered sections as strata? Explain. 

3. Which would be a better way to take a cluster sample of fans: using the 
lettered rows or the numbered sections as clusters? Explain. 


Inference for Sampling 


The purpose of a sample is to give us information about a larger population. The 
process of drawing conclusions about a population on the basis of sample data is 
called inference because we infer information about the population from what we 
know about the sample. 

Inference from convenience samples or voluntary response samples would be 
misleading because these methods of choosing a sample are biased. We are almost 
certain that the sample does not fairly represent the population. The first reason to 
rely on random sampling is to avoid bias in choosing a sample. 

Still, it is unlikely that results from a random sample are exactly the same as for 
the entire population. Sample results, like the unemployment rate obtained from 
the monthly Current Population Survey, are only estimates of the truth about the 
population. If we select two samples at random from the same population, we will 
almost certainly choose different individuals. So the sample results will differ some- 
what, just by chance. Properly designed samples avoid systematic bias. But their 
results are rarely exactly correct, and we expect them to vary from sample to sample. 


Going to class 
How much do sample results vary? 


Suppose that 70% of the students in a large university attended all their classes last 
week. Imagine taking a simple random sample of 100 students and recording the 
proportion of students in the sample who went to every class last week. Would the 
sample proportion be exactly 0.70? Probably not. Would the sample proportion 
be close to 0.70? That depends on what we mean by “close.” The following graph 
shows the results of taking 500 SRSs, each of size 100, and recording the propor- 
tion of students who attended all their classes in each sample. 


What do we see? The graph is centered at about 0.70, the population proportion. 
All of the sample proportions fall between 0.55 and 0.85. So we shouldn’t be sur- 
prised if the difference between the sample proportion and the population propor- 
tion is as large as 0.15. The graph also has a very distinctive “bell shape.” 
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Dotplot of the sample proportion of stu- il | 1 | | | | , g 
dents in each of 500 SRSs of size 100 _08o oF SESSE TSEC ESSE TESTS SESS 


who attended all their classes last week. 0.55 060 065 0.70 0.75 0.80 0.85 
The population proportion is 0.70. sampleproportion 


Why can we trust random samples? As the previous example illustrates, the results 
of random sampling don’t change haphazardly from sample to sample. Because we 
deliberately use chance, the results obey the laws of probability that govern chance 
behavior. These laws allow us to say how likely it is that sample results are close to 
the truth about the population. The second reason to use random sampling is that 
the laws of probability allow trustworthy inference about the population. Results 
from random samples come with a “margin of error” that sets bounds on the size of 
the likely error. We will discuss the details of inference for sampling later. 

One point is worth making now: larger random samples give better information 
about the population than smaller samples. For instance, let’s look at what hap- 
pens if we increase the sample size in the example from 100 to 400 students. The 
dotplot below shows the results of taking 500 SRSs, each of size 400, and recording 
the proportion of students who attended all their classes in each sample. This graph 
is also centered at about 0.70. But now all the sample proportions fall between 0.63 
and 0.77. So the difference between the sample proportion and the population pro- 
portion is at most 0.07. When using SRSs of size 100, this difference could be as 
much as 0.15. The moral of the story: by taking a very large random sample, you can 
be confident that the sample result is very close to the truth about the population. 


Dotplot of the sample proportion of stu- 
dents in each of 500 SRSs of size 400 
who attended all their classes last week. 055 060 065 0.70 07. 5 080 085 
The population proportion is 0.70. sampleproportion 


The list of individuals from which a 
sample will be drawn is called the 
sampling frame. 
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The Current Population Survey contacts about 60,000 households, so we’d 
expect its estimate of the national unemployment rate to be within about 0.1% 
of the actual population value. Opinion polls that contact 1000 or 1500 people 
give less precise results—we expect the sample result to be within about 3% of 
the actual population percent with a given opinion. Of course, only samples 
chosen by chance carry this guarantee. Lou Dobbs’s online sample tells us 
little about overall American public opinion even though 7350 people clicked 
a response. 


Sample Surveys: What Can Go Wrong? 


The use of bad sampling methods (convenience or voluntary response) often 
leads to bias. Researchers can avoid bad methods by using random sampling to 
choose their samples. Other problems in conducting sample surveys are more 
difficult to avoid. 

Sampling is often done using a list of individuals in the population. Such lists 
are seldom accurate or complete. The result is undercoverage. 


Most samples suffer from some degree of undercoverage. A sample survey of 
households, for example, will miss not only homeless people but also prison in- 
mates and students in dormitories. An opinion poll conducted by calling landline 
telephone numbers will miss households that have only cell phones as well as 
households without a phone. The results of national sample surveys therefore 
have some bias due to undercoverage if the people not covered differ from the 
rest of the population. 

Well-designed sample surveys avoid bias in the sampling process. The real 
problems start after the sample is chosen. 

One of the most serious sources of bias in sample surveys is nonresponse. 


Nonresponse to surveys often exceeds 50%, even with careful planning and 
several follow-up calls. If the people who respond differ from those who don’t, in 
a way that is related to the response, bias results. 

Some students misuse the term “voluntary response” to explain why certain 
individuals don’t respond in a sample survey. Their idea is that participation rr) 
in the survey is optional (voluntary), so anyone can refuse to take part. What 
the students are describing is nonresponse. Think about it this way: nonresponse 
can occur only after a sample has been selected. In a voluntary response sample, 
every individual has opted to take part, so there won’t be any nonresponse. 
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The ACS, GSS, and Opinion Polls 


How bad is nonresponse? 


The Census Bureau’s American Community Survey (ACS) has the lowest nonre- 
sponse rate of any poll we know: only about 1% of the households in the sample 
refuse to respond. The overall nonresponse rate, including “never at home” and 
other causes, is just 2.5%.!° This monthly survey of about 250,000 households re- 
places the “long form” that in the past was sent to some households in the every- 
ten-years national census. Participation in the ACS is mandatory, and the Census 
Bureau follows up by telephone and then in person if a household doesn’t return 
the mail questionnaire. 


The University of Chicago’s General Social Survey (GSS) is the nation’s most 
important social science survey (see Figure 4.1). The GSS contacts its sample in 
person, and it is run by a university. Despite these advantages, its most recent sur- 
vey had a 30% rate of nonresponse. 


f@ Session 


GSS General Social NUBVEY 


The General Social Survey (GSS) conducts basic scientific research on 
the structure and development of American society with a data-collection 
program designed to both monitor social change within the United States 
nd to compare the United States to other nations. 


FIGURE 4.1 The home page of the General Social Survey at the University of Chicago’s National 
Opinion Research Center (http://www3.norc.org/GSS+Website/). The GSS has tracked opinions 
about a wide variety of issues since 1972. 


What about opinion polls by news media and opinion-polling firms? We don’t 
know their rates of nonresponse because they won't say. That’s a bad sign. The 
Pew Research Center for the People and the Press imitated a careful random digit 
dialing survey and published the results: over 5 days, the survey reached 76% of the 
households in its chosen sample, but “because of busy schedules, skepticism and 
outright refusals, interviews were completed in just 38% of households that were 
reached.” Combining households that could not be contacted with those who did 
not complete the interview gave a nonresponse rate of 73%.!! 


Another type of nonsampling problem occurs when people give inaccurate 
answers to survey questions. People may lie about their age, income, or drug use. 
They may misremember how many hours they spent on the Internet last week. Or 
they might make up an answer to a question that they don’t understand. 
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The gender, race, age, ethnicity, or behavior of the interviewer can also affect 
people’s responses. A systematic pattern of inaccurate answers in a survey leads to 
response bias. 


IM FILLING OUT SEE, THEY ASKED HOW MUCK MONEN THIS MAGAZINE SHOULD } I LOVE 
A READER SURVEN T SPEND ON GUM EACH WEEK, SOT HAVE SOME AMUSING MESSING 
FOR CHEWING WROTE, "$500." FOR MY AGE, I PUT WITH DATA. 
MAGAZINE. “43° AND WHEN THEY ASKED WHAT MY 
FAVORITE FLAVOR IS, I WROTE 
“GARLIC / CURRY" 
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The wording of questions is the most important influence on the answers 
given to a sample survey. Confusing or leading questions can introduce strong 
bias. Changes in wording can greatly affect a survey’s outcome. 


How Do Americans Feel about 


Illegal Immigrants? 


Question wording matters 


“Should illegal immigrants be prosecuted and deported for being in the U.S. illegally, 
or shouldn’t they?” Asked this question in an opinion poll, 69% favored deportation. 
But when the very same sample was asked whether illegal immigrants who have 
worked in the United States for two years “should be given a chance to keep their 
jobs and eventually apply for legal status,” 62% said that they should. Different ques- 
tions give quite different impressions of attitudes toward illegal immigrants. 


Even the order in which questions are asked matters. Don’t trust the at 
results of a sample survey until you have read the exact questions asked. 


THINK Does the order matter? Ask a sample of college students these two 


ABOUT IT questions: 


“How happy are you with your life in general?” (Answers on a scale of | to 5) 
“How many dates did you have last month?” 


There is almost no association between responses to the two questions when asked 
in this order. It appears that dating has little to do with happiness. Reverse the 
order of the questions, however, and a much stronger association appears: college 
students who say they had more dates tend to give higher ratings of happiness 
about life. Asking a question that brings dating to mind makes dating success a big 
factor in happiness. 


OoeE>E—<$_io _ _<$_ Oo 


228 CHAPTER 4 DESIGNING STUDIES 


CHECK YOUR UNDERSTANDING 


1. Each of the following is a possible source of bias in a sample survey. Name the type of 
bias that could result. 


(a) The sample is chosen at random from a telephone directory. 
(b) Some people cannot be contacted in five calls. 


(c) Interviewers choose people walking by on the sidewalk to interview. 


2. Asurvey paid for by makers of disposable diapers found that 84% of the sample 
opposed banning disposable diapers. Here is the actual question: 


It is estimated that disposable diapers account for less than 2% of the trash in today’s 
landfills. In contrast, beverage containers, third-class mail, and yard wastes are esti- 
mated to account for about 21% of the trash in landfills. Given this, in your opinion, 
would it be fair to ban disposable diapers?” 


Explain how the wording of the question could result in bias. Be sure to specify the direc- 
tion of the bias. 


Summary 


e Acensus collects data from every individual in the population. 


e Asample survey selects a sample from the population of all individuals about 
which we desire information. The goal of a sample survey is inference: we 
draw conclusions about the population based on data from the sample. It is 
important to specify exactly what population you are interested in and what 
variables you will measure. 


e¢ Convenience samples choose individuals who are easiest to reach. In vol- 
untary response samples, individuals choose to join the sample in response 
to an open invitation. Both of these sampling methods usually lead to bias: 
they consistently underestimate or consistently overestimate the value you 
want to know. 


e¢ Random sampling uses chance to select a sample. 


e The basic random sampling method is a simple random sample (SRS). An 
SRS gives every possible sample of a given size the same chance to be cho- 
sen. Choose an SRS by labeling the members of the population and using 
slips of paper, random digits, or technology to select the sample. 


e To choose a stratified random sample, divide the population into strata, 
groups of individuals that are similar in some way that might affect their 
responses. Then choose a separate SRS from each stratum and combine 
these SRSs to form the sample. When strata are “similar within but different 
between,” stratified random samples tend to give more precise estimates of 
unknown population values than simple random samples. 


¢ To choose a cluster sample, divide the population into groups of individuals 
that are located near each other, called clusters. Randomly select some of 
these clusters. All the individuals in the chosen clusters are included in the 
sample. Ideally, clusters are “different within but similar between.” Cluster 
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sampling saves time and money by collecting data from entire groups of indi- 
viduals that are close together. 


e Random sampling helps avoid bias in choosing a sample. Bias can still occur 
in the sampling process due to undercoverage, which happens when some 
members of the population cannot be chosen. 


e The most serious errors in sample surveys, however, are ones that occur af- 
ter the sample is chosen. The single biggest problem is nonresponse: when 
people can’t be contacted or refuse to answer. Incorrect answers by respon- 
dents can lead to response bias. Finally, the wording of questions has a big 


influence on the answers. 


4.1) TECHNOLOGY 
CORNER 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


10. Choosing an SRS 


Exercises 


Students as customers A high school’s student 
newspaper plans to survey local businesses about the 
importance of students as customers. From an al- 
phabetical list of all local businesses, the newspaper 
staff chooses 150 businesses at random. Of these, 73 
return the questionnaire mailed by the staff. Identify 
the population and the sample. 


Student archaeologists An archaeological dig turns 
up large numbers of pottery shards, broken stone 
tools, and other artifacts. Students working on the 
project classify each artifact and assign it a number. 
The counts in different categories are important for 
understanding the site, so the project director chooses 
2% of the artifacts at random and checks the students’ 
work. Identify the population and the sample. 


Sampling stuffed envelopes A large retailer prepares its 
customers’ monthly credit card bills using an automatic 
machine that folds the bills, stuffs them into envelopes, 
and seals the envelopes for mailing. Are the envelopes 
completely sealed? Inspectors choose 40 envelopes 

at random from the 1000 stuffed each hour for visual 
inspection. Identify the population and the sample. 
Customer satisfaction A department store mails 

a customer satisfaction survey to people who make 
credit card purchases at the store. This month, 


45,000 people made credit card purchases. Surveys 
are mailed to 1000 of these people, chosen at ran- 
dom, and 137 people return the survey form. Identify 
the population and the sample. 


Call the shots An advertisement for an upcoming TV 
show asked: “Should handgun control be tougher? 
You call the shots in a special call-in poll tonight. If 
yes, call 1-900-720-6181. If no, call 1-900-720-6182. 
Charge is 50 cents for the first minute.” Over 90% 

of people who called in said “Yes.” Explain why this 
opinion poll is almost certainly biased. 


Explain it to the congresswoman You are on the staff 
of a member of Congress who is considering a bill that 
would provide government-sponsored insurance for 
nursing-home care. You report that 1128 letters have 
been received on the issue, of which 871 oppose the 
legislation. “’m surprised that most of my constituents 
oppose the bill. I thought it would be quite popular,” 
says the congresswoman. Are you convinced that a 
majority of the voters oppose the bill? How would you 
explain the statistical issue to the congresswoman? 


Instant opinion A recent online poll posed the 
question “Should female athletes be paid the 
same as men for the work they do?” In all, 13,147 
(44%) said “Yes,” 15,182 (50%) said “No,” and the 
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remaining 1448 said “Don’t know.” In spite of the 
large sample size for this survey, we can’t trust the 
result. Why not? 


Sampling at the mall You have probably seen the 
mall interviewer, approaching people passing by 
with clipboard in hand. Explain why even a large 
sample of mall shoppers would not provide a trust- 
worthy estimate of the current unemployment rate. 


Sleepless nights How much sleep do high school 
students get on a typical school night? An interested 
student designed a survey to find out. To make data 
collection easier, the student surveyed the first 100 
students to arrive at school on a particular morning. 
These students reported an average of 7.2 hours of 
sleep on the previous night. 


What type of sample did the student obtain? 


Explain why this sampling method is biased. Is 7.2 
hours probably higher or lower than the true aver- 
age amount of sleep last night for all students at the 


school? Why? 


Online polls In June 2008, Parade magazine posed 
the following question: “Should drivers be banned 
from using all cell phones?” Readers were encour- 
aged to vote online at parade.com. The July 13, 
2008, issue of Parade reported the results: 2407 
(85%) said “Yes” and 410 (15%) said “No.” 


What type of sample did the Parade survey obtain? 


Explain why this sampling method is biased. Is 85% 
probably higher or lower than the true percent of all 
adults who believe that cell phone use while driving 


should be banned? Why? 


Do you trust the Internet? You want to ask a sample 
of high school students the question “How much do 
you trust information about health that you find on 
the Internet—a great deal, somewhat, not much, or 
not at all?” You try out this and other questions on a 
pilot group of 5 students chosen from your class. The 
class members are listed below. 


Explain how you would use a line of Table D to choose 
an SRS of 5 students from the following list. Explain 
your method clearly enough for a classmate to obtain 
your results. 

Use line 107 to select the sample. Show how you use 
each of the digits. 

Anderson Deng Glaus Nguyen Samuels 
Arroyo De Ramos __ Helling Palmiero Shen 
Batista Drasin Husain Percival Tse 

Bell Eckstein Johnson Prince Velasco 
Burke Fernandez Kim Puri Wallace 
Cabrera Fullmer Molina Richards Washburn 
Calloway — Gandhi Morgan Rider Zabidi 
Delluci Garcia Murphy Rodriguez Zhao 
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12. Apartment living You are planning a report on 
apartment living in a college town. You decide to 
select three apartment complexes at random for in- 
depth interviews with residents. 

(a) Explain how you would use a line of Table D to 
choose an SRS of 3 complexes from the list below. 
Explain your method clearly enough for a classmate 
to obtain your results. 

(b) Use line 117 to select the sample. Show how you use 
each of the digits. 

Ashley Oaks Chauncey Village Franklin Park Richfield 

Bay Pointe Country Squire Georgetown Sagamore Ridge 

Beau Jardin Country View Greenacres Salem Courthouse 

Bluffs Country Villa Lahr House Village Manor 

Brandon Place Crestview Mayfair Village Waterford Court 

Briarwood Del-Lynn Nobb Hill Williamsburg 

Brownstone Fairington Pemberly Courts 

Burberry Fairway Knolls Peppermill 

Cambridge Fowler Pheasant Run 

13. Sampling the forest ‘To gather data on a 1200-acre 
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pine forest in Louisiana, the U.S. Forest Service laid a 
grid of 1410 equally spaced circular plots over a map 

of the forest. A ground survey visited a sample of 10% 
of these plots.'? 


Explain how you would use your calculator or Table D 
to choose an SRS of 141 plots. Your description should 
be clear enough for a classmate to carry out your plan. 


Use your method from (a) to choose the first 3 plots. 


Sampling gravestones ‘The local genealogical society 
in Coles County, Illinois, has compiled records on all 
55,914 gravestones in cemeteries in the county for the 
years 1825 to 1985. Historians plan to use these records 
to learn about African Americans in Coles County’s his- 
tory. They first choose an SRS of 395 records to check 
their accuracy by visiting the actual gravestones.!* 


Explain how you would use your calculator or Table 
D to choose the SRS. Your description should be 
clear enough for a classmate to carry out your plan. 
Use your method from (a) to choose the first 3 
gravestones. 

Random digits Which of the following statements 
are true of a table of random digits, and which are 
false? Briefly explain your answers. 

There are exactly four 0s in each row of 40 digits. 
Each pair of digits has chance 1/100 of being 00. 


The digits 0000 can never appear as a group, because 
this pattern is not random. 

Random digits In using Table D repeatedly to 
choose random samples, you should not always begin 
at the same place, such as line 101. Why not? 
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iPhones Suppose 1000 iPhones are produced at 

a factory today. Management would like to ensure 
that the phones’ display screens meet their quality 
standards before shipping them to retail stores. Since 
it takes about 10 minutes to inspect an individual 
phone’s display screen, managers decide to inspect a 
sample of 20 phones from the day’s production. 


Explain why it would be difficult for managers to 
inspect an SRS of 20 iPhones that are produced today. 


An eager employee suggests that it would be easy to 
inspect the last 20 iPhones that were produced today. 
Why isn’t this a good idea? 

Another employee recommends a different sam- 
pling method: Randomly choose one of the first 50 
iPhones produced. Inspect that phone and every 
fiftieth iPhone produced afterward. (‘This method 

is known as systematic random sampling.) Explain 
carefully why this sampling method is not an SRS. 


Dead trees On the west side of Rocky Mountain 
National Park, many mature pine trees are dying due 
to infestation by pine beetles. Scientists would like to 
use sampling to estimate the proportion of all pine 
trees in the area that have been infected. 


Explain why it wouldn't be practical for scientists to 
obtain an SRS in this setting. 


A possible alternative would be to use every pine tree 
along the park’s main road as a sample. Why is this 
sampling method biased? 


Suppose that a more complicated random sampling 
plan is carried out, and that 35% of the pine trees in 
the sample are infested by the pine beetle. Can scien- 
tists conclude that exactly 35% of all the pine trees on 
the west side of the park are infested? Why or why not? 


Who goes to the convention? A club has 30 student 
members and 10 faculty members. The students are 


Abel Fisher Huber Miranda Reinmann 
Carson Ghosh Jimenez Moskowitz Santos 
Chen Griswold Jones Neyman Shaw 
David Hein Kim O’Brien Thompson 
Deming Hernandez Klotz Pearl Utts 
Elashoff — Holland Liu Potter Varga 


The faculty members are 


Andrews Fernandez Kim Moore West 
Besicovitch Gupta Lightman Phillips Yang 


The club can send 4 students and 2 faculty mem- 
bers to a convention. It decides to choose those who 
will go by random selection. Describe a method for 
using ‘Table D to select a stratified random sample 
of + students and 2 faculty. Then use line 123 to 
select the sample. 


20. 


21. 
a] 221 


Section 4.1 Sampling and Surveys 


Sampling by accountants Accountants often use 
stratified samples during audits to verify a company’s 
records of such things as accounts receivable. ‘The strat- 
ification is based on the dollar amount of the item and 
often includes 100% sampling of the largest items. One 
company reports 5000 accounts receivable. Of these, 
100 are in amounts over $50,000; 500 are in amounts 
between $1000 and $50,000; and the remaining 4400 
are in amounts under $1000. Using these groups as 
strata, you decide to verify all the largest accounts and 
to sample 5% of the midsize accounts and 1% of the 
small accounts. Describe a method for using ‘Table D 
to select a stratified random sample of the midsize and 
small accounts. ‘Then use line 115 to select only the 
first 3 accounts from each of these strata. 


Go Blue! Michigan Stadium, also known as “The 
Big House,” seats over 100,000 fans for a football 
game. ‘The University of Michigan athletic depart- 
ment plans to conduct a survey about concessions 
that are sold during games. Tickets are most expen- 
sive for seats on the sidelines. ‘he cheapest seats are 
in the end zones (where one of the authors sat as a 
student). A map of the stadium is shown. 
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The athletic department is considering a stratified 
random sample. What would you recommend as the 
strata? Why? 


MS Sideline 
[1 Corner 
TE) Endzone 


(b) Explain why a cluster sample might be easier to 


Dipl 


obtain. What would you recommend for the clusters? 
Why? 

How was your stay? A hotel has 30 floors with 40 
rooms per floor. The rooms on one side of the hotel 
face the water, while rooms on the other side face a 
golf course. There is an extra charge for the rooms 
with a water view. The hotel manager wants to 
survey 120 guests who stayed at the hotel during a 
convention about their overall satisfaction with the 


property. 
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Explain why choosing a stratified random sample 
might be preferable to an SRS in this case. What 
would you use as strata? 


Why might a cluster sample be a simpler option? 
What would you use as clusters? 


Is it an SRS? A corporation employs 2000 male and 
500 female engineers. A stratified random sample 

of 200 male and 50 female engineers gives each 
engineer | chance in 10 to be chosen. This sample 
design gives every individual in the population the 
same chance to be chosen for the sample. Is it an 
SRS? Explain your answer. 


Attitudes toward alcohol At a party there are 30 
students over age 21 and 20 students under age 

21. You choose at random 3 of those over 2] and 
separately choose at random 2 of those under 21 to 
interview about attitudes toward alcohol. You have 
given every student at the party the same chance to 
be interviewed. Why is your sample not an SRS? 


High-speed Internet Laying fiber-optic cable is expen- 
sive. Cable companies want to make sure that if they 
extend their lines out to less dense suburban or rural 
areas, there will be sufficient demand and the work 

will be cost-effective. They decide to conduct a survey 
to determine the proportion of households in a rural 
subdivision that would buy the service. They select a 
simple random sample of 5 blocks in the subdivision 
and survey each family that lives on one of those blocks. 


What is the name for this kind of sampling method? 


Give a possible reason why the cable company chose 
this method. 


Timber! A lumber company wants to estimate the 
proportion of trees in a large forest that are ready to 
be cut down. They use an aerial map to divide the 
forest into 200 equal-sized rectangles. ‘Then they 
choose a random sample of 20 rectangles and exam- 
ine every tree that’s in one of those rectangles. 


What is the name for this kind of sampling method? 


Give a possible reason why the lumber company 
chose this method. 


Tweet, tweet! What proportion of students at your 
school use ‘Twitter? To find out, you survey a simple 
random sample of students from the school roster. 


Will your sample result be exactly the same as the 
true population proportion? Explain. 


Which would be more likely to get your sample 
result closer to the true population value: an SRS of 
50 students or an SRS of 100 students? Explain. 


Far from home? A researcher wants to estimate the 

average distance that students at a large community col- 
lege live from campus. To find out, she surveys a simple 
random sample of students from the registrar's database. 
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Will the researcher’s sample result be exactly the 
same as the true population mean? Explain. 


Which would be more likely to get the researcher’s 
sample result closer to the true population value: 
an SRS of 100 students or an SRS of 50 students? 
Explain. 


Baseball tickets Suppose you want to know the 
average amount of money spent by the fans attend- 
ing opening day for the Cleveland Indians baseball 
season. You get permission from the team’s manage- 
ment to conduct a survey at the stadium, but they 
will not allow you to bother the fans in the club 
seating or box seats (the most expensive seating). Us- 
ing a computer, you randomly select 500 seats from 
the rest of the stadium. During the game, you ask the 
fans in those seats how much they spent that day. 

Give a reason why this survey might yield a biased 
result. Explain the likely direction of the bias. 


Rise and shine How long before school starts do 
students get out of bed, on average? Administrators 
survey a random sample of students on each school 
bus one morning. 

Give a reason why this survey might yield a biased 
result. Explain the likely direction of the bias. 


Nonresponse A survey of drivers began by randomly 
sampling all listed residential telephone numbers in 
the United States. Of 45,956 calls to these numbers, 
5029 were completed. The goal of the survey was to 
estimate how far people drive, on average, per day.” 


What was the rate of nonresponse for this sample? 


Explain how nonresponse can lead to bias in this 
survey. Be sure to give the direction of the bias. 


Ring-no-answer A common form of nonresponse in 
telephone surveys is “ring-no-answer.” ‘That is, a call 
is made to an active number but no one answers. 
The Italian National Statistical Institute looked at 
nonresponse to a government survey of households 
in Italy during the periods January | to Easter and 
July | to August 31. All calls were made between 7 
and 10 p.M., but 21.4% gave “ring-no-answer” in one 
period versus 41.5% “ring-no-answer’” in the other 
period.'® Which period do you think had the higher 
rate of no answers? Why? Explain why a high rate of 
nonresponse makes sample results less reliable. 


Running red lights The sample described in 
Exercise 31 produced a list of 5024 licensed drivers. 
The investigators then chose an SRS of 880 of these 
drivers to answer questions about their driving habits. 
One question asked was: “Recalling the last ten traf 
fic lights you drove through, how many of them were 
red when you entered the intersections?” Of the 880 
respondents, 171 admitted that at least one light had 
been red. A practical problem with this survey is that 
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people may not give truthful answers. What is the 
likely direction of the bias? Explain. 


Seat belt use A study in El Paso, ‘Texas, looked at seat 
belt use by drivers. Drivers were observed at randomly 
chosen convenience stores. After they left their cars, 
they were invited to answer questions that included 
questions about seat belt use. In all, 75% said they al- 
ways used seat belts, yet only 61.5% were wearing seat 
belts when they pulled into the store parking lots.!” 
Explain the reason for the bias observed in responses 
to the survey. Do you expect bias in the same direc- 
tion in most surveys about seat belt use? 


Wording bias Comment on each of the following 
as a potential sample survey question. Is the question 
clear? Is it slanted toward a desired response? 


“Some cell phone users have developed brain 
cancer. Should all cell phones come with a warning 
label explaining the danger of using cell phones?” 


“Do you agree that a national system of health insur- 
ance should be favored because it would provide 
health insurance for everyone and would reduce 
administrative costs?” 


“In view of escalating environmental degradation 
and incipient resource depletion, would you favor 
economic incentives for recycling of resource— 
intensive consumer goods?” 


Checking for bias Comment on each of the follow- 
ing as a potential sample survey question. Is the ques- 
tion clear? Is it slanted toward a desired response? 
Which of the following best represents your opinion 
on gun control? 

1. The government should confiscate our guns. 

2. We have the right to keep and bear arms. 

A freeze in nuclear weapons should be favored 
because it would begin a much-needed process to 
stop everyone in the world from building nuclear 
weapons now and reduce the possibility of nuclear 
war in the future. Do you agree or disagree? 


Multiple choice: Select the best answer for Exercises 
37 1042. 
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The Web portal AOL places opinion poll questions 
next to many of its news stories. Simply click your 
response to join the sample. One of the questions 
in January 2008 was “Do you plan to diet this year?” 
More than 30,000 people responded, with 68% say- 
ing “Yes.” You can conclude that 


about 68% of Americans planned to diet in 2008. 


the poll used a convenience sample, so the results 
tell us little about the population of all adults. 

the poll uses voluntary response, so the results tell us 
little about the population of all adults. 
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(d) the sample is too small to draw any conclusion. 

(ec) None of these. 

38. ‘To gather information about the validity of a new 
standardized test for tenth-grade students in a par- 
ticular state, a random sample of 15 high schools was 
selected from the state. The new test was adminis- 
tered to every 10th-grade student in the selected high 
schools. What kind of sample is this? 

(a) Asimple random sample 

(b) A stratified random sample 

(c) Acluster sample 

(d) A systematic random sample 

(e) A voluntary response sample 

39. Your statistics class has 30 students. You want to call 
an SRS of 5 students from your class to ask where 
they use a computer for the online quizzes. You 
label the students 01, 02, ..., 30. You enter the table 
of random digits at this line: 

14459 26056 31424 80371 65103 62253 22490 61181 

Your SRS contains the students labeled 

(2) ROD GONG: 

(oy) Wee SISOS), TO), 22 

(CEOS OR 222.27 

(d) 14, 03, 10, 22, 06. 

(e) 14,03, 10, 22, 11. 

40. Suppose that 35% of the registered voters in a state 
are registered as Republicans, 40% as Democrats, 
and 25% as Independents. A newspaper wants to se- 
lect a sample of 1000 registered voters to predict the 
outcome of the next election. If they randomly select 
350 Republicans, randomly select 400 Democrats, 
and randomly select 250 Independents, did this sam- 
pling procedure result in a simple random sample of 
registered voters from this district? 

(a) Yes, because each registered voter had the same 
chance of being chosen. 

(b) Yes, because random chance was involved. 

(c) No, because not all registered voters had the same 
chance of being chosen. 

(d) No, because there were a different number of regis- 
tered voters selected from each party. 

(e) No, because not all possible groups of 1000 regis- 
tered voters had the same chance of being chosen. 

41. Alocal news agency conducted a survey about unem- 


ployment by randomly dialing phone numbers until 

they had gathered responses from 1000 adults in their 
state. In the survey, 19% of those who responded said 
they were not currently employed. In reality, only 6% 
of the adults in the state were not currently employed 
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CHAPTER 4 


at the time of the survey. Which of the following best 
explains the difference in the two percentages? 

The difference is due to sampling variability. We 
shouldn’t expect the results of a random sample to 
match the truth about the population every time. 


The difference is due to response bias. Adults who 
are employed are likely to lie and say that they are 
unemployed. 

The difference is due to undercoverage bias. ‘The 
survey included only adults and did not include 
teenagers who are eligible to work. 


The difference is due to nonresponse bias. Adults 
who are employed are less likely to be available for 
the sample than adults who are unemployed. 

The difference is due to voluntary response. Adults 
are able to volunteer as a member of the sample. 

A simple random sample of 1200 adult Americans 
is selected, and each person is asked the following 
question: “In light of the huge national deficit, 
should the government at this time spend ad- 
ditional money to establish a national system of 
health insurance?” Only 39% of those responding 
answered “Yes.” This survey 

is reasonably accurate since it used a large simple 
random sample. 

needs to be larger since only about 24 people were 
drawn from each state. 

probably understates the percent of people who favor 
a system of national health insurance. 


Distinguish between an observational study and an 
experiment. 

Explain the concept of confounding and how it 
limits the ability to make cause-and-effect 


conclusions. 


Identify the experimental units, explanatory 
and response variables, and treatments in an 
experiment. 


Explain the purpose of comparison, random assign- 
ment, control, and replication in an experiment. 
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is very inaccurate but neither understates nor over- 
states the percent of people who favor a system of 
national health insurance. Because simple random 
sampling was used, it is unbiased. 

probably overstates the percent of people who favor a 
system of national health insurance. 


Sleep debt (3.2) A researcher reported that the typi- 
cal teenager needs 9.3 hours of sleep per night but 
gets only 6.3 hours.!* By the end of a 5-day school 
week, a teenager would accumulate about 15 hours 
of “sleep debt.” Students in a high school statistics 
class were skeptical, so they gathered data on the 
amount of sleep debt (in hours) accumulated over 
time (in days) by a random sample of 25 high school 
students. The resulting least-squares regression equa- 


tion for their data is Sleep debt = 2.23 + 3.17(days). 
Interpret the slope of the regression line in context. 


Are the students’ results consistent with the 
researcher's report? Explain. 


Internet charges (2.1) Some Internet service provid- 
ers (ISPs) charge companies based on how much 
bandwidth they use in a month. One method that 
ISPs use for calculating bandwidth is to find the 95th 
percentile of a company’s usage based on samples of 
hundreds of 5-minute intervals during a month. 


Explain what “95th percentile” means in this setting. 


Which would cost a company more: the 95th per- 
centile method or a similar approach using the 98th 
percentile? Justify your answer. 


Experiments 


By the end of the section, you should be able to: 


Describe a completely randomized design for an experi- 
ment, including how to randomly assign treatments using 
slips of paper, technology, or a table of random digits. 
Describe the placebo effect and the purpose of blinding 
in an experiment. 

Interpret the meaning of statistically significant in the 
context of an experiment. 

Explain the purpose of blocking in an experiment. 
Describe a randomized block design or a matched pairs 
design for an experiment. 
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A sample survey aims to gather information about a population without disturb- 
ing the population in the process. Sample surveys are one kind of observational 
study. Other observational studies watch the behavior of animals in the wild or the 
interactions between teacher and students in the classroom. This section is about 
statistical designs for experiments, a very different way to produce data. 


Observational Study versus Experiment 


In contrast to observational studies, experiments don’t just observe individuals 
or ask them questions. They actively impose some treatment to measure the re- 
sponse. Experiments can answer questions like “Does aspirin reduce the chance 
of a heart attack?” and “Can yoga help dogs live longer?” 


DEFINITION: Observational study and experiment 


An observational study observes individuals and measures variables of interest but 
does not attempt to influence the responses. 


An experiment deliberately imposes some treatment on individuals to measure their 
responses. 


The goal of an observational study can be to describe some group or situation, 
to compare groups, or to examine relationships between variables. The purpose 
of an experiment is to determine whether the treatment causes a change in the 
response. An observational study, even one based on a random sample, is a poor 
way to gauge the effect that changes in one variable have on another variable. To 
see the response to a change, we must actually impose the change. When our goal 
is to understand cause and effect, experiments are the only source of fully convincing 
data. For this reason, the distinction between observational study and experiment 
is one of the most important in statistics. 


Does Taking Hormones Reduce Heart 


Attack Risk after Menopause? 
Observation versus experiment 


Should women take hormones such as estrogen after menopause, when natural 
production of these hormones ends? In 1992, several major medical organizations 
said “Yes.” Women who took hormones seemed to reduce their risk of a heart at- 
tack by 35% to 50%. The risks of taking hormones appeared small compared 
with the benefits. 


The evidence in favor of hormone replacement came from a number of ob- 
servational studies that compared women who were taking hormones with 
others who were not. But the women who chose to take hormones were 
richer and better educated and saw doctors more often than women who 
didn’t take hormones. Because the women who took hormones did 
many other things to better maintain their health, it isn’t surprising 
that they had fewer heart attacks. 
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From Chapter 3: A response variable 
measures an outcome of a study. An 
explanatory variable may help explain 
or predict changes in a response 
variable. 


Some people call a variable that results 
in confounding, like the number of 
doctor visits per year in this case, a 
confounding variable. 


AP® EXAM TIP If you are 
asked to identify a possible 
confounding variable in a given 
setting, you are expected to 


explain how the variable you 
choose (1) is associated with 
the explanatory variable and 
(2) affects the response 
variable. 


To get convincing data on the link between hormone replacement and heart at- 
tacks, we should do an experiment. Experiments don’t let women decide what to 
do. They assign women to either hormone replacement pills or to placebo pills 
that look and taste the same as the hormone pills. The assignment is done by 
a coin toss, so that all kinds of women are equally likely to get either treatment. 
By 2002, several experiments with women of different ages agreed that hormone 
replacement does not reduce the risk of heart attacks. The National Institutes of 
Health, after reviewing the evidence, concluded that the first studies were wrong. 
Taking hormones after menopause quickly fell out of favor.'” 


For each of these studies, the explanatory variable was whether or not a woman 
took hormones, and the response variable was whether or not the woman had a 
heart attack. Researchers wanted to argue that changes in the explanatory variable 
(hormone status) actually caused changes in the response variable (heart attack 
status). In the early observational studies, however, the effect of taking hormones 
was mixed up with the characteristics of women who chose to take them. These 
other variables make it hard to see the true relationship between the explanatory 
and response variables. 

Let’s consider two other variables from the observational studies of hormone 
replacement: number of doctor visits per year and age. The women who chose to 
take hormones visited their doctors more often than the women who didn’t take 
hormones. Did the women in the hormone group have fewer heart attacks be- 
cause they got better health care or because they took hormones? We can’t be sure. 
A situation like this, in which the effects of two variables on a response variable 
cannot be separated from each other, is called confounding. 

What about age? Older women are at greater risk of having a heart attack than 
younger women. If the women who took hormones were generally younger than 
those who didn’t, we’d have more confounding. That wasn’t the case, however. 
There was no link between age and group membership (hormones or not) in the 
observational studies. If there is no difference between the groups with respect to 
the other variable, there can be no confounding. 


DEFINITION: Confounding 


Confounding occurs when two variables are associated in such a way that their ef- 
fects on a response variable cannot be distinguished from each other. 


Observational studies of the effect of an explanatory variable on a re- 
sponse variable often fail because of confounding between the explanatory 
variable and one or more other variables. Well-designed experiments take 
steps to prevent confounding. The later hormone therapy experiments avoided 
confounding by letting chance decide who took hormones and who didn’t. That 
way, women who took better care of themselves were split about evenly between 
the two groups. So were older women and younger women. When these experi- 
ments found no reduction in heart attack risk for women taking hormones, re- 
searchers began to doubt the results of the earlier observational studies. The moral 
of the story is simple: beware the influence of other variables! 
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CHECK YOUR UNDERSTANDING 


1. Does reducing screen brightness increase battery life in laptop computers? ‘To find 
out, researchers obtained 30 new laptops of the same brand. ‘They chose 15 of the com- 
puters at random and adjusted their screens to the brightest setting. 
The other 15 laptop screens were left at the default setting — moderate 
brightness. Researchers then measured how long each machine’s bat- 
tery lasted. Was this an observational study or an experiment? Justify 
your answer. 

Questions 2 to 4 refer to the following setting. Does eating dinner 
with their families improve students’ academic performance? Accord- 
ing to an ABC News article, “Teenagers who eat with their families at 
least five times a week are more likely to get better grades in school.””° 
This finding was based on a sample survey conducted by researchers at 
Columbia University. 


2. Was this an observational study or an experiment? Justify your answer. 
3. What are the explanatory and response variables? 


4. Explain clearly why such a study cannot establish a cause-and-effect relationship. 
Suggest a variable that may be confounded with whether families eat dinner together. 


The Language of Experiments 


An experiment is a statistical study in which we actually do something (a treat- 
ment) to people, animals, or objects (the experimental units) to observe the re- 
sponse. Here is the basic vocabulary of experiments. 


DEFINITION: Treatment, experimental units, subjects 


A specific condition applied to the individuals in an experiment is called a treatment. 
If an experiment has several explanatory variables, a treatment is a combination of 
specific values of these variables. 


The experimental units are the smallest collection of individuals to which treat- 
ments are applied. When the units are human beings, they often are called subjects. 


The best way to learn the language of experiments is to practice using it. 


When Will | Ever Use This Stuff? 


Vocabulary of experiments 


Researchers at the University of North Carolina were concerned about the in- 
creasing dropout rate in the state’s high schools, especially for low-income stu- 
dents. Surveys of recent dropouts revealed that many of these students had start- 
ed to lose interest during middle school. They said they saw little connection 
between what they were studying in school and their future plans. To change 
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this perception, researchers developed a program called CareerStart. The big 
idea of the program is that teachers show students how the topics they learn get 
used in specific careers. 


To test the effectiveness of CareerStart, the researchers recruited 14 middle 
schools in Forsyth County to participate in an experiment. Seven of the 
schools, determined at random, used CareerStart along with the district’s 
standard curriculum. The other seven schools just followed the standard 
curriculum. Researchers followed both groups of students for several years, 
collecting data on students’ attendance, behavior, standardized test scores, 
level of engagement in school, and whether or not the students graduated 


from high school. 


Results: Students at schools that used CareerStart generally had better at- 
tendance and fewer discipline problems, earned higher test scores, report- 
ed greater engagement in their classes, and were more likely to graduate.”! 


PROBLEM: Identify the experimental units, explanatory and response variables, and the treat- 
ments in the CareerStart experiment. 


SOLUTION: The experimental units are 14 middle schools in Forsyth County, NC. The explanatory 
variable is whether the school used the CareerStart program with its students. Several response 
variables were measured, including test scores, attendance, behavior, student engagement, and 
graduation rates. This experiment compares two treatments: (1) the standard middle school cur- 
riculum and (2) the standard curriculum plus CareerStart. 


For Practice Try Exercise 


Note that the experimental units in the CareerStart example are the schools, 
not individual students. Experimental units are the smallest collection of indi- 
viduals to which treatments are applied. The curricular treatments were adminis- 
tered to entire schools, so those are the experimental units. 

The previous example illustrates the big advantage of experiments over obser- 
vational studies: experiments can give good evidence for causation. In an experi- 
ment, we study the effects of the specific treatments we are interested in, while 
trying to control for the effects of other variables. For instance, the students in 
all 14 schools followed the standard curriculum. To ensure that the two groups 
of schools were as similar as possible before the treatments were administered, 
researchers let chance decide which 7 schools would use CareerStart. The only 
systematic difference between the schools was the educational treatment. When 
students from the CareerStart schools did much better, researchers were able to 
conclude that the program made the difference. 

Sometimes, the explanatory variables in an experiment are called factors. Many 
experiments study the joint effects of several factors. In such an experiment, each 
treatment is formed by combining a specific value (often called a level) of each of 
the factors. Here’s an example of a multifactor experiment. 
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TV Advertising 


Experiments with multiple explanatory variables 


What are the effects of repeated exposure to an advertising message? The answer 
may depend on both the length of the ad and on how often it is repeated. An 
experiment investigated this question using 120 undergraduate students who 
volunteered to participate. All subjects viewed a 40-minute television pro- 
gram that included ads for a digital camera. Some subjects saw a 30-second 
commercial; others, a 90-second version. ‘he same commercial was shown 
either 1, 3, or 5 times during the program. After viewing, all the subjects an- 
swered questions about their recall of the ad, their attitude 
toward the camera, and their intention to purchase it.”” 


Subjects assigned 
to Treatment 3 see PROBLEM: Forthe advertising study, identify the experimen- 
4 30-second ad P F | : 
ni nd response variables, and th 
beter ® eystiinet dariva tal units or subjects, explanatory and response variables, and the 
Repetitions the program. treatments. 


1time 3times 5 times SOLUTION: 
30 


seconds The subjects are the 120 undergraduate students. This experiment 
pple has 2 explanatory variables (factors): length of the commercial and 
number of repetitions. The response variables include measures of 
subjects recall of the ad, their attitudes about the digital camera, 
and whether they intend to purchase it. 


FIGURE 4.2 The six treatments There are 2 different lengths of commercial (30 and 90 seconds) and three different numbers of 
in the TV ad experiment. repetitions (1, 3, and 5). The 6 combinations consisting of one level of each factor form the 6 treat- 
Combinations of values of ments shown in Figure 4.2: (1) 30 seconds, 1 time; (2) 30 seconds, 3 times; (3) 30 seconds, 5 


the two explanatory variables times; (4) 90 seconds, 1 time; (5) 90 seconds, 3 times; (6) 90 seconds, 5 times. 
(factors) form six treatments. 


seconds 


For Practice Try Exercise 


This example shows how experiments allow us to study the combined effect of 
several factors. The interaction of several factors can produce effects that could 
not be predicted from looking at the effect of each factor alone. Perhaps longer 
commercials increase interest in a product, and more commercials also increase 
interest. But if we both make a commercial longer and show it more often, viewers 
get annoyed and their interest in the product drops. The two-factor experiment in 
the TV advertising example will help us find out. 


How to Experiment Badly 


Experiments are the preferred method for examining the effect of one variable on 
another. By imposing the specific treatment of interest and controlling other influ- 
ences, we can pin down cause and effect. Good designs are essential for effective 
experiments, just as they are for sampling. To see why, let’s start with an example 
of a bad experimental design. 
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Are Online SAT Prep Courses 


Effective? 


A bad experiment 


A high school regularly offers a review course to prepare students for the SAT. 
This year, budget cuts will allow the school to offer only an online version of the 
course. Suppose the group of students who take the online course earn an average 
increase of 45 points in their math scores from a pre-test to the actual SAT test. 
Can we conclude that the online course is effective? 


This experiment has a very simple design. A group of subjects (the students) were 
exposed to a treatment (the online course), and the outcome (increase in math 
scores) was observed. Here is the design: 


Students > Online course — increase in math scores 


A closer look showed that many of the students in the online review course were 
taking advanced math classes in school. Maybe the students in the online course 
improved their math scores because of what they were learning in their school 
math classes, not because of the online course. This confounding prevents us from 
concluding that the online course is effective. 


Many laboratory experiments use a design like the one in the online SAT 
course example: 


Experimental units > Treatment > Measure response 


In the lab environment, simple designs often work well. Field experiments and 
experiments with animals or people deal with more varied conditions. Outside the 
lab, badly designed experiments often yield worthless results because of confounding. 


How to Experiment Well 


The remedy for the confounding in the SAT prep course example is to do a com- 
parative experiment in which some students are taught the SAT course in the 
classroom and other, similar students take the course online. Most well-designed 
experiments compare two or more treatments. 

Comparison alone isn’t enough to produce results we can trust. If the treat- 
ments are given to groups that differ greatly when the experiment begins, bias will 
result. For example, if we allow students to select online or classroom instruction, 
more self-motivated students are likely to sign up for the online course. Allowing 
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personal choice will bias our results in the same way that volunteers bias the re- 
sults of online opinion polls. The solution to the problem of bias in sampling is 
random selection. In experiments, the solution is random assignment. 


TT 


DEFINITION: Random assignment 


In an experiment, random assignment means that experimental units are assigned 
to treatments using a chance process. 


Let’s look at how random assignment can be used to improve the design of the 
SAT prep course experiment. 


SAT Prep: Online versus Classroom 


How random assignment works 


This year, the high school has enough budget money to compare the online SAT 
course with the classroom SAT course. Fifty students have agreed to participate in 
an experiment comparing the two instructional methods. 


PROBLEM: Describe how you would randomly assign 25 students to each of the two methods: 
(a) Using 50 identical slips of paper 

(b) Using technology 

(c) Using Table D 


SOLUTION: 


(a) The simplest way would be to use the “hat method.” Write each subject's name on one of the 
slips. Put all the slips ina hat and mix them thoroughly. Draw them out one at a time until you have 
25 slips. These 25 students will take the online course. The remaining 25 students will take the 
classroom course. Alternatively, you could write “online” on 25 of the slips and “classroom” on the 
other 25 slips. Then put the slips in a hat and mix them well. Have students come up one by one and 
(without looking) pick a slip from the hat. This guarantees 25 students per group, with the treat- 
ments assigned by chance. 


(b) Give numbers 1,2, 3,...,49, 50 to the subjects in alphabetical order by last name. Then use 
your calculator’s randInt command or a computer's random number generator to produce numbers 
between 1 and 50. Ignore any repeated numbers. The first 25 different numbers chosen select the 
students for the online course. The remaining 25 subjects will take the classroom course. 


(c) Give labels 01,02, 03,...,49,50 to the subjects in alphabetical order by last name. Go toa line 
of Table D and read two-digit groups moving from left to right. The first 25 distinct labels between 01 
and 50 identify the 25 students that are assigned to the online course. The remaining 25 students will 
take the classroom course. Ignore repeated labels and groups of digits from 51 to 00. 


For Practice Try Exercise 


242 CHAPTER 4 DESIGNING STUDIES 


In statistics, replication means “use 
enough subjects.” In other fields, 

the term “replication” has a different 
meaning. When one experiment is 
conducted and then the same or a 
similar experiment is independently 
conducted in a different location by 
different investigators, this is known as 
replication. That is, replication means 
repeatability. 


Random assignment should distribute the students taking advanced math class- 
es in roughly equal numbers to each group. It should also balance out the number 
of students with lots of extracurricular activities and those with part-time jobs in 
the classroom and online SAT prep courses. Random assignment helps ensure 
that the effects of other variables (such as current math course or amount of avail- 
able study time) are spread evenly among the two groups. 

Although random assignment should create two groups of students that are 
roughly equivalent to begin with, we still have to ensure that the only consistent 
difference between the groups during the experiment is the type of SAT prep they 
receive. We can control for the effects of some variables by keeping them the 
same for both groups. For instance, we should give all students the same pretest 
and actual SAT test at the same times on the same days. The length, timing, con- 
tent, and instructor of the SAT prep classes should also be the same. 

Because the two groups are alike except for the treatments, any difference in 
their average math score improvements must be due either to the treatments 
themselves or to the random assignment. We can’t say that any difference between 
the average SAT scores of students enrolled online and in the classroom must be 
caused by a difference in the effectiveness of the two types of instruction. There 
would be some difference, even if both groups received the same instruction, be- 
cause of variation among students in background and study habits. Chance as- 
signs students to one group or the other, which results in a chance difference 
between the groups. 

We would not trust an experiment with just one student in each group. The re- 
sults would depend too much on which group got lucky and received the stronger 
student. If we assign many subjects to each group, however, the effects of chance 
will balance out, and there will be little difference in the average responses in the 
two groups unless the treatments themselves cause a difference. This is the idea 
of replication: use enough experimental units to distinguish a difference in the 
effects of the treatments from chance variation due to the random assignment. 


PRINCIPLES OF EXPERIMENTAL DESIGN 


THINK 
ABOUT IT 
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Why is control important in an experiment? For two reasons. Sup- 
pose we used two different instructors in the SAT experiment. If Mrs. McDonald 
taught the online group and Mr. ‘Tyson taught the classroom group, then course 
type will be confounded with instructor. We won’t know if the difference in aver- 
age improvement for the two groups was due to the difference in instructor or the 
difference in course type. So one reason we need to control other variables is to 
prevent confounding. 

The second reason we should control other variables is to reduce variability in 
the response variable. Suppose that we allow students in both groups to choose 
how many class sessions to attend. Their choices will increase the variation in the 
response variable (improvement) for both groups. Some students will attend fewer 
sessions and experience smaller improvements than they would have otherwise. 
Other students will attend as many sessions as possible and experience bigger im- 
provements than they might have otherwise. This increase in variation will make 
it harder to see if one treatment is really more effective. 

The dotplots on the left show the results of an experiment in which the number 
of class sessions was the same for all participating students. From these graphs, it 
seems clear that the online course is more effective than the classroom course. The 
dotplots on the right show the results of an experiment in which the students were 
permitted to choose the number of class sessions they attended. Notice that the 
centers of the distributions haven’t changed, but the distributions are much more 
variable. The increased overlap in the graphs makes the evidence supporting the 
online course less convincing. 
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Let’s see how these principles were used in designing a famous medical experiment. 


The Physicians’ Health Study 


A well-designed experiment 


Does regularly taking aspirin help protect people against heart attacks? The 
Physicians’ Health Study was a medical experiment that helped answer this ques- 
tion. In fact, the Physicians’ Health Study looked at the effects of two drugs: aspi- 
rin and beta-carotene. Researchers wondered whether beta-carotene would help 
prevent some forms of cancer. The subjects in this experiment were 21,996 male 
physicians. There were two explanatory variables (factors), each having two levels: 
aspirin (yes or no) and beta-carotene (yes or no). Combinations of the levels of 
these factors form the four treatments shown in Figure 4.3 on the next page. One- 
fourth of the subjects were assigned at random to each of these treatments. 
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Factor 1: 
Aspirin 


No 


Factor 2: Beta-carotene 
No 


alia 


Aspirin Beta-carotene Aspirin Placebo 


alia 


Placebo Beta-carotene Placebo Placebo 


FIGURE 4.3 The treatments in the Physicians’ Health Study. 


Why did researchers decide to do 
the Physicians’ Health Study (PHS)? 
The interesting history that led to 
this experiment is detailed at the 
PHS Web site. You can also find out 


On odd-numbered days, the subjects took either a tablet 
that contained aspirin or a dummy pill that looked and 
tasted like the aspirin but had no active ingredient (a 
placebo). On even-numbered days, they took either a 
capsule containing beta-carotene or a placebo. There 
were several response variables—the study looked for 
heart attacks, several kinds of cancer, and other medical 
outcomes. After several years, 239 of the placebo group 
but only 139 of the aspirin group had suffered heart at- 
tacks. This difference is large enough to give good evi- 
dence that taking aspirin does reduce heart attacks.” 
It did not appear, however, that beta-carotene had any 
effect on preventing cancer. 


PROBLEM: Explain how each of the four principles of experimental design was used in the Physi- 


cians’ Health Study. 
SOLUTION: 


Random assignment: Wasu 


about the Physicians’ Health Study ll, the same schedule of pill taking. 


which ended in December 2007. 


Comparison: Researchers used a design that compared both of the active treatments to a placebo. 


sed to determine which subjects received each of the four treatment 


combinations. This helped ensure that the treatment groups were roughly equivalent to begin with. 
Control: The experiment used subjects of the same gender and occupation. All subjects followed 


Replication: There were over 5000 subjects per treatment, group. This large number of subjects 


helped ensure that the difference in heart attacks was due to the aspirin and not to chance variation 


in the random assignment. 


For Practice Try Exercise 


The Physicians’ Health Study shows how well-designed experiments can yield 
good evidence that differences in the treatments cause the differences we observe 


in the response. 


Completely Randomized Designs 


The diagram in Figure 4. 


4 presents the details of the SAT prep experiment: ran- 


dom assignment, the sizes of the groups and which treatment they receive, and 
the response variable. There are, as we will see later, statistical reasons for using 


treatment groups that are 


about equal in size. This type of design is called a com- 


pletely randomized design. 


50 volunteer Random 


Group 1 Treatment 1 


wer 25 students Online i 
Compare 


students assignment a SAT scores 
ae Group 2 Treatment 2 


FIGURE 4.4 Outline of a comp 
instruction. 


25 students Classroom 


letely randomized design to compare online and classroom 


THINK 
ABOUT IT 
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DEFINITION: Completely randomized design 


In a completely randomized design, the experimental units are assigned to the 
treatments completely by chance. 


Notice that the definition of a completely randomized design does not require 
that each treatment be assigned to an equal number of experimental units. It does 
specify that the assignment of treatments must occur completely at random. 


Does using chance to assign treatments in an experiment 
guarantee a completely randomized design? Actually, no. Let’s 
return to the SAT prep course experiment. Another way to randomly assign the 
50 students to the two treatments is by tossing a coin. Have each student come 
forward and toss a coin. If it’s heads, then the student will take the course online. 
If it’s tails, then the student will take the classroom course. 

As long as all 50 students toss a coin, this is still a completely randomized 
design. Of course, the two experimental groups are unlikely to contain exactly 
25 students each due to the chance variation in coin tosses. 

The problem comes if we try to force the two groups to have equal sizes. 
Suppose we let the coin tossing continue until one of the groups has 25 students 
and then place the remaining students in the other group. This is no longer a 
completely randomized design, because the last few students aren’t being as- 
signed to one of the treatments by chance. In fact, these students will all end 
up in the same group, which could lead to bias if these individuals share some 
characteristic that would systematically affect the response variable. For example, 
if the students came to toss the coin last because they’re lazier than the other 
students who volunteered, then the SAT prep class that they’re in will seem less 
effective than it really is. 
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Completely randomized designs can compare any number of treatments. Here 
is an experiment that compares three treatments. 


Conserving Energy 
A completely randomized design 


Many utility companies have introduced programs to encourage energy conserva- 
tion among their customers. An electric company considers placing small digital 
displays in households to show current electricity use and what the cost would be 
if this use continued for a month. Will the displays reduce electricity use? One 
cheaper approach is to give customers a chart and information about monitoring 
their electricity use from their outside meter. Would this method work almost as 
well? The company decides to conduct an experiment to compare these two ap- 
proaches (display, chart) with a group of customers who receive information about 
energy consumption but no help in monitoring electricity use. 
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PROBLEM: Describe a completely randomized design involving 60 single-family residences in 
the same city whose owners are willing to participate in such an experiment. Write a few sentences 
explaining how you would implement your design. 


SOLUTION: Figure 4.5 outlines the design. We'll randomly assign 20 houses to each of three 
treatments: digital display, chart plus information, and information only. Our response variable is the 
total amount of electricity used in a year. 


Group 1 Treatment 1 


a 20 houses Display * 
Random Group 2 Treatment 2 Compare 
60 houses §=£——> : —_> — — Boe 
assignment 20 houses Chart electricity 
* vA use 
Group 3 Treatment 3 
20 houses Control 


FIGURE 4.5 Outline of a completely randomized design to compare three energy-saving 
programs. 


To implement the design, start by labeling each house with a distinct number from 1 to 60. Write 
the labels on 60 identical slips of paper, put them ina hat, and mix them well. Draw out 20 slips. The 
corresponding homes will be given digital displays showing current electricity use. Now draw out 20 
more slips. Those homes will use a chart. The remaining 20 houses will be given information about 
energy consumption but no way to monitor their usage. At the end of the year, compare how much 
electricity was used by the homes in the three groups. 


AP® EXAM TIP If you are asked to describe the design of an experiment on the AP® exam, you 
won’t get full credit for providing only a diagram like Figure 4.5. You are expected to describe 


how the treatments are assigned to the experimental units and to clearly state what will be 
measured or compared. Some students prefer to start with a diagram and then add a few 
sentences. Others choose to skip the diagram and put their entire response in narrative form. 


For Practice Try Exercise 


Why did we include a control group of 20 houses in the energy conservation 
experiment? The main purpose of a control group is to provide a baseline for com- 
paring the effects of the other treatments. Without such a comparison group, we 
wouldn’t be able to tell whether homes with digital displays or charts used less 
electricity than homes without such aids. 


THINK Was a control group really necessary? You might be thinking that the 
change in electricity use from last year to this year in the houses with displays and 

ABOUT IT charts would tell us whether these treatments helped. Unfortunately, it’s not that sim- 
ple. Suppose last year’s temperatures were more extreme than this year’s. Then many 

households might show a decrease in electricity use, but we couldn’t be sure whether 

this change was due to the weather or to the treatments. (Can you say confounding?!) 


°e_$AA]A]Pj 


Section 4.2 Experiments 4.247 


Many experiments (like the one in the previous example) include a control group 
that receives an inactive treatment. However, a control group can also be given an 
active treatment. Suppose we want to compare the effectiveness of a newly developed 
drug for treating a serious disease with a drug that’s already known to work. In that 
case, the experimental units that receive the existing drug form the control group. 

Some experimental designs don’t include a control group. That’s appropriate if 
researchers simply want to compare the effects of several treatments, rather than 
determining whether any of them works better than an inactive treatment. For 
instance, a state’s highway department wants to see which of three brands of paint 
will last longest when marking lane lines on the freeway. Putting no paint on the 
highway is clearly not a good option! 


CHECK YOUR UNDERSTANDING 


Music students often don’t evaluate their own performances accurately. Can small-group 
discussions help? The subjects were 29 students preparing for the end-ofsemester perfor- 
mance that is an important part of their grade. The 15 students in one group each video- 
taped a practice performance, evaluated it themselves, and then discussed the tape with a 
small group of other students. ‘The remaining 14 students watched and evaluated their tapes 
alone. At the end of the semester, the discussion-group students evaluated their final perfor- 
mance more accurately.”4 


1. Describe a completely randomized design for this experiment. Write a few sentences 
describing how you would implement your design. 


2. What is the purpose of the control group in this experiment? 


Experiments: What Can Go Wrong? 


The logic of a randomized comparative experiment depends on our ability to treat 
all the subjects the same in every way except for the actual treatments being com- 
pared. Good experiments, therefore, require careful attention to details to ensure 
that all subjects really are treated identically. 

If some subjects in a medical experiment take a pill each day and a control 
group takes no pill, the subjects are not treated identically. Many medical ex- 
periments are therefore “placebo-controlled,” like the Physicians’ Health Study. 
On odd-numbered days, all the subjects took an aspirin or a placebo. On even- 
numbered days, all of them took either a beta-carotene pill or a placebo. 

Many patients respond favorably to any treatment, even a placebo, perhaps 
because they trust the doctor. The response to a dummy treatment is called the 
placebo effect. If some subjects in the Physicians’ Health Study did not take any 
pills, the effect of aspirin or beta-carotene would be confounded with the placebo 

. effect, the effect of simply taking pills. 


Curing Baldness and Soothing Pain 


Do placebos work? 


Want to help balding men keep their hair? Give them a placebo. One study found 
that 42% of balding men maintained or increased the amount of hair on their 
heads when they took a placebo. In another study, researchers zapped the wrists 
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Reports in medical journals regularly 
begin with words like these, from a 
study of a flu vaccine given as a nose 
spray: “This study was a randomized, 
double-blind, placebo-controlled trial. 
Participants were enrolled from 13 
sites across the continental United 
States between mid-September 

and mid-November.””” Doctors are 
supposed to know what this means. 
Now you know, too. 
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of 24 test subjects with a painful jolt of electricity. Then they rubbed a cream with 
no active medicine on subjects’ wrists and told them the cream should help soothe 
the pain. When researchers shocked them again, 8 subjects said they experienced 
significantly less pain.” 


When the ailment is vague and psychological, like depression, some experts think 
that the placebo effect accounts for about three-quarters of the effect of the most 
widely used drugs.”° Others disagree. In any case, “placebos work” is a good place 
to start when you think about planning medical experiments. 


The strength of the placebo effect is a strong argument for randomized com- 
parative experiments. In the baldness study, 42% of the placebo group kept or 
increased their hair, but 86% of the men getting a new drug to fight baldness did 
so. The drug beats the placebo, so it has something besides the placebo effect 
going for it. Of course, the placebo effect is still part of the reason this and other 
treatments work. 

Because the placebo effect is so strong, it would be foolish to tell subjects in a 
medical experiment whether they are receiving a new drug or a placebo. Knowing 
that they are getting “just a placebo” might weaken the placebo effect and bias the 
experiment in favor of the other treatments. 

It is also foolish to tell doctors and other medical personnel what treatment 
each subject received. If they know that a subject is getting “just a placebo,” they 
may expect less than if they know the subject is receiving a promising experimen- 
tal drug. Doctors’ expectations change how they interact with patients and even 
the way they diagnose a patient’s condition. Whenever possible, experiments with 
human subjects should be double-blind. 


DEFINITION: Double-blind 


In a double-blind experiment, neither the subjects nor those who interact with them 
and measure the response variable know which treatment a subject received. 


The idea of a double-blind design is simple. Until the experiment ends and the 
results are in, only the study’s statistician knows for sure which treatment a subject 
is receiving. However, some experiments cannot be carried out in a double-blind 
manner. If researchers are comparing the effects of exercise and dieting on weight 
loss, then subjects will know which treatment they are receiving. Such an experi- 
ment can still be single-blind if the individuals who are interacting with the sub- 
jects and measuring the response variable don’t know who is dieting and who is 
exercising. In other single-blind experiments, the subjects are unaware of which 
treatment they are receiving, but the people interacting with them and measuring 
the response variable do know. 


ACTIVITY 
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Pedy: 


CHECK YOUR UNDERSTANDING 


In an interesting experiment, researchers examined the effect of ultrasound on birth 
weight. Pregnant women participating in the study were randomly assigned to one of 
two groups. The first group of women received an ultrasound; the second group did not. 
When the subjects’ babies were born, their birth weights were recorded. The women 
who received the ultrasounds had heavier babies.”® 

1. Did the experimental design take the placebo effect into account? Why is this 
important? 

2. Was the experiment double-blind? Why is this important? 

3. Based on your answers to Questions | and 2, describe an improved design for this 
experiment. 


Inference for Experiments 


In an experiment, researchers usually hope to see a difference in the responses so 
large that it is unlikely to happen just because of chance variation. We can use the 
laws of probability, which describe chance behavior, to learn whether the treat- 
ment effects are larger than we would expect to see if only chance were operating. 
If they are, we call them statistically significant. 


DT 


DEFINITION: Statistically significant 
An observed effect so large that it would rarely occur by chance is called 
statistically significant. 


If we observe statistically significant differences among the groups in a ran- 
domized comparative experiment, we have good evidence that the treatments 
caused these differences. You will often see the phrase “statistically significant” 
in published research reports in many fields. The great advantage of randomized 
comparative experiments is that they can produce data that give good evidence for 
a cause-and-effect relationship between the explanatory and response variables. 
We know that in general a strong association does not imply causation. A statisti- 
cally significant association in data from a well-designed experiment does imply 
causation. 


Distracted driving 


MATERIALS: Set of 48 index 
cards or standard deck of 
playing cards for each pair 
of students 


Is talking on a cell phone while driving more distracting than talking to a pas- 
senger? David Strayer and his colleagues at the University of Utah designed an 
experiment to help answer this question. They used 48 undergraduate students as 
subjects. The researchers randomly assigned half of the subjects to drive in a simu- 
lator while talking on a cell phone, and the other half to drive in the simulator 
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Why did we only ask you to count 
the number of drivers who didn’t 
stop at the rest area in the cell- 
phone group? Suppose you get 10 
in one trial of the simulation. That 
means the other 24 — 10 drivers in 
the cell-phone group did stop at the 
rest area. Also, there must be 

15 — 10 = 5 drivers in the 
passenger group who failed to stop, 
leaving 24 — 5 = 19 drivers in the 
group who did stop. Recording the 
one number tells you all the others. 
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while talking to a passenger. One response variable was whether or not the driver 
stopped at a rest area that was specified by researchers before the simulation start- 
ed. The table below shows the results:”” 


Distraction 


Stopped at rest area? Cell phone Passenger 


Yes 12 21 
No 12 3 


Are these results statistically significant? To find out, let’s see what 
would happen just by chance if we randomly reassign the 48 people in 
this experiment to the two groups many times, assuming the treatment 
received doesn’t affect whether a driver stops at the rest area. 


1. We need 48 cards to represent the drivers in this study. In the original experi- 
ment, 33 drivers stopped at the rest area and 15 didn’t. Because we’re assuming 
that the treatment received won’t change whether each driver stops at the rest 
area, we use 33 cards to represent drivers who stop and 15 cards to represent 
those who don’t. 


© Using index cards: Write “Yes” on 33 cards and “No” on 15 cards. 


¢ Using playing cards: Remove jokers and other specialty cards from the deck, 
as well as the ace of spades and any three of the 2s. All cards with denomina- 
tions 2 through 10 represent drivers who stop. All jacks, queens, kings, and 
aces represent drivers who don’t stop. 


2. Shuffle and deal two piles of 24 cards each—the first pile represents the cell 
phone group and the second pile represents the passenger group. The shuffling 
reflects our assumption that the outcome for each subject is not affected by the 
treatment. Record the number of drivers who failed to stop at the rest area in the 
cell-phone group. 


3. Your teacher will draw and label axes for a class dotplot. Plot the result you 
got in Step 2 on the graph. 


4. Repeat Steps 2 and 3 if needed to get a total of at least 40 repetitions of the 
simulation for your class. 


5. In the original experiment, 12 of the 24 drivers using cell phones didn’t stop 
at the rest area. Based on the class’s simulation results, how surprising would it 
be to get a result this large or larger simply due to the chance involved in the 
random assignment? Is the result statistically significant? 


6. What conclusion would you draw about whether talking on a cell phone is 
more distracting than talking to a passenger? 


Here is an example of what the class dotplot in the Activity might look like after 
100 trials. In the 100 trials, only once did 12 or more people fail to stop when 
using a cell phone. Because a result of 12 or more is unlikely to happen by chance 
alone, the results of this study should be considered statistically significant. 


THINK 
ABOUT IT 
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There was only one trial out of 100 in 
which 12 or more drivers in the cell-phone 


group missed the rest area just by chance. 
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Can an “unlucky” random assignment lead to confounding? 
Let’s return to the distracted-driver Activity. Some people are more forgetful than 
others. Suppose that the random assignment happens to put most of the forgetful 
subjects in one group. If more drivers in that group fail to stop at the rest area, we 
don’t know if it’s because of the treatment they received (cell phone or passenger) 
or their forgetfulness. Is this confounding? 

You might be surprised that the answer is “No!” Although people’s memory is 
a variable that might affect whether or not they stop at the rest area (the response 
variable), the design of the experiment takes care of this by randomly assigning 
subjects to the two treatment groups. The “unlucky” random assignments are taken 
into account in determining statistical significance. In an experiment, confound- 
ing occurs when the design doesn’t account for existing differences in the experi- 
mental units that might systematically affect their response to the treatments. 
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Blocking 


Completely randomized designs are the simplest statistical designs for experiments. 
They illustrate clearly the principles of comparison, random assignment, control, 
and replication. But just as with sampling, there are times when the simplest meth- 
od doesn’t yield the most precise results. When a population consists of groups of in- 
dividuals that are “similar within but different between,” a stratified random sample 
gives a better estimate than a simple random sample. This same logic applies in 
experiments. 


A Smarter Design? 
The idea of blocking 


Suppose that a mobile phone company is considering two different keyboard de- 
signs (A and B) for its new smart phone. The company decides to perform an 
experiment to compare the two keyboards using a group of 10 volunteers. The 
response variable is typing speed, measured in words per minute. 
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How should the company deal with the fact that four of the volunteers already use 
a smart phone, whereas the remaining six volunteers do not? They could use a 
completely randomized design and hope that the random assignment distributes 
the smart-phone users and non-smart-phone users about evenly between the group 
using keyboard A and the group using keyboard B. Even so, there might be a lot of 
variability in typing speed in both groups because some members of each group 
are much more familiar with smart phones than others. This additional variabil- 
ity might make it difficult to detect a difference in the effectiveness of the two 
keyboards. What should the researchers do? 


Because the company knows that experience with smart phones will affect typing 
speed, they could start by separating the volunteers into two groups—one with 
experienced smart-phone users and one with inexperienced smart-phone users. 
Each of these groups of similar subjects is known as a block. Within each block, 
the company could then randomly assign half of the subjects to use keyboard 
A and the other half to use keyboard B. ‘To control other variables, each subject 
should be given the same passage to type while in a quiet room with no distrac- 
tions. This randomized block design helps account for the variation in typing 
speed that is due to experience with smart phones. 


Figure 4.6 outlines the randomized block design for the smart-phone experi- 
ment. The subjects are first separated into blocks based on their experience with 
smart phones. Then the two treatments are randomly assigned within each block. 


Assignment to blocks 
is not random 


Keyboard A 


Ya n=2 
Random Compare 
ve — assignment typing speed 
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Figure 4.6 Outline of a randomized block design for the smart-phone experiment. The blocks 
consist of volunteers who have used smart phones and volunteers who have not used smart 
phones. The treatments are keyboard A and keyboard B. 


DEFINITION: Block and randomized block design 


A block is a group of experimental units that are known before the experiment to be 
similar in some way that is expected to affect the response to the treatments. 


In a randomized block design, the random assignment of experimental units to 
treatments is carried out separately within each block. 
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Using a randomized block design allows us to account for the variation in the 
response that is due to the blocking variable. This makes it easier to determine if 
one treatment is really more effective than the other. 

To see how blocking helps, let’s look at the results of an experiment using 
10 volunteers, 4 who already use a smart phone and 6 who do not. In the block 
of 4 smart-phone users, 2 will be randomly assigned to use keyboard A and the 
other 2 will be assigned to use keyboard B. Likewise, in the block of 6 non- 
smart-phone users, 3 will be randomly assigned to use keyboard A and the other 
3 will be assigned to use keyboard B. Each of the 10 volunteers will type the 
same passage and the typing speed will be recorded. 

Here are the results: 
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There is some evidence that keyboard A results in higher typing speeds, but the evi- 
dence isn’t that convincing. Enough overlap occurs in the two distributions that the 
differences might simply be due to the chance variation in the random assignment. 
If we compare the results for the two keyboards within each block, however, a 
different story emerges. Among the 4 smart-phone users (indicated by the blue 
squares), keyboard A was the clear winner. Likewise, among the 6 non-smart- 
phone users (indicated by the gray dots), keyboard A was also the clear winner. 
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The overlap in the first set of dotplots was due almost entirely to the variation 
in smart-phone experience—smart-phone users were generally faster than non- 
smart-phone users, regardless of which keyboard they used. In fact, the average 
typing speed for the smart-phone users was 40, while the average typing speed for 
non-smart-phone users was only 26, a difference of 14 words per minute. ‘To ac- 
count for the variation created by the difference in smart-phone experience, let’s 
subtract 14 from each of the typing speeds in the block of smart-phone users to 
“even the playing field.” 
Here are the results: 


@=Smart-phoneuser ©=Non-smart-phone user 
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AP® EXAM TIP Don’t mix the Because we accounted for the variation due to the difference in smart-phone 
experience, the variation in each of the distributions has been reduced. There is 
now almost no overlap between the two distributions, meaning that the evidence 
in favor of keyboard A is much more convincing. When blocks are formed wisely, 
it is easier to find convincing evidence that one treatment is more effective than 
another. 

The idea of blocking is an important additional principle of experimental 
design. A wise experimenter will form blocks based on the most important un- 
avoidable sources of variation (other variables) among the experimental units. 
Randomization will then average out the effects of the remaining other variables 
and allow an unbiased comparison of the treatments. The moral of the story is: 
control what you can, block on what you can’t control, and randomize to create 
comparable groups. 


language of experiments and the 
language of sample surveys or 
other observational studies. You 
will lose credit for saying things 


like “use a randomized block 
design to select the sample for 
this survey” or “this experiment 
suffers from nonresponse since 
some subjects dropped out 
during the study.” 


Men, Women, and Advertising 
Blocking in an experiment 


Women and men respond differently to advertising. Researchers would like to 
design an experiment to compare the effectiveness of three advertisements for the 
same product. 


PROBLEM: 


(a) Explain why a randomized block design might be preferable to a completely randomized design for 
this experiment. 


(b) Outline a randomized block design using 300 volunteers (180 men and 120 women) as 
subjects. Describe how you would carry out the random assignment required by your design. 


SOLUTION: 


(a) Acompletely randomized design considers all subjects, both men and women, as a single pool. 
The random assignment would send subjects to three treatment groups without regard to their 
gender. This ignores the differences between men and women, which would probably result in a great 
deal of variability in responses to the advertising in all three groups. For example, if an ad appealed 
much more to men, you would get a wide range of reactions to that ad from the two genders. That 
would make it harder to determine whether one ad was more effective. 


Arandomized block design would consider women and men separately. In this case, the random as- 
signment would occur separately in each block. Blocking will account for the variability in responses 
to advertising due to gender. This will allow researchers to look separately at the reactions of men 
and women, as well as to more effectively assess the overall response to the ads. 


(b) Figure 4.7 outlines the randomized block design. We randomly assign the 120 women into three 
groups of 40, one for each of the advertising treatments. Write the women’s names on 120 identical 
slips of paper, place the slips ina hat, and mix them well. Pull out 40 slips to determine which women 
will view Ad 1. Pull out another 40 slips to determine which women will view Ad 2. The remaining 40 
women will view Ad 3. Randomly assign the 180 men into three groups of 60 using a similar process. 
After each subject has viewed the assigned ad, compare reactions to the three ads within the gender 
blocks. To compare the overall effectiveness of the three ads, combine the results from the two 
blocks after accounting for the difference in response for the men and women. 
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FIGURE 4.7 Randomized block design for comparing responses to three advertisements. The 
blocks consist of male and female subjects. 


For Practice Try Exercise 


Matched Pairs Design: A common type of randomized block design for 
comparing two treatments is a matched pairs design. The idea is to create blocks 
by matching pairs of similar experimental units. Then we can use chance to de- 
cide which member of a pair gets the first treatment. The other subject in that 
pair receives the other treatment. That is, the random assignment of subjects to 
treatments is done within each matched pair. Just as with other forms of blocking, 
matching helps account for the variation among the experimental units. 

Sometimes each “pair” in a matched pairs design consists of just one 
experimental unit that gets both treatments one after the other. In that case, each 
experimental unit serves as its own control. The order of the treatments can influ- 
ence the response, so we randomize the order for each experimental unit. 


ACTIVITY Get your heart beating 


MATERIALS: Are standing pulse rates generally higher than sitting pulse rates? In this Activity, 
Clock or stopwatch you will perform two experiments to try to answer this question. 


1. Completely randomized design For the first experiment, your teacher will 
randomly assign half of the students in your class to stand and the other half to 
sit. Once the two treatment groups have been formed, students should stand or 
sit as required. Then they should measure their pulses for one minute. Have the 
subjects in each group record their data on the board. 


2. Matched pairs design In a matched pairs design, each student should receive 
both treatments in a random order. Because you already sat or stood in Step 1, 
you just need to do the opposite now. As before, everyone should measure their 
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pulses for one minute after completing the treatment (that is, once they are 
standing or sitting). Have all the subjects record their data (both measurements) 
in a chart on the board. 


3. Analyze the data for the completely randomized design. Make parallel 
dotplots and calculate the mean pulse rate for each group. Is there convincing 
evidence that standing pulse rates are higher? Explain. 


4. Analyze the data for the matched pairs design. Because the data are paired 
by student, your first step should be to calculate the difference in pulse rate 
(standing — sitting) for each subject. Make a dotplot of these differences and 
calculate their mean. Is there convincing evidence that standing pulse rates 


are higher? Explain. 


5. What advantage does the matched pairs design have over the completely 
randomized design? 


An AP® Statistics class with 24 students performed the “Get Your Heart Beating” 
Activity. We'll analyze the results of their experiment in the following example. 


Standing and Sitting Pulse Rate 


Design determines analysis 


Pulse study 


Position 


A Fathom dotplot of the pulse rates for their completely 
randomized design is shown. The mean pulse rate for the 
standing group is 74.83; the mean for the sitting group is 
68.33. So the average pulse rate is 6.5 beats per minute 
higher in the standing group. However, the variability in 
pulse rates for the two groups creates a lot of overlap in the 
dotplots. These data don’t provide convincing evidence 
that standing pulse rates tend to be higher. 


What about the class's matched pairs experiment? The 
Fathom dotplot shows their data on the difference in pulse 
rates (standing — sitting). For these 24 students, the mean 
difference was 6.8 beats per minute. In addition, 21 of the 
24 students recorded a positive difference (meaning the 
standing pulse rate was higher). These data provide con- 
vincing evidence that people’s standing pulse rates tend to 
be higher than their sitting pulse rates. 


Let’s take one more look at the two Fathom dotplots in the example. Notice 
that we used the same scale for both graphs. This is to help you visually compare 
the amount of variability in the response variable for each of the two experimen- 
tal designs. Blocking by subject in the matched pairs design greatly reduced the 


DATA EXPLORATION 
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variability in the response variable. That made it easier to detect the fact that 
standing causes an increase in pulse rate. With the large amount of variability in 
the completely randomized design, we were unable to draw such a conclusion. 

Another important lesson to take away from the example is this: the design of 
the study determines the appropriate method of analysis. For the completely ran- 
domized design, it makes sense to compare pulse rates for the two groups with 
parallel dotplots and means. In the matched pairs design, each student is a block. 
We compare the effects of the treatments within each block by examining the 
differences in standing and sitting pulse rates for each student. Then we combine 
the results from each block (student) and examine the distribution of differences. 

The following Data Exploration asks you to apply what you have learned about 
analyzing data from an experiment. 


Nitrogen in tires—a lot of hot air? 


Most automobile tires are inflated with compressed air, which consists of about 
78% nitrogen. Aircraft tires are filled with pure nitrogen, which is safer than air 
in case of fire. Could filling automobile tires with nitrogen improve safety, perfor- 
mance, or both? 

Consumers Union designed a study to test whether nitrogen-filled tires would 
maintain pressure better than air-filled tires. They obtained two tires from each of 
several brands and then filled one tire in each pair with air and one with nitrogen. All 
tires were inflated to a pressure of 30 pounds per square inch and then placed outside 
for a year. At the end of the year, Consumers Union measured the pressure in each 
tire. The amount of pressure lost (in pounds per square inch) during the year for the 
air-filled and nitrogen-filled tires of each brand is shown in the table below.” 


Brand 


BF Goodrich Traction T/A HR 
Bridgestone HP50 (Sears) 
Bridgestone Potenza GO09 
Bridgestone Potenza RE950 
Bridgestone Potenza EL400 
Continental Premier Contact H 
Cooper Lifeliner Touring SLE 
Dayton Daytona HR 

Falken Ziex ZE-512 

Fuzion Hrl 

General Exclaim 

Goodyear Assurance Tripletred 
Hankook Optimo H418 
Kumho Solus KH16 

Michelin Energy MXV4 Plus 
Michelin Pilot XGT H4 


Air Nitrogen Brand Air Nitrogen 
7.6 7.2 Pirelli P6 Four Seasons 4.4 4.2 
3.8 2.5 Sumitomo HTR H4 1.4 2.1 
3.7 1.6 Yokohama Avid H4S 4.3 3.0 
4.7 1 BF Goodrich Traction T/A V 5.5 3.4 
2.1 1.0 Bridgestone Potenza RE950 4.1 2.8 
4.9 3.1 Continental ContiExtreme Contact 5.0 3.4 
5.2 3.5 Continental ContiProContact 48 3.3 
3.4 3.2 Cooper Lifeliner Touring SLE 3.2 2.5 
41 3.3 General Exclaim UHP 6.8 2.7 
2.7 2.2 Hankook Ventus V4 H105 3.1 1.4 
3.1 3.4 Michelin Energy MXV4 Plus 2.5 1.5 
3.8 3.2 Michelin Pilot Exalto A/S 6.6 2.2 
3.0 0.9 Michelin Pilot HX MXM4 2.2 2.0 
6.2 3.4 Pirelli P6 Four Seasons 2.5 2.7 
2.0 1.8 Sumitomo HTR* 4.4 3.7 
1.1 0.7 


Does filling tires with nitrogen instead of compressed air reduce pressure loss? 
Give appropriate graphical and numerical evidence to support your answer. 
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Summary 


e = We can produce data intended to answer specific questions by observational 
studies or experiments. An observational study gathers data on individuals as 
they are. Experiments actively do something to people, animals, or objects in 
order to measure their response. 


e — Statistical studies often try to show that changing one variable (the explana- 
tory variable) causes changes in another variable (the response variable). 
Variables are confounded when their effects on a response variable can’t 
be distinguished from each other. Observational studies and uncontrolled 
experiments often fail to show that changes in an explanatory variable cause 
changes ina response variable because the explanatory variable is confounded 
with other variables. 


e In an experiment, we impose one or more treatments on a group of experi- 
mental units (sometimes called subjects if they are human). Each treatment 
is a combination of values of the explanatory variables (also called factors). 


e The basic principles of experimental design are as follows: 
1. Comparison: Use a design that compares two or more treatments. 


2. Random assignment: Use chance to assign experimental units to treat- 
ments. This helps create roughly equivalent groups before treatments are 
imposed. 


3. Control: Keep as many other variables as possible the same for all groups. 
Control helps avoid confounding and reduces the variation in responses, 
making it easier to decide whether a treatment is effective. 


4. Replication: Impose each treatment on enough experimental units so 
that the effects of the treatments can be distinguished from chance dif- 
ferences between the groups. 


e Ina completely randomized design, all of the experimental units are as- 
signed to the treatments completely by chance. 


e Some experiments give a placebo (fake treatment) to a control group. That 
helps prevent confounding due to the placebo effect, in which some patients 
get better because they expect the treatment to work. 


e Many behavioral and medical experiments are double-blind. That is, nei- 
ther the subjects nor those interacting with them and measuring their re- 
sponses know who is receiving which treatment. If one party knows and the 
other doesn’t, then the experiment is single-blind. 


e When an observed difference in responses between the groups in an experi- 
ment is too large to be explained by chance variation in the random assign- 
ment, we say that the result is statistically significant. 


e A randomized block design forms groups (blocks) of experimental units 
that are similar with respect to a variable that is expected to affect the re- 
sponse. ‘Treatments are assigned at random within each block. Responses are 
then compared within each block and combined with the responses of other 
blocks after accounting for the differences between the blocks. When blocks 
are chosen wisely, it is easier to determine if one treatment is more effective 
than another. 
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e A matched pairs design is a common form of blocking for comparing just 
two treatments. In some matched pairs designs, each subject receives both 
treatments in a random order. In others, two very similar subjects are paired, 
and the two treatments are randomly assigned within each pair. 


Learning biology with computers An educator wants 
to compare the effectiveness of computer software for 
teaching biology with that of a textbook presentation. 
She gives a biology pretest to each of a group of high 
school juniors, then randomly divides them into two 
groups. One group uses the computer, and the other 
studies the text. At the end of the year, she tests all the 
students again and compares the increase in biology 
test scores in the two groups. Is this an observational 
study or an experiment? Justify your answer. 


Cell phones and brain cancer One study of cell 
phones and the risk of brain cancer looked at a 

group of 469 people who have brain cancer. The 
investigators matched each cancer patient with a 
person of the same age, gender, and race who did 
not have brain cancer, then asked about the use of 
cell phones. Result: “Our data suggest that the use of 
handheld cellular phones is not associated with risk 
of brain cancer.”*! Is this an observational study or an 
experiment? Justify your answer. 


Chocolate and happy babies A University of Helsinki 
(Finland) study wanted to determine if chocolate 
consumption during pregnancy had an effect on infant 
temperament at age 6 months. Researchers began by 
asking 305 healthy pregnant women to report their 
chocolate consumption. Six months after birth, the re- 
searchers asked mothers to rate their infants’ tempera- 
ment, including smiling, laughter, and fear. The babies 
born to women who had been eating chocolate daily 
during pregnancy were found to be more active and 
“positively reactive” —a measure that the investigators 
said encompasses traits like smiling and laughter. 


Was this an observational study or an experiment? 
Justify your answer. 

What are the explanatory and response variables? 
Does this study show that eating chocolate regularly 
during pregnancy helps produce infants with good 
temperament? Explain. 

Child care and aggression A study of child care 
enrolled 136+ infants and followed them through 
their sixth year in school. Later, the researchers pub- 
lished an article in which they stated that “the more 
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time children spent in child care from birth to age 
four-and-a-half, the more adults tended to rate them, 
both at age four-and-a-half and at kindergarten, as 
less likely to get along with others, as more assertive, 
as disobedient, and as aggressive.”** 


Is this an observational study or an experiment? 
Justify your answer. 


What are the explanatory and response variables? 


Does this study show that child care causes children 
to be more aggressive? Explain. 


Effects of class size Do smaller classes in elementary 
school really benefit students in areas such as scores 
on standardized tests, staying in school, and going on 
to college? We might do an observational study that 
compares students who happened to be in smaller and 
larger classes in their early school years. Identify a vari- 
able that may lead to confounding with the effects of 
small classes. Explain how confounding might occur. 


Effects of binge drinking A common definition of 
“binge drinking” is 5 or more drinks at one sitting 

for men and 4 or more for women. An observational 
study finds that students who binge drink have lower 
average GPA than those who don’t. Identify a variable 
that may be confounded with the effects of binge 
drinking. Explain how confounding might occur. 


For the experiments described in Exercises 51 to 56, identify 
the experimental units, the explanatory and response vari- 
ables, and the treatments. 


il 


237 
wo 


52. 


Growing in the shade Ability to grow in shade may 
help pines found in the dry forests of Arizona to resist 
drought. How well do these pines grow in shade? Inves- 
tigators planted pine seedlings in a greenhouse in either 
full light, light reduced to 25% of normal by shade 
cloth, or light reduced to 5% of normal. At the end of 
the study, they dried the young trees and weighed them. 


Internet telephone calls You can use Voice over In- 
ternet Protocol (VoIP) to make long-distance calls over 
the Internet. One of the most popular VoIP services 

is Skype. How will the appearance of ads during calls 
affect the use of this service? Researchers design an 
experiment to find out. They recruit 300 people who 
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have not used Skype before to participate. Some people 
get the current version of Skype with no ads. Others 

see ads whenever they make calls. The researchers are 
interested in frequency and length of phone calls. 


Improving response rate How can we reduce the 

rate of refusals in telephone surveys? Most people who 
answer at all listen to the interviewer's introductory re- 
marks and then decide whether to continue. One study 
made telephone calls to randomly selected households 
to ask opinions about the next election. In some calls, 
the interviewer gave her name; in others, she identified 
the university she was representing; and in still others, 
she identified both herself and the university. For each 
type of call, the interviewer either did or did not offer 
to send a copy of the final survey results to the person 
interviewed. Do these differences in the introduction 
affect whether the interview is completed? 


Eat well and exercise Most American adolescents 
don’t eat well and don’t exercise enough. Can 
middle schools increase physical activity among 
their students? Can they persuade students to eat 
better? Investigators designed a “physical activity in- 
tervention” to increase activity in physical education 
classes and during leisure periods throughout the 
school day. They also designed a “nutrition interven- 
tion” that improved school lunches and offered ideas 
for healthy home-packed lunches. Each participating 
school was randomly assigned to one of the interven- 
tions, both interventions, or no intervention. The in- 
vestigators observed physical activity and lunchtime 
consumption of fat. 


Fabric science A maker of fabric for clothing is setting 
up a new line to “finish” the raw fabric. The line will 
use either metal rollers or natural-bristle rollers to raise 
the surface of the fabric; a dyeing-cycle time of either 
30 or 40 minutes; and a temperature of either 150° or 
175°C. An experiment will compare all combinations 
of these choices. Three specimens of fabric will be 
subjected to each treatment and scored for quality. 


Exercise and heart rate A student project measured 
the increase in the heart rates of fellow students 
when they stepped up and down for 3 minutes to 
the beat of a metronome. The step was either 5.75 
or 11.5 inches high and the metronome beat was 

14, 21, or 28 steps per minute. Thirty students took 
part in the experiment. Five of them stepped at each 
combination of height and speed. 


Cocoa and blood flow A study conducted by Nor- 
man Hollenberg, professor of medicine at Brigham 
and Women’s Hospital and Harvard Medical School, 
involved 27 healthy people aged 18 to 72. Each subject 
consumed a cocoa beverage containing 900 milligrams 
of flavonols (a class of flavonoids) daily for 5 days. 
Using a finger cuff, blood flow was measured on the 
first and fifth days of the study. After 5 days, researchers 
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measured what they called “significant improvement” 
in blood flow and the function of the cells that line the 
blood vessels. ** What flaw in the design of this experi- 
ment makes it impossible to say whether the cocoa 
really caused the improved blood flow? Explain. 


Reducing unemployment Will cash bonuses speed 
the return to work of unemployed people? A state 
department of labor notes that last year 68% of people 
who filed claims for unemployment insurance found 
a new job within 15 weeks. As an experiment, this year 
the state offers $500 to people filing unemployment 
claims if they find a job within 15 weeks. The percent 
who do so increases to 77%. What flaw in the design 
of this experiment makes it impossible to say whether 
the bonus really caused the increase? Explain. 


Layoffs and “survivor guilt” Workers who survive 

a layoff of other employees at their location may 
suffer from “survivor guilt.” A study of survivor guilt 
and its effects used as subjects 120 students who 
were offered an opportunity to earn extra course 
credit by doing proofreading. Each subject worked 
in the same cubicle as another student, who was an 
accomplice of the experimenters. Ata break midway 
through the work, one of three things happened: 


‘Treatment I: The accomplice was told to leave; it was 
explained that this was because she performed poorly. 
‘Treatment 2: It was explained that unforeseen cir- 
cumstances meant there was only enough work for 
one person. By “chance,” the accomplice was chosen 


to be laid off. 


Treatment 3: Both students continued to work after 


the break. 

The subjects’ work performance after the break 

was compared with performance before the break.*” 
Describe how you would randomly assign the sub- 
jects to the treatments 

using slips of paper. 

using technology. 

using ‘Table D. 

Effects of TV advertising Figure +.2 (page 239) dis- 
plays the 6 treatments for a two-factor experiment on 
TV advertising. Suppose we have 150 students who 
are willing to serve as subjects. Describe how you 
would randomly assign the subjects to the treatments 
using slips of paper. 

using technology. 

using ‘Table D. 

Stronger players A football coach hears that a new 
exercise program will increase upper-body strength 
better than lifting weights. He is eager to test this new 
program in the off-season with the players on his high 
school team. The coach decides to let his players 
choose which of the two treatments they will undergo 
for 3 weeks—exercise or weight lifting. He will use the 
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number of push-ups a player can do at the end of the 
experiment as the response variable. Which principle 
of experimental design does the coach’s plan violate? 
Explain how this violation could lead to confounding. 


Killing weeds A biologist would like to determine 
which of two brands of weed killer, X or Y, is less 
likely to harm the plants in a garden at the university. 
Before spraying near the plants, the biologist decides 
to conduct an experiment using 24 individual plants. 
Which of the following two plans for randomly assign- 
ing the treatments should the biologist use? Why? 


Plan A: Choose the 12 healthiest-looking plants. Then 
flip a coin. If it lands heads, apply Brand X weed killer 
to these plants and Brand Y weed killer to the remain- 
ing 12 plants. If it lands tails, do the opposite. 

Plan B: Choose 12 of the 24 plants at random. Apply 
Brand X weed killer to those 12 plants and Brand Y 
weed killer to the remaining 12 plants. 


Do diets work? Dr. Linda Stern and her colleagues 
recruited 132 obese adults at the Philadelphia 
Veterans Affairs Medical Center in Pennsylvania. 

Half the participants were randomly assigned to a low- 
carbohydrate diet and the other half to a low-fat diet. 
Researchers measured each participant’s change in 
weight and cholesterol level after six months and again 
after one year. Explain how each of the four principles 
of experimental design was used in this study. 


The effects of day care Does day care help low-in- 
come children stay in school and hold good jobs later 
in life? ‘The Carolina Abecedarian Project (the name 
suggests the ABCs) has followed a group of 111 chil- 
dren since 1972. Back then, these individuals were all 
healthy but low-income black infants in Chapel Hill, 
North Carolina. All the infants received nutritional 
supplements and help from social workers. Half were 
also assigned at random to an intensive preschool 
program. *° Explain how each of the four principles of 
experimental design was used in this study. 


Headache relief Doctors identify “chronic tension- 
type headaches” as headaches that occur almost 
daily for at least six months. Can antidepressant med- 
ications or stress management training reduce the 
number and severity of these headaches? Are both to- 
gether more effective than either alone? Researchers 
want to compare four treatments: antidepressant 
alone, placebo alone, antidepressant plus stress 
management, and placebo plus stress management. 
Describe a completely randomized design involving 
36 headache sufferers who are willing to participate 
in this experiment. Write a few sentences describing 
how you would implement your design. 


More rain for California? ‘The changing climate will 
probably bring more rain to California, but we don’t 
know whether the additional rain will come during 
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the winter wet season or extend into the long dry 
season in spring and summer. Kenwyn Suttle of the 
University of California at Berkeley and his coworkers 
wanted to compare the effects of three treatments: 
added water equal to 20% of annual rainfall either 
during January to March (winter) or during April to 
June (spring), and no added water (control). Eighteen 
plots of open grassland, each with area 70 square 
meters, were available for this study. One response 
variable was total plant biomass, in grams per square 
meter, produced in a plot over a year.” 

Describe a completely randomized design for 
this experiment. Write a few sentences describing 
how you would implement your design. 


Treating prostate disease A large study used 
records from Canada’s national health care system 
to compare the effectiveness of two ways to treat 
prostate disease. ‘The two treatments are traditional 
surgery and a new method that does not require 
surgery. I'he records described many patients whose 
doctors had chosen each method. The study found 
that patients treated by the new method were sig- 
nificantly more likely to die within 8 years. **® 


Further study of the data showed that this conclusion 
was wrong. he extra deaths among patients who got 
the new method could be explained by other vari- 
ables. What other variables might be confounded with 
a doctor’s choice of surgical or nonsurgical treatment? 


You have 300 prostate patients who are willing to 
serve as subjects in an experiment to compare the 
two methods. Describe a completely randomized 
design for this experiment. Write a few sentences 
describing how you would implement your design. 


Getting teachers to come to school Elementary 
schools in rural India are usually small, with a single 
teacher. The teachers often fail to show up for work. 
Here is an idea for improving attendance: give the 
teacher a digital camera with a tamperproof time 
and date stamp and ask a student to take a photo of 
the teacher and class at the beginning and end of the 
day. Offer the teacher better pay for good attendance, 
verified by the photos. Will this work? Researchers 
obtained permission to use 120 rural schools in Raj- 
asthan for an experiment to find out.” 


Explain why it would not be a good idea to offer bet- 
ter pay for good attendance to the teachers in all 120 
schools and then to compare this year’s attendance 
with last year’s. 


Describe a completely randomized design for an 
experiment involving these 120 schools. Write a few 
sentences describing how you would implement 
your design. 

Do placebos really work? Researchers in Japan 
conducted an experiment on 13 individuals who 
were extremely allergic to poison ivy. On one arm, 
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each subject was rubbed with a poison ivy leaf and 
told the leaf was harmless. On the other arm, each 
subject was rubbed with a harmless leaf and told it 
was poison ivy. All the subjects developed a rash on 
the arm where the harmless leaf was rubbed. Of the 
13 subjects, 11 did not have any reaction to the real 
poison ivy leaf.*? Explain how the results of this study 
support the idea of a placebo effect. 


Pain relief study Fizz Laboratories, a pharmaceuti- 
cal company, has developed a new drug for relieving 
chronic pain. Sixty patients suffering from arthritis 
and needing pain relief are available. Each patient 
will be treated and asked an hour later, “About what 
percent of pain relief did you experience?” 


Why should Fizz not simply give the new drug to 30 
patients and no treatment to the other 30 patients, 
and then record the patients’ responses? 


Should the patients be told whether they are getting 
the new drug or a placebo? How would this knowl- 
edge probably affect their reactions? 


Meditation for anxiety An experiment that claimed 
to show that meditation lowers anxiety proceeded as 
follows. ‘The experimenter interviewed the subjects 
and rated their level of anxiety. Then the subjects 
were randomly assigned to two groups. The experi- 
menter taught one group how to meditate and they 
meditated daily for a month. The other group was 
simply told to relax more. At the end of the month, 
the experimenter interviewed all the subjects again 
and rated their anxiety level. The meditation group 
now had less anxiety. Psychologists said that the re- 
sults were suspect because the ratings were not blind. 
Explain what this means and how lack of blindness 
could bias the reported results. 


Testosterone for older men As men age, their 
testosterone levels gradually decrease. ‘This may 
cause a reduction in lean body mass, an increase in 
fat, and other undesirable changes. Do testosterone 
supplements reverse some of these effects? A study 
in the Netherlands assigned 237 men aged 60 to 80 
with low or low-normal testosterone levels to either 
a testosterone supplement or a placebo. The report 
in the Journal of the American Medical Association 
described the study as a “double-blind, randomized, 
placebo-controlled trial.”*! Explain each of these 
terms to someone who knows nothing about 
statistics. 


Do diets work? Refer to Exercise 63. Subjects in the 
low-carb diet group lost significantly more weight than 
subjects in the low-fat diet group during the first six 
months. At the end of a year, however, the average 
weight loss for subjects in the two groups was not 
significantly different.” 

Why did researchers randomly assign the subjects to 
the diet treatments? 
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Explain to someone who knows little statistics what 
“lost significantly more weight” means. 


The subjects in the low-carb diet group lost an average 
of 5.1 kg in a year. The subjects in the low-fat diet 
group lost an average of 3.1 kg. Explain how this infor- 
mation could be consistent with the fact that weight 
loss in the two groups was not significantly different. 


Acupuncture and pregnancy A study sought to deter- 
mine whether the ancient Chinese art of acupuncture 
could help infertile women become pregnant.’ One 
hundred sixty healthy women undergoing assisted 
reproductive therapy were recruited for the study. 
Half of the subjects were randomly assigned to receive 
acupuncture treatment 25 minutes before embryo 
transfer and again 25 minutes after the transfer. The 
remaining 80 subjects were instructed to lie still for 
25 minutes after the embryo transfer. Results: In the 
acupuncture group, 34 women became pregnant. In 
the control group, 21 women became pregnant. 


Why did researchers randomly assign the subjects to 
the two treatments? 


The difference in the percent of women who 
became pregnant in the two groups is statistically 
significant. Explain what this means to someone who 
knows little statistics. 


Explain why the design of the study prevents us from 
concluding that acupuncture caused the difference 
in pregnancy rates. 


Doctors and nurses Nurse-practitioners are nurses 
with advanced qualifications who often act much 
like primary-care physicians. Are they as effective as 
doctors at treating patients with chronic conditions? 
An experiment was conducted with 1316 patients 
who had been diagnosed with asthma, diabetes, or 
high blood pressure. Within each condition, patients 
were randomly assigned to either a doctor or a 
nurse-practitioner. ‘The response variables included 
measures of the patients’ health and of their satisfac- 
tion with their medical care after 6 months.** 


Which are the blocks in this experiment: the differ- 
ent diagnoses (asthma, and so on) or the type of care 
(nurse or doctor)? Why? 


Explain why a randomized block design is preferable 
to a completely randomized design here. 


Comparing cancer treatments ‘The progress of a 
type of cancer differs in women and men. Researchers 
want to design an experiment to compare three thera- 
pies for this cancer. They recruit 500 male and 300 
female patients who are willing to serve as subjects. 


Which are the blocks in this experiment: the cancer 
therapies or the two sexes? Why? 
What are the advantages of a randomized block 


design over a completely randomized design using 
these 800 subjects? 
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Suppose the researchers had 800 male and no fe- 
male subjects available for the study. What advantage 
would this offer? What disadvantage? 


In the cornfield An agriculture researcher wants to 
compare the yield of 5 corn varieties: A, B, C, D, and 
E. The field in which the experiment will be carried 
out increases in fertility from north to south. The 
researcher therefore divides the field into 25 plots 

of equal size, arranged in 5 east-west rows of 5 plots 
each, as shown in the diagram. 


North 


Explain why a randomized block design would be bet 
ter than a completely randomized design in this setting. 
Should the researcher use the rows or the columns of 
the field as blocks? Justify your answer. 

Use technology or Table D to carry out the random 
assignment required by your design. Explain your 
method clearly. 

Comparing weight-loss treatments ‘Twenty over- 
weight females have agreed to participate in a study 
of the effectiveness of four weight-loss treatments: 

A, B, C, and D. The researcher first calculates how 
overweight each subject is by comparing the subject’s 
actual weight with her “ideal” weight. The subjects 
and their excess weights in pounds are as follows: 


Birnbaum 35 Hernandez 25 Moses 25 Smith 29 
Brown 34 Jackson 33 Nevesky 39 Stall oe 
Brunk 30 Kendall 28 Obrach 30 Tran 35 
Cruz 34 Loren 32 Rodriguez 30 Wilansky 42 
Deng 24 Mann 28 Santiago 27 Williams 22 


The response variable is the weight lost after 8 weeks 
of treatment. Previous studies have shown that the 
effects of a diet may vary based on a subject’s initial 
weight. 

Explain why a randomized block design would be bet 
ter than a completely randomized design in this setting. 
Should researchers form blocks of size 4 based on 
subjects’ last names in alphabetical order or by how 
overweight the subjects are? Explain. 

Use technology or Table D to carry out the random 
assignment required by your design. Explain your 
method clearly. 

Aw, rats! A nutrition experimenter intends to com- 
pare the weight gain of newly weaned male rats fed 
Diet A with that of rats fed Diet B. To do this, she 
will feed each diet to 10 rats. She has available 10 rats 
from one litter and 10 rats from a second litter. Rats 
in the first litter appear to be slightly healthier. 
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If the 10 rats from Litter 1 were fed Diet A, the effects of 
genetics and diet would be confounded, and the experi- 
ment would be biased. Explain this statement carefully. 
Describe a better design for this experiment. 
Technology for teaching statistics The Brigham 
Young University (BYU) statistics department is per- 
forming experiments to compare teaching methods. 
Response variables include students’ final-exam 
scores and a measure of their attitude toward statis- 
tics. One study compares two levels of technology 

for large lectures: standard (overhead projectors 

and chalk) and multimedia. There are eight lecture 
sections of a basic statistics course at BYU, each with 
about 200 students. ‘There are four instructors, each 
of whom teaches two sections.” Suppose the sections 
and lecturers are as follows: 


Section Lecturer Section Lecturer 
1 Hilton 0) Tolley 
2 Christensen 6 Hilton 
3 Hadfield Uf Tolley 
4 Hadfield 8 Christensen 


Suppose we randomly assign two lecturers to use 
standard technology in their sections and the other 
two lecturers to use multimedia technology. Explain 
how this could lead to confounding. 

Describe a better design for this experiment. 

Look, Ma, no hands! Does talking on a hands-free cell 
phone distract drivers? Researchers recruit 40 student 
subjects for an experiment to investigate this question. 
They have a driving simulator equipped with a hands- 
free phone for use in the study. Each subject will com- 
plete two sessions in the simulator: one while talking on 
the hands-free phone and the other while just driving. 
The order of the two sessions for each subject will be 
determined at random. The route, driving conditions, 
and traffic flow will be the same in both sessions. 

What type of design did the researchers use in their study? 
Explain why the researchers chose this design instead 
of a completely randomized design. 

Why is it important to randomly assign the order of 
the treatments? 

Explain how and why researchers controlled for 
other variables in this experiment. 

Chocolate gets my heart pumping Cardiologists at 
Athens Medical School in Greece wanted to test wheth- 
er chocolate affected blood flow in the blood vessels. 
The researchers recruited 17 healthy young volunteers, 
who were each given a 3.5-ounce bar of dark chocolate, 
either bittersweet or fake chocolate. On another day, 
the volunteers received the other treatment. The order 
in which subjects received the bittersweet and fake 
chocolate was determined at random. The subjects 
had no chocolate outside the study, and investigators 
didn’t know whether a subject had eaten the real or 
the fake chocolate. An ultrasound was taken of each 
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volunteer’s upper arm to see the functioning of the 
cells in the walls of the main artery. ‘The researchers 
found that blood vessel function was improved when 
the subjects ate bittersweet chocolate, and that there 
were no such changes when they ate the placebo 
(fake chocolate).* 

What type of design did the researchers use in their 
study? 

Explain why the researchers chose this design instead 
of a completely randomized design. 

Why is it important to randomly assign the order of 
the treatments for the subjects? 

Explain how and why researchers controlled for 
other variables in this experiment. 

Room temperature and dexterity An expert on 
worker performance is interested in the effect of room 
temperature on the performance of tasks requiring 
manual dexterity. She chooses temperatures of 70°F 
and 90°F as treatments. ‘The response variable is the 
number of correct insertions, during a 30-minute pe- 
riod, in a peg-and-hole apparatus that requires the use 
of both hands simultaneously. Each subject is trained 
on the apparatus and then asked to make as many in- 
sertions as possible in 30 minutes of continuous effort. 
Describe a completely randomized design to compare 
dexterity at 70° and 90° using 20 volunteer subjects. 
Because individuals differ greatly in dexterity, the 
wide variation in individual scores may hide the sys- 
tematic effect of temperature unless there are many 
subjects in each group. Describe in detail the design 
of a matched pairs experiment in which each subject 
serves as his or her own control. 

Carbon dioxide and tree growth ‘The concentration 
of carbon dioxide (CO2) in the atmosphere is increas- 
ing rapidly due to our use of fossil fuels. Because 
plants use CO; to fuel photosynthesis, more CO may 
cause trees and other plants to grow faster. An elabo- 
rate apparatus allows researchers to pipe extra CO? to 
a 30-meter circle of forest. We want to compare the 
growth in base area of trees in treated and untreated 
areas to see if extra CO? does in fact increase growth. 
We can afford to treat three circular areas.” 

Describe the design of a completely randomized 
experiment using six well-separated 30-meter circular 
areas in a pine forest. Sketch the circles and carry out 
the randomization your design calls for. 

Areas within the forest may differ in soil fertility. 
Describe a matched pairs design using three pairs of 
circles that will account for the extra variation due to 
different fertility. Sketch the circles and carry out the 
randomization your design calls for. 

Got deodorant? A group of students wants to 
perform an experiment to determine whether Brand 
A or Brand B deodorant lasts longer. One group 
member suggests the following design: Recruit 
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40 student volunteers — 20 male and 20 female. 
Separate by gender, because male and female bodies 
might respond differently to deodorant. Give all the 
males Brand A deodorant and all the females Brand 
B. Have each student rate how well the deodorant is 
still working at the end of the school day on a 0 to 10 
scale. ‘Then compare ratings for the two treatments. 
Identify any flaws you see in the proposed design for 
this experiment. 

Describe how you would design the experiment. 
Explain how your design addresses each of the prob- 
lems you identified in part (a). 

Close shave Which of two brands (X or Y) of electric 
razor shaves closer? Researchers want to design and 
carry out an experiment to answer this question using 
50 adult male volunteers. Here’s one idea: Have all 50 
subjects shave the left sides of their faces with the Brand 
X razor and shave the right sides of their faces with the 
Brand Y razor. Then have each man decide which 
razor gave the closer shave and compile the results. 
Identify any flaws you see in the proposed design for 
this experiment. 

Describe how you would design the experiment. 
Explain how your design addresses each of the prob- 
lems you identified in part (a). 


Multiple choice: Select the best answer for Exercises 87 
to 94. 


87. 


Can changing diet reduce high blood pressure? Veg- 
etarian diets and low-salt diets are both promising. 
Men with high blood pressure are assigned at ran- 
dom to four diets: (1) normal diet with unrestricted 
salt; (2) vegetarian with unrestricted salt; (3) normal 
with restricted salt; and (4) vegetarian with restricted 
salt. This experiment has 

one factor, the type of diet. 

two factors, high blood pressure and type of diet. 

two factors, normal/vegetarian diet and unrestricted/ 
restricted salt. 

three factors, men, high blood pressure, and type of diet. 
four factors, the four diets being compared. 

In the experiment of the previous exercise, the 
subjects were randomly assigned to the different 
treatments. What is the most important reason for 
this random assignment? 

Random assignment eliminates the effects of other 
variables such as stress and body weight. 

Random assignment is a good way to create groups of 
subjects that are roughly equivalent at the beginning 
of the experiment. 

Random assignment makes it possible to make a 
conclusion about all men. 

Random assignment reduces the amount of variation 
in blood pressure. 


(e) 


89. 


SIE 


Random assignment prevents the placebo effect from 
ruining the results of the study. 


To investigate whether standing up while studying 
affects performance in an algebra class, a teacher 
assigns half of the 30 students in his class to stand up 
while studying and assigns the other half to not stand 
up while studying. ‘To determine who receives which 
treatment, the teacher identifies the two students 
who did best on the last exam and randomly assigns 
one to stand and one to not stand. The teacher does 
the same for the next two highest-scoring students 
and continues in this manner until each student is 
assigned a treatment. Which of the following best 
describes this plan? 


This is an observational study. 

This is an experiment with blocking. 

This is a completely randomized experiment. 
This is a stratified random sample. 


This is a cluster sample. 


A gardener wants to try different combinations of fer- 
tilizer (none, | cup, 2 cups) and mulch (none, wood 
chips, pine needles, plastic) to determine which 
combination produces the highest yield for a variety 
of green beans. He has 60 green-bean plants to use 
in the experiment. If he wants an equal number of 
plants to be assigned to each treatment, how many 
plants will be assigned to each treatment? 


lb 3) ee yes eye 


Corn variety | yielded 140 bushels per acre last year 
at a research farm. This year, corn variety 2, planted 
in the same location, yielded only 110 bushels 

per acre. Based on these results, is it reasonable to 
conclude that corn variety | is more productive than 
corn variety 2? 

Yes, because 140 bushels per acre is greater than 110 
bushels per acre. 

Yes, because the study was done at a research farm. 


No, because there may be other differences between 
the two years besides the corn variety. 


No, because there was no use of a placebo in the 
experiment. 


No, because the experiment wasn’t double-blind. 


A report in a medical journal notes that the risk of 
developing Alzheimer’s disease among subjects who 
regularly opted to take the drug ibuprofen was about 
half the risk among those who did not. Is this good 
evidence that ibuprofen is effective in preventing 
Alzheimer’s disease? 


Yes, because the study was a randomized, compara- 
tive experiment. 

No, because the effect of ibuprofen is confounded 
with the placebo effect. 
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Yes, because the results were published in a repu- 
table professional journal. 

No, because this is an observational study. An experi- 
ment would be needed to confirm (or not confirm) 
the observed effect. 

Yes, because a 50% reduction can’t happen just by 
chance. 

A farmer is conducting an experiment to determine 
which variety of apple tree, Fuji or Gala, will 
produce more fruit in his orchard. The orchard is 
divided into 20 equally sized square plots. He has 
10 trees of each variety and randomly assigns each 
tree to a separate plot in the orchard. What are the 
experimental unit(s) in this study? 
The trees (c) The apples 
The plots (d) ‘The farmer 


‘Two essential features of all statistically designed 
experiments are 


(e) The orchard 


compare several treatments; use the double-blind 
method. 

compare several treatments; use chance to assign 
subjects to treatments. 

always have a placebo group; use the double-blind 
method. 

use a block design; use chance to assign subjects to 
treatments. 

use enough subjects; always have a control group. 
Seed weights (2.2) Biological measurements on 
the same species often follow a Normal distribu- 
tion quite closely. The weights of seeds of a variety 
of winged bean are approximately Normal with 


mean 525 milligrams (mg) and standard deviation 
110 mg. 


What percent of seeds weigh more than 500 mg? 
Show your method. 

If we discard the lightest 10% of these seeds, what 

is the smallest weight among the remaining seeds? 
Show your method. 

Twins (1.3, 3.1) A researcher studied a group of 
identical twins who had been separated and adopted 
at birth. In each case, one twin (‘I'win A) was adopted 
by a low-income family and the other (‘Twin B) by a 
high-income family. Both twins were given an IQ test 
as adults. Here are their scores:*® 


TwinA: 120 99 99 94111 97 99 94 104 114 113 100 
TwinB: 128 104 108 100 116 105 100 100 103 124 114 112 
(a) How well does one twin’s IQ predict the other’s? 


(b) 


Give appropriate evidence to support your answer. 
Do identical twins living in low-income homes 
tend to have lower IQs later in life than their twins 
who live in high-income homes? Give appropriate 
evidence to support your answer. 
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Using Studies Wisely 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Describe the scope of inference that is appropriate e Evaluate whether a statistical study has been carried 
in a statistical study. out in an ethical manner.* 


Researchers who conduct statistical studies often want to draw conclusions (make 
inferences) that go beyond the data they produce. Here are two examples. 


e The U.S. Census Bureau carries out a monthly Current Population Survey 
of about 60,000 households. Their goal is to use data from these randomly 
selected households to estimate the percent of unemployed individuals in the 
population. 


e Scientists performed an experiment that randomly assigned 21 volunteer sub- 
jects to one of two treatments: sleep deprivation for one night or unrestricted 
sleep. The experimenters hoped to show that sleep deprivation causes a de- 
crease in performance two days later.” 


What type of inference can be made from a particular study? The answer depends 
on the design of the study. 


Scope of Inference 


In the Census Bureau’s sample survey, the individuals who responded were cho- 
sen at random from the population of interest. Random sampling avoids bias and 
produces trustworthy estimates of the truth about the population. The Census 
Bureau should be safe making an inference about the population based on the 
results of the sample. 

In the sleep deprivation experiment, subjects were randomly assigned to the 
sleep deprivation and unrestricted sleep treatments. Random assignment helps 
ensure that the two groups of subjects are as alike as possible before the treat- 
ments are imposed. If the unrestricted sleep group performs much better than 
the sleep deprivation group, and the difference is too large to be explained by 
chance variation in the random assignment, it must be due to the treatments. 
In that case, the scientists could safely conclude that sleep deprivation caused 
the decrease in performance. That is, they can make an inference about cause 
and effect. However, because the experiment used volunteer subjects, this lim- 
its scientists’ ability to generalize their findings to some larger population of 
individuals. 

Let’s recap what we’ve learned about the scope of inference ina statistical study. 
Random selection of individuals allows inference about the population. Random 
assignment of individuals to groups permits inference about cause and effect. The 
following chart summarizes the possibilities.” 


*This is an important topic, but it is not required for the AP® Statistics exam. 


Section 4.3 Using Studies Wisely \, 267 


Both random sampling Were individuals randomly assigned to groups? 

and random assignment Yes No 

introduce chance variation ' . 

into a statistical study. Inference about the population: YES Inference about the population: YES 


When performing inference, Inference about cause and effect: YES Inference about cause and effect: NO 


sane Were individuals 
statisticians use the laws of 


randomly selected? 


probability to describe this Inference about the population: NO Inference about the population: NO 
chance variation. You'll learn Inference about cause and effect: YES _—_ Inference about cause and effect: NO 
how this works later in the 

book. 


Well-designed experiments randomly assign individuals to treatment groups. 
However, most experiments don’t select experimental units at random from the 
larger population. That limits such experiments to inference about cause and 
effect. Observational studies don’t randomly assign individuals to groups, which 
tules out inference about cause and effect. An observational study that uses 
random sampling can make an inference about the population. The following 
example illustrates all four cases from the table above in a single setting. 


Vitamin C and Canker Sores 


Determining scope of inference 


A small-town dentist wants to know if a daily dose of 500 milligrams (mg) of vita- 
min C will result in fewer canker sores in the mouth than taking no vitamin C.! 


The dentist is considering the following four study designs: 


Design 1: Get all dental patients in town with appointments in the next two weeks 
to take part in a study. Give each patient a survey with two questions: (1) Do you 
take at least 500 mg of vitamin C each day? (2) Do you frequently have canker 
sores? Based on patients’ answers to Question 1, divide them into two groups: 
those who take at least 500 mg of vitamin C daily and those who don’t. 

Design 2: Get all dental patients in town with appointments in the next two weeks 
to take part in a study. Randomly assign half of them to take 500 mg of vitamin 
C each day and the other half to abstain from taking vitamin C for three months. 

Design 3: Select a random sample of dental patients in town and get them to take 
part in a study. Divide the patients into two groups as in Design 1. 

Design 4: Select a random sample of dental patients in town and get them to take 
part in a study. Randomly assign half of them to take 500 mg of vitamin C each 
day and the other half to abstain from taking vitamin C for three months. 


For whichever design the dentist chooses, suppose she compares the proportion 
of patients in each group who complain of canker sores. Also suppose that she 
finds a statistically significant difference, with a smaller proportion of those tak- 
ing vitamin C having canker sores. 


PROBLEM: What can the dentist conclude for each design? 
SOLUTION: 


Design 1: Because the patients were not randomly selected, the dentist cannot infer that this 
result holds for a larger population of dental patients. This was an observational study because 
no treatments were deliberately imposed on the patients. With no random assignment to the two 
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groups, no inference about cause and effect can be made. The dentist just knows that for these 
patients, those who took vitamin C had fewer canker sores than those who didn't. 


Design 2: Asin Design 1, the dentist can't make any inference about this result holding for a larger 
population of dental patients. However, the treatments were randomly assigned to the subjects. 
Assuming proper control in the experiment, she can conclude that taking vitamin C reduced the 
chance of getting canker sores in her subjects. 


Design 3: Because the patients were randomly selected from the population of dental patients in the 
town, the dentist can generalize the results of this study to the population. Because this was an obser- 
vational study, no inference about cause and effect can be made. The dentist would conclude that for the 
population of dental patients in this town, those taking vitamin C have fewer canker sores than those who 
don't. She can’t say whether the vitamin C causes this reduction or some other confounding variable. 


Design 4: Asin Design 3, the random sampling allows the dentist to generalize the results of this study 
to the population of dental patients in the town. As in Design 2, the random assignment would allow 
her to conclude (assuming proper control in the experiment) that taking vitamin C reduced the chance 
of getting canker sores. So the dentist would conclude that for the population of dental patients in 

this town, those taking vitamin C will tend to have fewer canker sores than those who don’t due to the 
vitamin C. 


For Practice Try Exercise 


The Challenges of Establishing Causation 


A well-designed experiment tells us that changes in the explanatory variable cause 
changes in the response variable. More precisely, it tells us that this happened 
for specific individuals in the specific environment of this specific experiment. 
A serious threat is that the treatments, the subjects, or the environment of our 
experiment may not be realistic. Lack of realism can limit our ability to apply the 
conclusions of an experiment to the settings of greatest interest. 


Do Center Brake Lights Reduce 


Rear-End Crashes? 


Lack of realism 


Do those high center brake lights, required on all cars sold in the United States 
since 1986, really reduce rear-end collisions? Randomized comparative experi- 
ments with fleets of rental and business cars, done before the lights were required, 
showed that the third brake light reduced rear-end collisions by as much as 50%. 
But requiring the third light in all cars led to only a 5% drop. 


What happened? Most cars did not have the extra brake light when the experi- 
ments were carried out, so it caught the eye of following drivers. Now that almost 
all cars have the third light, they no longer capture attention. 


In some cases, it isn’t practical or even ethical to do an experiment. Consider 
these important questions: 
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¢ Does texting while driving increase the risk of having an accident? 
¢ Does going to church regularly help people live longer? 
¢ Does smoking cause lung cancer? 


To answer these cause-and-effect questions, we just need to perform a randomized 
comparative experiment. Unfortunately, we can’t randomly assign people to text 
while driving or to attend church or to smoke cigarettes. The best data we have about 
these and many other cause-and-effect questions come from observational studies. 

It is sometimes possible to build a strong case for causation in the absence of 
experiments. The evidence that smoking causes lung cancer is about as strong as 
nonexperimental evidence can be. 


Does Smoking Cause Lung Cancer? 
Living with observational studies 


Doctors had long observed that most lung cancer patients were smokers. Com- 
parison of smokers and similar nonsmokers showed a very strong association 
between smoking and death from lung cancer. Could the association be due to 
some other variable? Is there some genetic factor that makes people both more 
likely to get addicted to nicotine and to develop lung cancer? If so, then smoking 
and lung cancer would be strongly associated even if smoking had no direct ef- 
fect on the lungs. Or maybe confounding is to blame. It might be that smokers 
live unhealthy lives in other ways (diet, alcohol, lack of exercise) and that some 
other habit confounded with smoking is a cause of lung cancer. How were these 
objections overcome? 


Whatare the criteria for establishing causation when we can’t do an experiment? 


e The association is strong. The association between smoking and lung cancer 
is very strong. 


¢ The association is consistent. Many studies of different kinds of people in many 
countries link smoking to lung cancer. That reduces the chance that some 
other variable specific to one group or one study explains the association. 


¢ Larger values of the explanatory variable are associated with stronger responses. 
People who smoke more cigarettes per day or who smoke over a longer period 
get lung cancer more often. People who stop smoking reduce their risk. 


¢ The alleged cause precedes the effect in time. Lung cancer develops after years 
of smoking. The number of men dying of lung cancer rose as smoking became 
more common, with a lag of about 30 years. Lung cancer kills more men than 
any other form of cancer. Lung cancer was rare among women until women 
began to smoke. Lung cancer in women rose along with smoking, again with 
a lag of about 30 years, and has now passed breast cancer as the leading cause 
of cancer death among women. 


¢ The alleged cause is plausible. Experiments with animals show that tars from 
cigarette smoke do cause cancer. 


Medical authorities do not hesitate to say that smoking causes lung cancer. 
The U.S. Surgeon General states that cigarette smoking is “the largest avoidable 
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cause of death and disability in the United States.”** The evidence for causation 
is overwhelming —but it is not as strong as the evidence provided by well-designed 
experiments. Conducting an experiment in which some subjects were forced to 
smoke and others were not allowed to would be unethical. In cases like this, ob- 
servational studies are our best source of reliable information. 


Data Ethics 


Medical professionals are taught to follow the basic principle “First, do no harm.” 
Shouldn’t those who carry out statistical studies follow the same principle? Most 
reasonable people think so. But this may not always be as simple as it sounds. De- 
cide whether you think each of the following studies is ethical or unethical: 


e A promising new drug has been developed for treating cancer in humans. 
Before giving the drug to human subjects, researchers want to administer the 
drug to animals to see if there are any potentially serious side effects. 


e Are companies discriminating against some individuals in the hiring process? 
To find out, researchers prepare several equivalent résumés for fictitious job 
applicants, with the only difference being the gender of the applicant. They 
send the fake résumés to companies advertising positions and keep track of the 
number of males and females who are contacted for interviews. 


e Will people try to stop someone from driving drunk? A television news pro- 
gram hires an actor to play a drunk driver and uses a hidden camera to record 
the behavior of individuals who encounter the driver. 


The most complex issues of data ethics arise when we collect data from people. 
The ethical difficulties are more severe for experiments that impose some treatment 
on people than for sample surveys that simply gather information. ‘Trials of new 
medical treatments, for example, can do harm as well as good to their subjects. Here 
are some basic standards of data ethics that must be obeyed by all studies that gather 
data from human subjects, both observational studies and experiments. 


BASIC DATA ETHICS 


All planned studies must be reviewed in advance by an institutional review 
board charged with protecting the safety and well-being of the subjects. 


All individuals who are subjects in a study must give their informed consent 
before data are collected. 


All individual data must be kept confidential. Only statistical summaries for 
groups of subjects may be made public. 


The law requires that studies carried out or funded by the federal government 
obey these principles.** But neither the law nor the consensus of experts is com- 
pletely clear about the details of their application. 


Institutional review boards The purpose of an institutional review board 
is not to decide whether a proposed study will produce valuable information or 
whether it is statistically sound. The board’s purpose is, in the words of one uni- 


*This is an important topic, but it is not required for the AP® Statistics exam. 
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versity’s board, “to protect the rights and welfare of human subjects (including 
patients) recruited to participate in research activities.” The board reviews the 
plan of the study and can require changes. It reviews the consent form to be sure 
that subjects are informed about the nature of the study and about any potential 
risks. Once research begins, the board monitors its progress at least once a year. 


Informed consent Both words in the phrase “informed consent” are impor- 
tant, and both can be controversial. Subjects must be informed in advance about 
the nature of a study and any risk of harm it may bring. In the case of a sample 
survey, physical harm is not possible. But a survey on sensitive issues could result 
in emotional harm. The participants should be told what kinds of questions the 
survey will ask and about how much of their time it will take. Experimenters must 
tell subjects the nature and purpose of the study and outline possible risks. Sub- 
jects must then consent in writing. 


Confidentiality Ethical problems do not disappear once a study has been 
cleared by the review board, has obtained consent from its participants, and has 
actually collected data about them. It is important to protect individuals’ privacy 
by keeping all data about them confidential. The report of an opinion poll may 
say what percent of the 1200 respondents believed that legal immigration should 
be reduced. It may not report what you said about this or any other issue. 

Confidentiality is not the same as anonymity. Anonymity 
means that individuals are anonymous—their names are not 
known even to the director of the study. Anonymity is rare in 
statistical studies. Even where anonymity is possible (mainly 
in surveys conducted by mail), it prevents any follow-up to 
improve nonresponse or inform individuals of results. 

Any breach of confidentiality is a serious violation of 
data ethics. The best practice is to separate the identity of 
the study’s participants from the rest of the data at once. 
A clever computer search of several data bases might be 
able, by combining information, to identify you and learn a 
great deal about you even if your name and other identifica- 
tion have been removed from the data available for search. 
Privacy and confidentiality of data are hot issues among stat- 
isticians in the computer age. 


“| realize the participants in this study are to be 
anonymous, but you're going to have to expose your eyes.” 


ACTIVITY | Response bias 


In this Activity, your team will design and conduct an experiment to investigate 
the effects of response bias in surveys.” You may choose the topic for your sur- 
veys, but you must design your experiment so that it can answer at least one of the 
following questions: 


e Can the wording of a question create response bias? 
e Do the characteristics of the interviewer create response bias? 
¢ Does anonymity change the responses to sensitive questions? 


¢ Does manipulating the answer choices change the response? 
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1. Write a proposal describing the design of your experiment. Be sure to include 
(a) your chosen topic and which of the above questions you'll try to answer. 
(b) a detailed description of how you will obtain your subjects (minimum 
of 50). Your plan must be practical! 

(c) what your questions will be and how they will be asked. 
(d) a clear explanation of how you will implement your design. 
(e) precautions you will take to collect data ethically. 


Here are two examples of successful student experiments: 


“Make-Up,” by Caryn S. and Trisha T. (all questions asked to males) 
i. “Do you find females who wear makeup attractive?” (questioner wearing 
makeup: 75% answered yes) 
ii. “Do you find females who wear makeup attractive?” (questioner not 
wearing makeup: 30% answered yes) 


“Cartoons” by Sean W. and Brian H. 
i. “Do you watch cartoons?” (90% answered yes) 
ii. “Do you still watch cartoons?” (60% answered yes) 


2. Once your teacher has approved your design, carry out the experiment. 
Record your data in a table. 


3. Analyze your data. What conclusion do you draw? Provide appropriate 
graphical and numerical evidence to support your answer. 


4. Prepare a report that includes the data you collected, your analysis from 
Step 3, and a discussion of any problems you encountered and how you dealt 
with them. 


=> 


Can Magnets Help Reduce Pain? 


Re-read the chapter-opening Case Study on page 207. Then use what 
you have learned in this chapter to help answer the following questions. 


1. Why is the magnet study an experiment, not an observational 
study? 

2. What type of design was used in this experiment? Identify the 
experimental units, the treatments, and the response variable. 

3. There were two distinct purposes for having the doctors select a 
sealed envelope at random from the box. Describe them both. 
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4. The dotplot shows the improvement in pain ratings for both 
groups. Write a few sentences comparing the two distributions. 


2 < 6 8 
Improvement in pain rating 


5. The mean difference in pain ratings was 5.24 for the active- 
magnet group and 1.10 for the inactive-magnet group. This differ- 
ence is statistically significant. What conclusion should we draw? 


oF. 
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Summary 


¢ Most statistical studies aim to make inferences that go beyond the data actu- 
ally produced. Inference about a population requires that the individuals 
taking part in a study be randomly selected from the population. A well- 
designed experiment that randomly assigns experimental units to treatments 
allows inference about cause and effect. 


e Lack of realism in an experiment can prevent us from generalizing its results. 


e Inthe absence of an experiment, good evidence of causation requires a strong 
association that appears consistently in many studies, a clear explanation for 
the alleged causal link, and careful examination of other variables. 


¢ Studies involving humans must be screened in advance by an institutional re- 
view board. All participants must give their informed consent before taking part. 
Any information about the individuals in the study must be kept confidential. 


Exercises 


Random sampling versus random assignment Ex- Early Intervention Project found that the answer is a 
plain the difference between the types of inference clear “Yes.” The subjects were 136 young children 
that can be made as a result of random sampling and abandoned at birth and living in orphanages in 
random assignment. Bucharest, Romania. Half of the children, chosen at 
Observation versus experimentation Explain the random, were placed in foster homes. The other half 
difference between the types of inference than can remained in the orphanages.” (Foster care was not 
usually be made from an observational study and an easily available in Romania at the time and so was 
experiment. paid for by the study.) What conclusion can we draw 


Foster care versus orphanages Do abandoned chil- from this study? Explain. 


dren placed in foster homes do better than similar 100. Frozen batteries Will storing batteries in a freezer 
children placed in an institution? ‘The Bucharest make them last longer? To find out, a company that 


274 


102. 


103. 


104. 


CHAPTER 4 DESIGNING STUDIES 


produces batteries takes a random sample of 100 AA 
batteries from its warehouse. ‘The company statistician 
randomly assigns 50 batteries to be stored in the freezer 
and the other 50 to be stored at room temperature for 

3 years. At the end of that time period, each battery’s 
charge is tested. Result: Batteries stored in the freezer 
had a higher average charge, and the difference 
between the groups was statistically significant. What 
conclusion can we draw from this study? Explain. 
Who talks more—women or men? According to 
Louann Brizendine, author of The Female Brain, 
women say nearly three times as many words per day 
as men. Skeptical researchers devised a study to test 
this claim. They used electronic devices to record 
the talking patterns of 396 university students who 
volunteered to participate in the study. The device 
was programmed to record 30 seconds of sound every 
12.5 minutes without the carrier’s knowledge. Ac- 
cording to a published report of the study in Scientific 
American, “Men showed a slightly wider variability 
in words uttered.... But in the end, the sexes came 
out just about even in the daily averages: women at 
16,215 words and men at 15,669.””° This difference 
was not statistically significant. What conclusion can 
we draw from this study? Explain. 

Attend church, live longer? One of the better studies 
of the effect of regular attendance at religious services 
gathered data from a random sample of 3617 adults. 
‘The researchers then measured lots of variables, not 
just the explanatory variable (religious activities) and 
the response variable (length of life). A news article 
said: “Churchgoers were more likely to be nonsmok- 
ers, physically active, and at their right weight. But 
even after health behaviors were taken into account, 
those not attending religious services regularly still 
were about 25% more likely to have died.”*” What 
conclusion can we draw from this study? Explain. 
Daytime running lights Canada and the European 
Union require that cars be equipped with “daytime 
running lights,” headlights that automatically come 
on ata low level when the car is started. Many manu- 
facturers are now equipping cars sold in the United 
States with running lights. Will running lights reduce 
accidents by making cars more visible? An experiment 
conducted in a driving simulator suggests that the 
answer may be “Yes.” What concerns would you have 
about generalizing the results of such an experiment? 
Studying frustration A psychologist wants to study 
the effects of failure and frustration on the relation- 
ships among members of a work team. She forms a 
team of students, brings them to the psychology lab, 
and has them play a game that requires teamwork. 
The game is rigged so that they lose regularly. 

The psychologist observes the students through a 
one-way window and notes the changes in their 
behavior during an evening of game playing. Can 
the psychologist generalize the results of her study to 


a team of employees that spends months developing 
a new product that never works right and is finally 
abandoned by their company? Explain. 

105.*Minimal risk? You have been invited to serve on 

a college’s institutional review board. You must 

decide whether several research proposals qualify 

for lighter review because they involve only minimal 

risk to subjects. Federal regulations say that “minimal 

risk” means the risks are no greater than “those 

ordinarily encountered in daily life or during the 

performance of routine physical or psychological 

examinations or tests.” ‘That’s vague. Which of these 

do you think qualifies as “minimal risk”? 

Draw a drop of blood by pricking a finger to mea- 

sure blood sugar. 

(b) Draw blood from the arm for a full set of blood tests. 

(c) Insert a tube that remains in the arm, so that blood 
can be drawn regularly. 

106.*Who reviews? Government regulations require that 
institutional review boards consist of at least five peo- 
ple, including at least one scientist, one nonscientist, 
and one person from outside the institution. Most 
boards are larger, but many contain just one outsider. 


(a) Why should review boards contain people who are 
not scientists? 
(b) Do you think that one outside member is enough? 


How would you choose that member? (For example, 
would you prefer a medical doctor? A member of 
the clergy? An activist for patients’ rights?) 
107.*No consent needed? In which of the circumstances 
below would you allow collecting personal informa- 
tion without the subjects’ consent? 
(a) A government agency takes a random sample of 
income tax returns to obtain information on the 
average income of people in different occupations. 
Only the incomes and occupations are recorded 
from the returns, not the names. 
A social psychologist attends public meetings of a reli- 
gious group to study the behavior patterns of members. 
A social psychologist pretends to be converted to 
membership in a religious group and attends private 
meetings to study the behavior patterns of members. 
108.* Surveys of youth A survey asked teenagers whether 
they had ever consumed an alcoholic beverage. 
Those who said “Yes” were then asked, “How old 
were you when you first consumed an alcoholic bev- 
erage?” Should consent of parents be required to ask 
minors about alcohol, drugs, and other such issues, 
or is consent of the minors themselves enough? Give 
reasons for your opinion. 
109.* Anonymous? Confidential? One of the most impor- 
tant nongovernment surveys in the United States is the 
National Opinion Research Center’s General Social 


*Exercises 105 to 112: This is an important topic, but it is not 
required for the AP® Statistics exam. 


Survey. The GSS regularly monitors public opinion on 
a wide variety of political and social issues. Interviews 
are conducted in person in the subject’s home. Are 
a subject’s responses to GSS questions anonymous, 
confidential, or both? Explain your answer. 
110.*Anonymous? Confidential? ‘Texas A&M, like many 
universities, offers screening for HIV, the virus that 
causes AIDS. Students may choose either anonymous 
or confidential screening. An announcement says, 
“Persons who sign up for screening will be assigned a 
number so that they do not have to give their name.” 
They can learn the results of the test by telephone, 
still without giving their name. Does this describe the 
anonymous or the confidential screening? Why? 


111.*The Willowbrook hepatitis studies In the 1960s, 
children entering the Willowbrook State School, an 
institution for the intellectually disabled on Staten 
Island in New York, were deliberately infected with 
hepatitis. ‘he researchers argued that almost all 
children in the institution quickly became infected 
anyway. ‘he studies showed for the first time that two 
strains of hepatitis existed. ‘This finding contributed to 
the development of effective vaccines. Despite these 
valuable results, the Willowbrook studies are now 
considered an example of unethical research. Explain 
why, according to current ethical standards, useful 
results are not enough to allow a study. 


112.*Unequal benefits Researchers on aging proposed to 
investigate the effect of supplemental health services 
on the quality of life of older people. Eligible patients 
on the rolls of a large medical clinic were to be 
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randomly assigned to treatment and control groups. 
The treatment group would be offered hearing aids, 
dentures, transportation, and other services not avail- 
able without charge to the control group. ‘The review 
board felt that providing these services to some but 
not other persons in the same institution raised ethi- 
cal questions. Do you agree? 

. Animal testing (1.1) “It is right to use animals for 
medical testing if it might save human lives.” The 
General Social Survey asked 1152 adults to react to this 
statement. Here is the two-way table of their responses: 


Male Female 


Strongly agree 76 59 
Agree 270 247 
Neither agree nor disagree 87 139 
Disagree 61 123 
Strongly disagree 22. 68 


How do the distributions of opinion differ between 
men and women? Give appropriate graphical and 
numerical evidence to support your answer. 


. Initial public offerings (1.3) The business magazine 


114 
Z Forbes reports that 4567 companies sold their first 


stock to the public between 1990 and 2000. The mean 
change in the stock price of these companies since the 
first stock was issued was + 111%. The median change 
was —31%.°> Explain how this could happen. 


*Exercises 105 to 112: This is an important topic, but it is not 
required for the AP® Statistics exam. 


Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


In a recent study, 166 adults from the St. Louis area were 
recruited and randomly assigned to receive one of two treat- 
ments for a sinus infection. Half of the subjects received an an- 
tibiotic (amoxicillin) and the other half received a placebo.” 

(a) Describe how the researchers could have assigned 
treatments to subjects if they wanted to use a com- 
pletely randomized design. 

(b) All the subjects in the experiment had moderate, 
severe, or very severe symptoms at the beginning of 
the study. Describe one statistical benefit and one 
statistical drawback for using subjects with moder- 


ate, severe, or very severe symptoms instead of just 
using subjects with very severe symptoms. 

(c) At different stages during the next month, all sub- 

jects took the sino-nasal outcome test. After 10 
days, the difference in average test scores was not 
statistically significant. In this context, explain 
what it means for the difference to be not statisti- 
cally significant. 
One possible way that researchers could have 
improved the study is to use a randomized block 
design. Explain how the researchers could have 
incorporated blocking in their design. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you think 
each solution is “complete,” “substantial,” “developing,” or “mini- 
mal.” If the solution is not complete, what improvements would you 
suggest to the student who wrote it? Finally, your teacher will provide 
you with a scoring rubric. Score your response and note what, if any- 
thing, you would do differently to improve your own score. 


276 


Chapter Review 


Section 4.1: Sampling and Surveys 


In this section, you learned that a population is the group of 
all individuals that we want information about. A sample is 
the subset of the population that we use to gather this infor- 
mation. The goal of most sample surveys is to use sample in- 
formation to draw conclusions about the population. Choos- 
ing people for a sample because they are located nearby or 
letting people choose whether or not to be in the sample are 
poor ways to choose a sample. Because convenience sam- 
ples and voluntary response samples will produce estimates 
that are consistently too large or consistently too small, these 
methods of choosing a sample are biased. 

To avoid bias in the way the sample is formed, the mem- 
bers of the sample should be chosen at random. One way 
to do this is with a simple random sample (SRS), which is 
equivalent to pulling well-mixed slips of paper from a hat. 
It is often more convenient to select an SRS using technol- 
ogy or a table of random digits. 

‘Two other random sampling methods are stratified sam- 
pling and cluster sampling. ‘To obtain a stratified random 
sample, divide the population into groups (strata) of simi- 
lar individuals, take an SRS from each stratum, and com- 
bine the chosen individuals to form the sample. Stratified 
random samples can produce estimates with much greater 
precision than simple random samples. To obtain a clus- 
ter sample, divide the population into groups (clusters) of 
individuals that are in similar locations, randomly select 
clusters, and use every individual in the chosen clusters. 
Cluster samples are easier to obtain than simple random 
samples or stratified random samples, but they may not pro- 
duce very precise estimates. 

Finally, you learned about other issues in sample sur- 
veys that can lead to bias: undercoverage occurs when the 
sampling method systematically excludes one part of the 
population. Nonresponse describes when answers cannot 
be obtained from some people that were chosen to be in 
the sample. Bias can also result when some people in the 
sample don’t give accurate responses due to question word- 
ing, interviewer characteristics, or other factors. 


Section 4.2: Experiments 


In this section, you learned about the difference between ob- 
servational studies and experiments. Experiments deliberate- 
ly impose a treatment to see if there is a cause-and-effect rela- 
tionship between two variables. Observational studies look at 
relationships between two variables, but cannot show cause 
and effect because other variables may be confounded with 
the explanatory variable. Variables are confounded when it 
is impossible to determine which of the variables is causing 
a change in the response variable. 


Acommon type of experiment uses a completely random- 
ized design. In this type of design, the experimental units are 
divided into groups, one group for each of the treatments. 
To determine which experimental units are in which group, 
we use random assignment. With random assignment, the 
effects of variables (other than the explanatory variable) 
are roughly balanced out between the groups. Replication 
means giving each treatment to as many experimental units 
as possible. This makes it easier to see the effects of the treat- 
ments because the effects of other variables are more likely 
to be balanced among the treatment groups. 

During an experiment, it is important that other vari- 
ables be controlled (kept the same) for each experimen- 
tal unit. Doing so helps avoid confounding and removes a 
possible source of variation in the response variable. Also, 
beware of the placebo effect—the tendency for people to 
improve because they expect to, not because of the treat- 
ment they are receiving. One way to make sure that all 
experimental units have the same expectations is to make 
them blind—unaware of which treatment they are receiv- 
ing. When the person measuring the response variable is 
also blind, the experiment is called double-blind. 

The results of an experiment are statistically significant 
if the difference in the response is too large to be accounted 
for by the random assignment of experimental units to 
treatments. ‘To make it more likely to obtain statistically 
significant results, experiments can incorporate blocking. 
Blocking in experiments is similar to stratifying in sam- 
pling. To form blocks, group together experimental units 
that are similar with respect to a variable that is associated 
with the response. Then randomly assign the treatments 
within each block. A design that uses blocks with two ex- 
perimental units is called a matched pairs design. Block- 
ing helps us estimate the effects of the treatments more 
precisely because we can account for the variability intro- 
duced by the variables used to form the blocks. 


Section 4.3: Using Studies Wisely 


In this section, you learned that the different types of conclu- 
sions we can draw depend on how the data are produced. 
When samples are selected at random, we can make in- 
ferences about the population from which the sample was 
drawn. When treatments are applied to groups formed at 
random, we can conclude cause and effect. 

Making a cause-and-effect conclusion is often difficult 
because it is impossible or unethical to perform certain 
types of experiments. Good data ethics requires that stud- 
ies should be approved by an institutional review board, 
subjects should give informed consent, and individual data 
must be kept confidential. 


What Did You Learn? 


Learning Objective 


Section 


Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Identify the population and sample in a statistical study. 


210 


R4.1 


Identify voluntary response samples and convenience samples. 
Explain how these sampling methods can lead to bias. 


Zs) 


R4.2 


Describe how to obtain a random sample using slips 
of paper, technology, or a table of random digits. 


214, 217 


R4.3 


Distinguish a simple random sample from a stratified random 
sample or cluster sample. Give the advantages and 
disadvantages of each sampling method. 


Explain how undercoverage, nonresponse, question wording, 
and other aspects of a sample survey can lead to bias. 


Distinguish between an observational study and an experiment. 


Explain the concept of confounding and how it limits the ability 
to make cause-and-effect conclusions. 


235 


Identify the experimental units, explanatory and response 
variables, and treatments in an experiment. 


237, 239 


Explain the purpose of comparison, random assignment, control, 
and replication in an experiment. 


243 


Describe a completely randomized design for an experiment, 
including how to randomly assign treatments using slips of 
paper, technology, or a table of random digits. 


246 


R4.7, R4.10 


Describe the placebo effect and the purpose of blinding in 
an experiment. 


247 


R4.9 


Interpret the meaning of statistically significant in the context 
of an experiment. 


Explain the purpose of blocking in an experiment. Describe 
a randomized block design or a matched pairs design for 
an experiment. 


249 (Activity) 


291, 254 


R4.9 


R4.7, R4.10 


Describe the scope of inference that is appropriate in a 
statistical study. 


267 


R4.8 


*Evaluate whether a statistical study has been carried out in 
an ethical manner. 


*This is an important topic, but it is not required for the AP® Statistics exam. 


Discussion on 270 
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CHAPTER 4 


DESIGNING STUDIES 


Chapter 4 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R4.1 


(a) 


(b) 


R4.2 


R4.3 


R4.4 


Ontario Health Survey ‘The Ministry of Health 

in the province of Ontario, Canada, wants to know 
whether the national health care system is achieving 
its goals in the province. Much information about 
health care comes from patient records, but that 
source doesn’t allow us to compare people who use 
health services with those who don’t. So the Minis- 
try of Health conducted the Ontario Health Survey, 
which interviewed a random sample of 61,239 
people who live in the province of Ontario.°° 

What is the population for this sample survey? What 
is the sample? 

The survey found that 76% of males and 86% of 
females in the sample had visited a general practi- 
tioner at least once in the past year. If a census were 
conducted, do you think that the percentages would 
be the same as in the sample? Explain. 

Bad sampling A large high school wants to gather 
student opinion about parking for students on cam- 
pus. It isn’t practical to contact all students. 

Give an example of a way to choose a voluntary re- 
sponse sample of students. Explain how this method 
could lead to bias. 


Give an example of a way to choose a convenience 
sample of students. Explain how this method could 
lead to bias. 

Drug testing A baseball team regularly conducts 
random drug tests on its players. The 25 members 
of the team are listed below. 


Agarwal Chen Healy = Moser Roberts 
Andrews Frank Hixson Musselman Shen 
Baer Fuest Lee Pavnica Smith 
Berger Fuhrmann Lynch  Petrucelli Sundheim 
Brockman Garcia Milhalko Reda Wilson 


Explain how you would use the line of random 
digits below to select an SRS of 3 team members for 
a random drug test. 

Use your method from part (a) to choose the SRS 
using the digits below. Show your work. 


Lal 78009 46239 84569 03316 


Polling the faculty A researcher wants to study 
the attitudes of college faculty members about the 
work habits of entering freshmen. These attitudes 
appear to differ depending on the type of college. 
The American Association of University Professors 
classifies colleges as follows: 


R45 


Class I: Offer doctorate degrees and award at least 
15 per year. 

Class IIA: Award degrees above the bachelor’s but 
are not in Class I. 

Class IIB: Award no degrees beyond the bachelor’s. 
Class III: ‘Two-year colleges. 

The researcher would like to survey about 200 faculty 
members. Would you recommend a simple random 
sample, stratified random sample, or cluster sample? 
Justify your answer. 


Been to the movies? An opinion poll calls 2000 
ran domly chosen residential telephone numbers, 
then asks to speak with an adult member of the 
household. The interviewer asks, “How many mov- 
ies have you watched in a movie theater in the past 
12 months?” In all, 1131 people responded. ‘The 
researchers used the responses to estimate the mean 
number of movies adults have watched in a movie 
theater in the past 12 months. 

Describe a potential source of bias related to the 
wording of the question. Suggest a change that 
would help fix this problem. 

Describe how using only residential phone num- 
bers might lead to bias and how this will affect the 
estimate. 

Describe how nonresponse might lead to bias and 
how this will affect the estimate. 


R4.6 Are anesthetics safe? ‘The National Halothane Study 


— 


was a major investigation of the safety of anesthetics 
used in surgery. Records of over 850,000 operations 
performed in 34 major hospitals showed the following 
death rates for four common anesthetics:*! 


Anesthetic: A B C D 
Death rate: 1.7% 1.7% 3.4% 1.9% 


There seems to be a clear association between the 
anesthetic used and the death rate of patients. Anes- 
thetic C appears to be more dangerous. 


Explain why we call the National Halothane Study 
an observational study rather than an experiment, 
even though it compared the results of using differ- 
ent anesthetics in actual surgery. 


(b) When the study looked at other variables that are 


related to a doctor’s choice of anesthetic, it found 
that Anesthetic C was not causing extra deaths. 
Explain the concept of confounding in this context 
and identify a variable that might be confounded 
with the doctor’s choice of anesthetic. 


R4.7 Ugly fries Few people want to eat discolored french 
fries. Potatoes are kept refrigerated before being 
cut for french fries to prevent spoiling and preserve 
flavor. But immediate processing of cold potatoes 
causes discoloring due to complex chemical reac- 
tions. ‘The potatoes must therefore be brought to 
room temperature before processing. Researchers 
want to design an experiment in which tasters will 
rate the color and flavor of french fries prepared 
from several groups of potatoes. The potatoes will 
be freshly picked or stored for a month at room tem- 
perature or stored for a month refrigerated. ‘They 
will then be sliced and cooked either immediately 
or after an hour at room temperature. 

(a) Identify the experimental units, the explanatory and 
response variables, and the treatments. 

(b) ‘The researchers plan to use a completely random- 
ized design. Describe how they should assign 
treatments to the experimental units if there are 300 
potatoes available for the experiment. 

(c) The researchers decided to do a follow-up experi- 
ment using sweet potatoes as well as regular potatoes. 
Describe how they should change the design of the ex- 
periment to account for the addition of sweet potatoes. 


R4.8 Don’t catch a cold! A recent study of 1000 stu- 
dents at the University of Michigan investigated 
how to prevent catching the common cold. The stu- 
dents were randomly assigned to three different cold 
prevention methods for 6 weeks. Some wore masks, 
some wore masks and used hand sanitizer, and oth- 
ers took no precautions. ‘The two groups who used 
masks reported 10-50% fewer cold symptoms than 
those who did not wear a mask. 


— 
fo 
na 


Does this study allow for inference about a popula- 
tion? Explain. 


Ss 


Does this study allow for inference about cause and 
effect? Explain. 

R4.9 An herb for depression? Does the herb Saint- 
John’s-wort relieve major depression? Here is an 
excerpt from the report of a study of this issue: 
“Design: Randomized, Double-Blind, Placebo- 
Controlled Clinical Trial.” The study concluded 
that the difference in effectiveness of Saint-John’s- 
wort and a placebo was not statistically significant. 
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(a) How did the design of this experiment account for 
the placebo effect? 


(b) Explain the purpose of the random assignment. 


(c) Why is a double-blind design a good idea in this 
setting? 

(d) Explain what “not statistically significant” means 
in this context. 

R4.10 How long did I work? A psychologist wants to 
know if the difficulty of a task influences our 
estimate of how long we spend working at it. She 
designs two sets of mazes that subjects can work 
through on a computer. One set has easy mazes 
and the other has difficult mazes. Subjects work 
until told to stop (after 6 minutes, but subjects do 
not know this). They are then asked to estimate how 
long they worked. The psychologist has 30 students 
available to serve as subjects. 

(a) Describe an experiment using a completely ran- 
domized design to learn the effect of difficulty on 
estimated time. 

(b) Describe a matched pairs experimental design us- 
ing the same 30 subjects. 

(c) Which design would be more likely to detect a dif- 
ference in the effects of the treatments? Explain. 

R4.11* Deceiving subjects Students sign up to be 
subjects in a psychology experiment. When they 
arrive, they are told that interviews are running late 
and are taken to a waiting room. The experiment- 
ers then stage a theft of a valuable object left in the 
waiting room. Some subjects are alone with the 
thief, and others are in pairs—these are the treat- 


ments being compared. Will the subject report the 
theft? 


(a) The students had agreed to take part in an unspec- 
ified study, and the true nature of the experiment 
is explained to them afterward. Does this meet the 
requirement of informed consent? Explain. 


(b) What two other ethical principles should be fol- 
lowed in this study? 


*This is an important topic, but it is not required for the 
AP® Statistics exam. 


Chapter 4 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T4.1 When we take a census, we attempt to collect data from 


(a) a stratified random sample. 
(b) every individual chosen in a simple random sample. 
(c) every individual in the population. 


(d) a voluntary response sample. 


(e) a convenience sample. 


T4.2 You want to take a simple random sample (SRS) of 


50 of the 816 students who live in a dormitory on 
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campus. You label the students 001 to 816 in alpha- 
betical order. In the table of random digits, you read 
the entries 


95592 94007 69769 33547 72450 16632 81194 14873 


The first three students in your sample have labels 
(a) 955, 929° 400: (d) 929, 400, 769. 
(b) 400, 769, 769. (e) 400, 769, 335. 
(ce) 559; 294,007, 


14.3 A study of treatments for angina (pain due to low 
blood supply to the heart) compared bypass surgery, 
angioplasty, and use of drugs. ‘The study looked at 
the medical records of thousands of angina patients 
whose doctors had chosen one of these treatments. It 
found that the average survival time of patients given 
drugs was the highest. What do you conclude? 

(a) This study proves that drugs prolong life and should 
be the treatment of choice. 


(b) We can conclude that drugs prolong life because the 
study was a comparative experiment. 

(c) We can’t conclude that drugs prolong life because 
the patients were volunteers. 

(d) We can’t conclude that drugs prolong life because 
this was an observational study. 


(e) We can’t conclude that drugs prolong life because no 
placebo was used. 

14.4 ‘Tonya wanted to estimate the average amount of 
time that students at her school spend on Facebook 
each day. She gets an alphabetical roster of students 
in the school from the registrar’s office and numbers 
the students from | to 1137. Then Tonya uses a ran- 
dom number generator to pick 30 distinct labels from 
1 to 1137. She surveys those 30 students about their 
Facebook use. ‘Tonya’s sample is a simple random 
sample because 

(a) it was selected using a chance process. 


(b) it gave every individual the same chance to be 

selected. 

(c) it gave every possible sample of the same size an 

equal chance to be selected. 

(d) it doesn’t involve strata or clusters. 

(e) it is guaranteed to be representative of the population. 
14.5 Consider an experiment to investigate the effective- 
ness of different insecticides in controlling pests and 
their impact on the productivity of tomato plants. 
What is the best reason for randomly assigning treat- 
ment levels (spraying or not spraying) to the experi- 
mental units (farms)? 

Random assignment allows researchers to generalize 
conclusions about the effectiveness of the insecticides 
to all farms. 

Random assignment will tend to average out all other 
uncontrolled factors such as soil fertility so that they 
are not confounded with the treatment effects. 


(c) Random assignment eliminates the effects of other 
variables, like soil fertility. 
(d) Random assignment eliminates chance variation in 
the responses. 
(e) Random assignment helps avoid bias due to the 
placebo effect. 
14.6 The most important advantage of experiments over 
observational studies is that 
(a) experiments are usually easier to carry out. 
(b) experiments can give better evidence of causation. 
(c) confounding cannot happen in experiments. 
(d) an observational study cannot have a response variable. 
(e) observational studies cannot use random samples. 
T4.7 A‘TYV station wishes to obtain information on the T'V 
viewing habits in its market area. The market area 
contains one city of population 170,000, another city of 
70,000, and four towns of about 5000 inhabitants each. 
‘The station suspects that the viewing habits may be dif 
ferent in larger and smaller cities and in the rural areas. 
Which of the following sampling designs would give 
the type of information that the station requires? 


market area 

(e) An online poll that invites all people from the cities 

and towns in the market area to participate 
14.8 Bias in a sampling method is 

(a) any difference between the sample result and the 
truth about the population. 

(b) the difference between the sample result and the 
truth about the population due to using chance to 
select a sample. 

(c) any difference between the sample result and the 
truth about the population due to practical difficul- 
ties such as contacting the subjects selected. 


(d 


—J 


any difference between the sample result and the 
truth about the population that tends to occur in 
the same direction whenever you use this sampling 
method. 

racism or sexism on the part of those who take the 
sample. 


T4.9 You wonder if 'l'V ads are more effective when they 
are longer or repeated more often or both. So you 
design an experiment. You prepare 30-second and 
60-second ads for a camera. Your subjects all watch 
the same T'V program, but you assign them at ran- 
dom to four groups. One group sees the 30-second ad 
once during the program; another sees it three times; 
the third group sees the 60-second ad once; and the 
last group sees the 60-second ad three times. You ask 
all subjects how likely they are to buy the camera. 


(e 


— 


(a) This is a randomized block design, but not a 


matched pairs design. 


(b) This is a matched pairs design. 


(c) This isa completely randomized design with one 


(d) 


(e 


explanatory variable (factor). 

This is a completely randomized design with two 
explanatory variables (factors). 

This is a completely randomized design with four 
explanatory variables (factors). 


T4.10 A researcher wishes to compare the effects of two fer- 


4 


tilizers on the yield of soybeans. She has 20 plots of 
land available for the experiment, and she decides 
to use a matched pairs design with 10 pairs of plots. 
To carry out the random assignment for this design, 
the researcher should 


use a table of random numbers to divide the 20 
plots into 10 pairs and then, for each pair, flip a 
coin to assign the fertilizers to the 2 plots. 
subjectively divide the 20 plots into 10 pairs (mak- 
ing the plots within a pair as similar as possible) and 
then, for each pair, flip a coin to assign the fertiliz- 
ers to the 2 plots. 


use a table of random numbers to divide the 20 
plots into 10 pairs and then use the table of 
random numbers a second time to decide on the 
fertilizer to be applied to each member of the 
pair. 

flip a coin to divide the 20 plots into 10 pairs and 
then, for each pair, use a table of random numbers 
to assign the fertilizers to the 2 plots. 
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(e) use a table of random numbers to assign the two 


fertilizers to the 20 plots and then use the table of 
random numbers a second time to place the plots 
into 10 pairs. 


T4.11 You want to know the opinions of American high 


(d 


7 


See 


=e 


school teachers on the issue of establishing a national 
proficiency test as a prerequisite for graduation from 
high school. You obtain a list of all high school teach- 
ers belonging to the National Education Association 
(the country’s largest teachers’ union) and mail a 
survey to a random sample of 2500 teachers. In all, 
1347 of the teachers return the survey. Of those 

who responded, 32% say that they favor some kind 
of national proficiency test. Which of the following 
statements about this situation is true? 

Because random sampling was used, we can feel 
confident that the percent of all American high 
school teachers who would say they favor a national 
proficiency test is close to 32%. 

We cannot trust these results, because the survey 
was mailed. Only survey results from face-to-face 
interviews are considered valid. 

Because over half of those who were mailed the 
survey actually responded, we can feel fairly confi- 
dent that the actual percent of all American high 
school teachers who would say they favor a national 
proficiency test is close to 32%. 

The results of this survey may be affected by nonre- 
sponse bias. 


(e) ‘The results of this survey cannot be trusted due to 


voluntary response bias. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


14.12 Elephants sometimes damage trees in Africa. It 


— 


turns out that elephants dislike bees. They recognize 
beehives in areas where they are common and avoid 
them. Can this be used to keep elephants away from 
trees? Will elephant damage be less in trees with 
hives? Will even empty hives keep elephants away? 
Researchers want to design an experiment to answer 
these questions using 72 acacia trees. 

Identify the experimental units, treatments, and the 
response variable. 


Describe how the researchers could carry out a 
completely randomized design for this experiment. 
Include a description of how the treatments should 
be assigned. 


14.13 A New York Times article on public opinion about 


steroid use in baseball discussed the results of a 
sample survey. ‘The survey found that 34% of adults 
think that at least half of Major League Baseball 
(MLB) players “use steroids to enhance their 
athletic performance.” Another 36% thought that 


about a quarter of MLB players use steroids; 8% had 
no opinion. Here is part of the Times’s statement on 
“How the Poll Was Conducted”: 


The latest New York Times/CBS News Poll is based 
on telephone interviews conducted March 15 
through March 18 with 1,067 adults throughout the 
United States.... The sample of telephone num- 
bers called was randomly selected by a computer 
from a list of more than 42,000 active residential 
exchanges across the country. The exchanges were 
chosen to ensure that each region of the country 
was represented in proportion to its population. In 
each exchange, random digits were added to form 
a complete telephone number, thus permitting ac- 
cess to listed and unlisted numbers. In each house- 
hold, one adult was designated by a random proce- 
dure to be the respondent for the survey.” 


(a) Explain why the sampling method used in this 


survey was not a simple random sample. 


282 CHAPTER 4 


(b) Why was one adult chosen at random in each 
household to respond to the survey? 

(c) Explain how undercoverage could lead to bias in 
this sample survey. 


14.14 Many people start their day with a jolt of caffeine 


from coffee or a soft drink. Most experts agree that 
people who take in large amounts of caffeine each 
day may suffer from physical withdrawal symptoms 

if they stop ingesting their usual amounts of caffeine. 
Researchers recruited 11 volunteers who were caf 
feine dependent and who were willing to take part in 
a caffeine withdrawal experiment. The experiment 
was conducted on two 2-day periods that occurred 
one week apart. During one of the 2-day periods, each 
subject was given a capsule containing the amount 
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of caffeine normally ingested by that subject in one 
day. During the other study period, the subjects were 
given placebos. The order in which each subject 
received the two types of capsules was randomized. 
The subjects’ diets were restricted during each of the 
study periods. At the end of each 2-day study period, 
subjects were evaluated using a tapping task in which 
they were instructed to press a button 200 times as fast 
as they could. 


(a) How and why was blocking used in the design of 
this experiment? 

(b) Why did researchers randomize the order in which 
subjects received the two treatments? 

(c) Could this experiment have been carried out in a 
double-blind manner? Explain. 
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AP 1.1 You look at real estate ads for houses in Sarasota, 


Florida. Many houses range from $200,000 to 
$400,000 in price. The few houses on the water, 
however, have prices up to $15 million. Which of 
the following statements best describes the distribu- 
tion of home prices in Sarasota? 

(a) The distribution is most likely skewed to the left, 
and the mean is greater than the median. 

(b) The distribution is most likely skewed to the left, 
and the mean is less than the median. 

(c) The distribution is roughly symmetric with a few 
high outliers, and the mean is approximately equal 
to the median. 

(d) The distribution is most likely skewed to the right, 
and the mean is greater than the median. 

(e) The distribution is most likely skewed to the right, 
and the mean is less than the median. 


AP 1.2 A child is 40 inches tall, which places her at the 90th 


percentile of all children of similar age. The heights 
for children of this age form an approximately 
Normal distribution with a mean of 38 inches. Based 
on this information, what is the standard deviation of 


the heights of all children of this age? 
(a) 0.20 inches (c) 0.65 inches (e) 1.56 inches 
(b) 0.31 inches (d) 1.21 inches 


AP1.3 A large set of test scores has mean 60 and standard 


deviation 18. If each score is doubled, and then 5 is 
subtracted from the result, the mean and standard 
deviation of the new scores are 


(a) mean 115; std. dev. 31. (d) mean 120; std. dev. 31. 
(b) mean 115; std. dev. 36. (e) mean 120; std. dev. 36. 
(c) mean 120; std. dev. 6. 


Section I: Multiple Choice Choose the best answer for Questions API.1 to AP1.14. 


AP 1.4 For a certain experiment, the available experimen- 


tal units are eight rats, of which four are female 
(F1, F2, F3, F4) and four are male (M1, M2, M3, 
M4). There are to be four treatment groups, A, B, 
C, and D. Ifa randomized block design is used, 
with the experimental units blocked by gender, 
which of the following assignments of treatments 
is impossible? 

(a) A> (FI, M1), B > (F2, M2), 
C > (F3, M3), D— (F4, M4) 


(b) A> (F1, M2), B > (F2, M3), 
C > (F3, M4), D> (F4, M1) 
(c) A (F1, M2), B= (F3, F2), 
C > (F4, M1), D> (M3, M4) 
(d) A> (F4, M1), B > (F2, M3), 
C > (F3, M2), D> (Fl, M4) 
(ec) A (F4, M1), B> (Fl, M4), 
C > (F3, M2), D> (F2, M3) 


AP1.5 Fora biology project, you measure the weight in 


grams (g) and the tail length in millimeters (mm) of 
a group of mice. The equation of the least-squares 
line for predicting tail length from weight is 

predicted tail length = 20 + 3 X weight 
Which of the following is not correct? 

(a) The slope is 3, which indicates that a mouse’s 
weight should increase by about 3 grams for each 
additional millimeter of tail length. 

(b) The predicted tail length of a mouse that weighs 
38 grams is 134 millimeters. 

(c) By looking at the equation of the least-squares line, 
you can see that the correlation between weight 
and tail length is positive. 
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(d) Ifyou had measured the tail length in centimeters (a) The student with an IQ of 96 is considered an 
instead of millimeters, the slope of the regression outlier by the 1.5 x JOR rule. 
line would have been 3/10 = 0.3. (b) The five-number summary of the 10 IQ scores is 
(e) One mouse weighed 29 grams and had a tail length of 96, 118, 123.5, 130, 145. 
100 millimeters. The residual for this mouse is —7. (c) If the value 96 were removed from the data set, 
AP 1.6 The figure below shows a Normal density curve. the mean of the remaining 9 IQ scores would be 
Which of the following gives the best estimates for greater than the mean of all 10 IQ scores. 
the mean and standard deviation of this Normal (d) If the value 96 were removed from the data set, 
distribution? the standard deviation of the remaining 9 IQ 
0.010 = scores would be less than the standard deviation of 


all 10 IQ scores. 


(e) Ifthe value 96 were removed from the data set, 
the JOR of the remaining 9 IQ scores would be 
less than the JOR of all 10 IQ scores. 


AP1.9 Before he goes to bed each night, Mr. Kleen pours 
dishwasher powder into his dishwasher and turns 
it on. Each morning, Mrs. Kleen weighs the box 
of dishwasher powder. From an examination of 
the data, she concludes that Mr. Kleen dispenses 
a rather consistent amount of powder each night. 
Which of the following statements is true? 


50 100 150 200 250 300 350 =©400 


I. There is a high positive correlation between the 


(a) pw = 200, o = 50 (d) w = 225,0 =25 number of days that have passed since the box of 
(b) 4 = 200, 0 = 25 (ej = 2256 = 215 dishwasher powder was opened and the amount 
c) p =225,0 = 50 of powder left in the box. 


IL. A scatterplot with days since purchase as the 
explanatory variable and amount of dishwasher 
powder used as the response variable would display 
a strong positive association. 


( 
AP1.7 ‘The owner of a chain of supermarkets notices that 
there is a positive correlation between the sales of 
beer and the sales of ice cream over the course of 
the previous year. During seasons when sales of beer 


were above average, sales of ice cream also tended II. The correlation between the amount of powder 
to be above average. Likewise, during seasons when left in the box and the amount of powder used 
sales of beer were below average, sales of ice cream should be —1. 
also tended to be below average. Which of the fol- (a) lonly (d) Il and III only 
lowing would be a valid conclusion from these facts? (b) Il only (e) 1, I, and If 

(a) Sales records must be in error. There should be no (c) II only 


association between beer and ice cream sales. 


AP1.10 The General Social Survey (GSS), conducted 
by the National Opinion Research Center at the 
University of Chicago, is a major source of data on 
social attitudes in the United States. Once each 
year, 1500 adults are interviewed in their homes 
all across the country. The subjects are asked their 
opinions about sex and marriage, attitudes toward 
women, welfare, foreign policy, and many other 
issues. ‘The GSS begins by selecting a sample of 
counties from the 3000 counties in the country. 


(b) Evidently, for a significant proportion of customers of 
these supermarkets, drinking beer causes a desire for 
ice cream or eating ice cream causes a thirst for beer. 

(c) Ascatterplot of monthly ice cream sales versus 
monthly beer sales would show that a straight line 
describes the pattern in the plot, but it would have 
to be a horizontal line. 


(d) There is a clear negative association between beer 
sales and ice cream sales. 


(e) ‘The positive correlation is most likely a result of The counties are divided into urban, rural, and 
the variable temperature; that is, as temperatures suburban; a separate sample is chosen at random 
increase, so do both beer sales and ice cream sales. from each group. This is a 

AP1.8 Here are the IQ scores of 10 randomly chosen fifth- (a) simple random sample. 
grade students: (b) systematic random sample. 
145 139 126 122 125 130 96 110 118 118 (c) cluster sample. 


Which of the following statements about this data (d) stratified random sample. 
set is not true? (e) voluntary response sample. 
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AP1.11 You are planning an experiment to determine 


the effect of the brand of gasoline and the weight 
of a car on gas mileage measured in miles per 
gallon. You will use a single test car, adding 
weights so that its total weight is 3000, 3500, or 
4000 pounds. The car will drive on a test track 
at each weight using each of Amoco, Marathon, 
and Speedway gasoline. Which is the best way to 
organize the study? 


Start with 3000 pounds and Amoco and run the 
car on the test track. Then do 3500 and 4000 
pounds. Change to Marathon and go through the 
three weights in order. Then change to Speedway 
and do the three weights in order once more. 
Start with 3000 pounds and Amoco and run the 
car on the test track. Then change to Marathon 
and then to Speedway without changing the 
weight. Then add weights to get 3500 pounds and 
go through the three gasolines in the same order. 
Then change to 4000 pounds and do the three 
gasolines in order again. 

Choose a gasoline at random, and run the car with 
this gasoline at 3000, 3500, and 4000 pounds in 
order. Choose one of the two remaining gasolines 
at random and again run the car at 3000, then 
3500, then 4000 pounds. Do the same with the 
last gasoline. 

There are nine combinations of weight and gasoline. 
Run the car several times using each of these combi- 
nations. Make all these runs in random order. 
Randomly select an amount of weight and a brand 
of gasoline, and run the car on the test track. 
Repeat this process a total of 30 times. 


AP1.12 A linear regression was performed using the five 


(a) 


following data points: A(2, 22), B(10, 4), C(6, 14), 
D(14, 2), E(18, —4). The residual for which of the 
five points has the largest absolute value? 


A ()B ()C @D (JE 
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AP 1.13 


(a) 
(b) 
(c) 
(d) 
(e) 


The frequency table below summarizes the times 
in the last month that patients at the emergency 
room of a small-city hospital waited to receive 
medical attention. 


Waiting time Frequency 
Less than 10 minutes 5 
At least 10 but less than 20 minutes 24 
At least 20 but less than 30 minutes 45 
At least 30 but less than 40 minutes 38 
At least 40 but less than 50 minutes 19 
At least 50 but less than 60 minutes ul 
At least 60 but less than 70 minutes 2 


Which of the following represents possible values 
for the median and mean waiting times for the 
emergency room last month? 


median = 27 minutes and mean = 24 minutes 
median = 28 minutes and mean = 30 minutes 
median = 31 minutes and mean = 35 minutes 
median = 35 minutes and mean = 39 minutes 


median = 45 minutes and mean = 46 minutes 


Boxplots of two data sets are shown. 


'— ia Plot 1 
a 


Plot 2 


Based on the boxplots, which statement below is 
true? 


a) The range of both plots is about the same. 
The means of both plots are approximately equal. 


Plot 2 contains more data points than Plot 1. 


The medians are approximately equal. 


Plot | is more symmetric than Plot 2. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded 
on the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


AP1.15 The manufacturer of exercise machines for fitness 


centers has designed two new elliptical machines 
that are meant to increase cardiovascular fit- 

ness. ‘The two machines are being tested on 30 
volunteers at a fitness center near the company’s 
headquarters. The volunteers are randomly as- 
signed to one of the machines and use it daily for 
two months. A measure of cardiovascular fitness 
is administered at the start of the experiment and 
again at the end. The following table contains the 
differences in the two scores (After — Before) for 


the two machines. Note that higher scores indicate 
larger gains in fitness. 


Machine A Machine B 
0 2 
54 1 0 
876320 2 159 
97411 3 2489 
61 4 2517 
5 359 


(a) Write a few sentences comparing the distributions of 
cardiovascular fitness gains from the two elliptical 
machines. 


(b) Which machine should be chosen if the com- 
pany wants to advertise it as achieving the highest 
overall gain in cardiovascular fitness? Explain your 
reasoning. 


(c) Which machine should be chosen if the company 
wants to advertise it as achieving the most consis- 
tent gain in cardiovascular fitness? Explain your 
reasoning. 


(d) Give one reason why the advertising claims of the 
company (the scope of inference) for this experi- 
ment would be limited. Explain how the company 
could broaden that scope of inference. 


AP 1.16 Those who advocate for monetary incentives in a 
work environment claim that this type of incen- 
tive has the greatest appeal because it allows the 
winners to do what they want with their winnings. 
Those in favor of tangible incentives argue that 
money lacks the emotional appeal of, say, a week- 
end for two at a romantic country inn or elegant 
hotel or a weeklong trip to Europe. 


A few years ago a national tire company, in an ef- 
fort to improve sales of a new line of tires, decided 
to test which method — offering cash incentives or 
offering non-cash prizes such as vacations —was 
more successful in increasing sales. ‘The company 
had 60 retail sales districts of various sizes across 
the country and data on the previous sales volume 
for each district. 


(a) Describe a completely randomized design using 
the 60 retail sales districts that would help answer 
this question. 


(b) Explain how you would use the table of random 
digits below to do the randomization that your de- 
sign requires. Then use your method to make the 
first three assignments. Show your work clearly. 


07511 88915 41267 16853 84569 79367 32337 03316 
81486 69487 60513 09297 00412 71238 27649 39950 


(c) One of the company’s officers suggested that it 
would be better to use a matched pairs design in- 
stead of a completely randomized design. Explain 
how you would change your design to accomplish 
this. 

AP 1.17 In retail stores, there is a lot of competition for 
shelf space. There are national brands for most 
products, and many stores carry their own line 
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of in-house brands, too. Since shelf space is not 
infinite, the question is how many linear feet to 
allocate to each product and which shelf (top, 
bottom, or somewhere in the middle) to put it on. 
The middle shelf is the most popular and lucra- 
tive, because many shoppers, if undecided, will 
simply pick the product that is at eye level. 


A local store that sells many upscale goods is trying 
to determine how much shelf space to allocate 

to its own brand of men’s personal-grooming 
products. The middle shelf space is randomly 
varied between 3 and 6 linear feet over the next 12 
weeks, and weekly sales revenue (in dollars) from 
the store’s brand of personal-grooming products 
for men is recorded. Below is some computer 
output from the study, along with a scatterplot. 


1300 
1200 
;” 
3 1000 
900 
800 
3.0 35 4.0 45 5.0 55 6.0 
Shelf Length (feet) 
Predictor Coef SE Coef T P 
Constant 317.94 a3? 16.15 0,000 
Shelf length 152.680 6.445 23.69 0',,000 


S = 22.9212 R-Sq = 98.2% R-Sq(adj) = 98.1% 


(a) Describe the relationship between shelf length 
and sales. 

(b) Write the equation of the least-squares regression 
line. Be sure to define any variables you use. 

(c) Ifthe store manager were to decide to allocate 5 
linear feet of shelf space to the store’s brand of 
men’s grooming products, what is the best esti- 
mate of the weekly sales revenue? 

(d) Interpret the value of s. 

(e) Identify and interpret the coefficient of 
determination. 

(f) The store manager questions the intercept of the 
regression line: “Am I supposed to believe that this 
analysis tells me that I can sell these products with 
no shelf space?” How do you answer her? 
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Probability: 
What Are the Chances? 


Calculated Risks 


Many high schools now have drug-testing programs for athletes. The main goal of these programs is to 
reduce the use of banned substances by students who play sports. It is not practical to test every athlete 
for drug use regularly. Instead, school administrators give drug tests to randomly selected student athletes 
at unannounced times during the school year. Students who test positive face serious consequences, 
including letters to their parents, required counseling, and suspension from athletic participation. 

Drug tests aren’t perfect. Sometimes the tests say that athletes took a banned substance when they did 
not. This is known as a false positive. Other times, drug tests say that athletes are “clean” when they did 
take a banned substance. This is called a false negative. 

Suppose that 16% of the high school athletes in a large school district have taken a banned substance. 
The drug test used by this district has a false positive rate of 5% and a false negative rate of 10%. 
If a randomly chosen athlete tests positive, what’s the chance that the student actually took a banned 
substance? 
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fe Introduction 


Chance is all around us. You and your friend play rock-paper-scissors to determine 
who gets the last slice of pizza. A coin toss decides which team gets to receive the 
ball first in a football game. Many adults regularly play the lottery, hoping to win 
a big jackpot with a few lucky numbers. Others head to casinos or racetracks, hop- 
ing that some combination of luck and skill will pay off. People young and old 
play games of chance involving cards or dice or spinners. The traits that children 
inherit— gender, hair and eye color, blood type, handedness, dimples, whether 
they can roll their tongues—are determined by the chance involved in which 
genes get passed along by their parents. 

A roll of a die, a simple random sample, and even genetic 
inheritance represent chance behavior that we can understand 
and work with. We can roll the die again and again and again. 
The outcomes are governed by chance, but in many repetitions a 
pattern emerges. We use mathematics to understand the regular 
patterns of chance behavior when we repeat the same chance 
process again and again. 

The mathematics of chance is called probability. Probability is 
the topic of this chapter. Here is an Activity that gives you some 
idea of what lies ahead. 


ACTIVITY | The “1 in 6 wins” game 


MATERIALS: As a special promotion for its 20-ounce bottles of soda, a soft drink company 
One six-sided die for each printed a message on the inside of each bottle cap. Some of the caps said, “Please 
student try again!” while others said, “You’re a winner!” The company advertised the pro- 


motion with the slogan “1 in 6 wins a prize.” The prize is a free 20-ounce bottle 
of soda, which comes out of the store owner’s profits. 

Seven friends each buy one 20-ounce bottle at a local convenience store. The 
store clerk is surprised when three of them win a prize. The store owner is con- 
cerned about losing money from giving away too many free sodas. She wonders 
if this group of friends is just lucky or if the company’s 1-in-6 claim is inaccurate. 
In this Activity, you and your classmates will perform a simulation to help answer 
this question. 

For now, let’s assume that the company is telling the truth, and that every 
20-ounce bottle of soda it fills has a 1-in-6 chance of getting a cap that says, “You’re 
a winner!” We can model the status of an individual bottle with a six-sided die: let 
1 through 5 represent “Please try again!” and 6 represent “You're a winner!” 


1. Roll your die seven times to imitate the process of the seven friends buying 
their sodas. How many of them won a prize? 


2. Your teacher will draw and label axes for a class dotplot. Plot the number of 
prize winners you got in Step | on the graph. 
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3. Repeat Steps 1 and 2 if needed to get a total of at least 40 repetitions of the 
simulation for your class. 


4. Discuss the results with your classmates. What percent of the time did the friends 
come away with three or more prizes, just by chance? Does it seem plausible that the 
company is telling the truth, but that the seven friends just got lucky? Explain. 


As the Activity shows, simulation is a powerful method for modeling chance 
behavior. Section 5.1 begins by examining the idea of probability and then 
illustrates how simulation can be used to estimate probabilities. In Sections 5.2 
and 5.3, we develop the basic rules of probability. Along the way, we introduce 
some helpful tools for displaying possible outcomes from a chance process: 
two-way tables, Venn diagrams, and tree diagrams. 

Probability calculations are the basis for inference. When we produce data by 
random sampling or randomized comparative experiments, the laws of probabil- 
ity answer the question “What would happen if we repeated the random sampling 
or random assignment process many times?” Many of the examples, exercises, 
and activities in this chapter focus on the connection between probability and 
inference. 


PB Randomness, Probability, 


and Simulation 


WHAT YOU WILL LEARN __By the end of the section, you should be able to: 


e Interpret probability as a long-run relative frequency. e Use simulation to model chance behavior. 


Toss a coin 10 times. How likely are you to get a run of 3 or more consecutive 
heads or tails? An airline knows that a certain percent of customers who pur- 
chased tickets will not show up for a flight. If the airline overbooks a particular 
flight, what are the chances that they'll have enough seats for 
the passengers who show up? A couple plans to have chil- 
dren until they have at least one child of each gender. How 
many children should they expect to have? To answer these 
questions, you need a better understanding of how chance 
behavior operates. 


The Idea of Probability 


In football, a coin toss helps determine which team gets the ball 
first. Why do the rules of football require a coin toss? Because 
tossing a coin seems a “fair” way to decide. That’s one reason 
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why statisticians recommend random samples and randomized experiments. 
They avoid bias by letting chance decide who gets selected or who receives which 
treatment. 

A big fact emerges when we watch coin tosses or the results of random sampling 
and random assignment closely: chance behavior is unpredictable in the short run 
but has a regular and predictable pattern in the long run. This remarkable fact is 
the basis for the idea of probability. 


ACTIVITY | Probability applet 


MATERIALS: If you toss a fair coin, what’s the chance that it shows heads? It’s 1/2, or 0.5, right? 
Computer with Internet In this Activity, you'll use the Probability applet at www.whfreeman.com/tps5e to 
connection investigate what probability really means. 


1. If you toss a fair coin 10 times, how many heads will you get? Before you 
answer, launch the Probability applet. Set the number of tosses at 10 and click 
“Toss.” What proportion of the tosses were heads? Click “Reset” and toss the 
coin 10 more times. What proportion of heads did you get this time? Repeat this 
process several more times. What do you notice? 


2. What if you toss the coin 100 times? Reset the applet 
QSS0Q66686 and have it do 100 tosses. Is the proportion of heads exactly 
Heads = 410 = 0.400 equal to 0.5? Close to 0.5? 


3. Keep on tossing without hitting “Reset.” What happens 
to the proportion of heads? 


4. Asa class, discuss what the following statement means: 
“If you toss a fair coin, the probability of heads is 0.5.” 


5. Predict what will happen if you change the probability 
of heads to 0.3 (an unfair coin). Then use the applet to test 
your prediction. 


6. If you toss a coin, it can land heads or tails. If you “toss” 

a thumbtack, it can land with the point sticking up or with 
the point down. Does that mean that the probability of a tossed thumbtack land- 
ing point up is 0.5? How could you find out? Discuss with your classmates. 


We might suspect that a coin has probability 0.5 of coming up heads just 
because the coin has two sides. But we can’t be sure. In fact, spinning a penny 
on a flat surface, rather than tossing the coin, gives heads a probability of about 
0.45 rather than 0.5.! What about thumbtacks? They also have two ways to land— 
point up or point down —but the chance that a tossed thumbtack lands point up 
isn’t 0.5. How do we know that? From tossing a thumbtack over and over and 
over again. Probability describes what happens in very many trials, and we must 
actually observe many tosses of a coin or thumbtack to pin down a probability. 


Section 5.1 Randomness, Probability, and Simulation 291 


Tossing Coins 
Short-run and long-run behavior 


When you toss a coin, there are only two possible outcomes, heads or tails. Figure 
5.1(a) shows the results of tossing a coin 20 times. For each number of tosses from 
1 to 20, we have plotted the proportion of those tosses that gave a head. You can 
see that the proportion of heads starts at 1 on the first toss, falls to 0.5 when the 
second toss gives a tail, then rises to 0.67, and then falls to 0.5, and 0.4 as we get 
two more tails. After that, the proportion of heads continues to fluctuate but never 
exceeds 0.5 again. 


Proportion of heads 


(a) 


Proportion of heads 


Tosses Tosses 


(b) 


FIGURE 5.1 (a) The proportion of heads in the first 20 tosses of a coin. (b) The proportion of heads in the first 
500 tosses of a coin. 


Suppose we keep tossing the coin until we have made 500 tosses. Figure 5.1(b) 
shows the results. The proportion of tosses that produce heads is quite variable 
at first. As we make more and more tosses, however, the proportion of heads gets 
close to 0.5 and stays there. 


The fact that the proportion of heads in many tosses eventually closes in on 0.5 
is guaranteed by the law of large numbers. This important result says that if we 
observe more and more repetitions of any chance process, the proportion of times 
that a specific outcome occurs approaches a single value. We call this value the 
probability. The previous example confirms that the probability of getting a head 
when we toss a fair coin is 0.5. Probability 0.5 means “occurs half the time in a 
very large number of trials.” 


DEFINITION: Probability 


The probability of any outcome of a chance process is a number between 0 and 1 
that describes the proportion of times the outcome would occur in a very long series 
of repetitions. 
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Outcomes that never occur have probability 0. An outcome that happens on every 
repetition has probability 1. An outcome that happens half the time in a very long series 
of trials has probability 0.5. Of course, we can never observe a probability exactly. 
We could always continue tossing the coin, for example. The mathematics of prob- 
ability is based on imagining what would happen in an indefinitely long series of trials. 

Recall from Chapter 4 that “random” Probability gives us a language to describe the long-term regularity of random 
doesn’t mean haphazard. In statistics, — behavior. The outcome of a coin toss and the gender of the next baby born in a 
(even nine A enenere local hospital are both random. So is the result of a random sample or a random 
assignment. Even life insurance, for example, is based on the fact that deaths 
occur at random among many individuals. 


Life Insurance 
Probability and risk 


How do insurance companies decide how much to charge for life insurance? We 
can’t predict whether a particular person will die in the next year. But the National 
Center for Health Statistics says that the proportion of men aged 20 to 24 years 
who die in any one year is 0.0015. This is the probability that a randomly selected 
young man will die next year. For women that age, the probability of death is about 
0.0005. If an insurance company sells many policies to people aged 20 to 24, it 
knows that it will have to pay off next year on about 0.15% of the policies sold to 
men and on about 0.05% of the policies sold to women. Therefore, the company 
will charge about three times more to insure a man because the probability of 
having to pay is three times higher. 


We often encounter the unpredictable side of randomness in our everyday lives, 
but we rarely see enough repetitions of the same chance process to observe the 
long-run regularity that probability describes. Life insurance companies, casinos, 
and others who make important decisions based on probability rely on the long- 
run predictability of random behavior. 


ig/ CHECK YOUR UNDERSTANDING 


1. According to the Book of Odds Web site www.bookofodds.com, the probability that 
a randomly selected U.S. adult usually eats breakfast is 0.61. 


(a) Explain what probability 0.61 means in this setting. 


(b) Why doesn’t this probability say that if 100 U.S. adults are chosen at random, exactly 
61 of them usually eat breakfast? 


2. Probability isa measure of how likely an outcome is to occur. Match one of the 
probabilities that follow with each statement. Be prepared to defend your answer. 


0 0.01 0.3 0.6 0.99 l 


a) ‘This outcome is impossible. It can never occur. 


b) This outcome is certain. It will occur on every trial. 


( 
( 
(c) This outcome is very unlikely, but it will occur once in a while in a long sequence of 
trials. 

( 


d) This outcome will occur more often than not. 


ACTIVITY 
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Myths about Randomness 


The idea of probability seems straightforward. It answers the question “What 
would happen if we did this many times?” In fact, both the behavior of random 
phenomena and the idea of probability are a bit subtle. 


Investigating randomness 


1. Pretend that you are flipping a fair coin. Without actually flipping a coin, imagine 
the first toss. Write down the result you see in your mind, heads (H) or tails (‘T). 


2. Imagine a second coin flip. Write down the result. 


3. Keep doing this until you have recorded the results of 50 imaginary flips. Write your 
results in groups of 5 to make them easier to read, like this: HTHTH TTHHT, etc. 


4. Arun is a repetition of the same result. In the example in Step 3, there is a 
run of two tails followed by a run of two heads in the first 10 coin flips. Read 
through your 50 imagined coin flips and find the longest run. 


5. Your teacher will draw and label a number line for a class dotplot. Plot the 
length of the longest run you got in Step 4 on the graph. 


6. Use Table D, technology, or a coin to generate a similar list of 50 coin flips. 
Find the longest run that you have. 


7. Your teacher will draw and label a number line with the same scale immedi- 
ately above or below the one in Step 5. Plot the length of the longest run you got 
in Step 6 on the new dotplot. 


8. Compare the distributions of longest run from imagined tosses and random 
tosses. What do you notice? 


The idea of probability is that randomness is predictable in the long run. 
Unfortunately, our intuition about randomness tries to tell us that random phe- 
nomena should also be predictable in the short run. When they aren’t, we look for 
some explanation other than chance variation. 


Runs in Coin Tossing 
What looks random? 


Toss a coin six times and record heads (H) or tails (T) on each toss. Which of the 
following outcomes is more probable? 


HTHTTH TTTHHH 


Almost everyone says that HTHTTH is more probable, because T’TTHHH does 
not “look random.” In fact, both are equally likely. That heads and tails are equally 
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probable says only that about half of a very long sequence of tosses will be heads. 
It doesn’t say that heads and tails must come close to alternating in the short run. 
The coin has no memory. It doesn’t know what past outcomes were, and it can’t 
try to create a balanced sequence. 


The outcome TTTHHH in tossing six coins looks unusual because of the runs 
of 3 straight tails and 3 straight heads. Runs seem “not random” to our intuition 
but are quite common. Here’s a more striking example than tossing coins. 

q 


That Shooter Seems “Hot” 


Chance variation or skill? 


Is there such a thing as a “hot hand” in basketball? Belief that runs must 
result from something other than “just chance” influences behavior. If a 
basketball player makes several consecutive shots, both the fans and her 
teammates believe that she has a “hot hand” and is more likely to make the 
next shot. Several studies have shown that runs of baskets made or missed 
are no more frequent in basketball than would be expected if the result 
of each shot is unrelated to the outcomes of the player’s previous shots. If 
a player makes half her shots in the long run, her made shots and misses 
behave just like tosses of a coin—and that means that runs of makes and 
misses are more common than our intuition expects.” 


Free throws may be a different story. A recent study suggests that players 
who shoot two free throws are slightly more likely to make the second shot 
if they make the first one.’ 


Once, at a convention in Las Vegas, one of the authors roamed the gambling 
floors, watching money disappear into the drop boxes under the tables. You can 
see some interesting human behavior in a casino. When the shooter in the dice 
game called craps rolls several winners in a row, some gamblers think she has 

Don’t confuse the law of large a “hot hand” and bet that she will keep on winning. Others say that “the law 

numbers, which describes the big of averages” means that she must now lose so that wins and losses will balance 

GeO Pron TELE AW out. Believers in the law of averages think that if you toss a coin six times and get 

ae Seve TTTTTT, the next toss must be more likely to give a head. It’s true that in the long 
run heads will appear half the time. What is a myth is that future outcomes must 
make up for an imbalance like six straight tails. 

Coins and dice have no memories. A coin doesn’t know that the first six out- 
comes were tails, and it can’t try to get a head on the next toss to even things out. 
Of course, things do even out in the long run. That’s the law of large numbers in 
action. After 10,000 tosses, the results of the first six tosses don’t matter. They are 
overwhelmed by the results of the next 9994 tosses. 


STEP 
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Aren’t We Due for a Boy? 


Don’t fall for the “law of averages” 


Belief in this phony “law of averages” can lead to 
serious consequences. A few years ago, an advice 
columnist published a letter from a distraught 
mother of eight girls. She and her husband had 
planned to limit their family to four children, but 
they wanted to have at least one boy. When the first 
four children were all girls, they tried again—and 
again and again. After seven straight girls, even her 
doctor had assured her that “the law of averages 
was in our favor 100 to 1.” Unfortunately for this 
couple, having children is like tossing coins. Eight 


“So the law of averages doesn't guarantee me a girl after seven straight boys, but | girls in a row is highly unlikely, but once seven girls 
Gal nlebtdewabion ally alecoune on Menelvay tee have been born, it is not at all unlikely that the next 


child will be a girl—and it was. 


Simulation 


The imitation of chance behavior, based on a model that accurately reflects 
the situation, is called a simulation. You already have some experience with 
simulations. In Chapter 1’s “Hiring Discrimination—It Just Won't Fly!” Activity 
(page 5), you drew beads or slips of paper to imitate a random lottery to choose 
which pilots would become captains. Chapter 4’s “Distracted Driving” Activity 
(page 249) asked you to shuffle and deal piles of cards to mimic the random 
assignment of subjects to treatments. The “1 in 6 wins” game that opened this 
chapter had you roll a die several times to simulate buying 20-ounce sodas and 
looking under the cap. These simulations involved different chance “devices” — 
beads, slips of paper, cards, or dice. But the same basic strategy was followed in 
all three simulations. We can summarize this strategy using our familiar four-step 
process: State, Plan, Do, Conclude. 


PERFORMING A SIMULATION 


State: Ask a question of interest about some chance process. 


Plan: Describe how to use a chance device to imitate one repetition of the 
process. Tell what you will record at the end of each repetition. 


Do: Perform many repetitions of the simulation. 


Conclude: Use the results of your simulation to answer the question of 
interest. 


The following table summarizes this four-step process for each of our previous 
simulations. 
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State How likely is it that a fair lottery Is it plausible that just by the luck of the random as-_ | What’s the probability that 
would result in 5 or more female signment of 48 subjects into two groups of 24 each, | 3 or more of 7 people who buy 
pilots being selected from an initial | 12 of the 15 subjects who were going to miss the a 20-ounce bottle of soda win 
pool of 15 male and 10 female rest area anyway ended up in the cell-phone group, | a prize if each bottle has a 1/6 
pilots? while only 3 of the 15 were assigned to the passen- | chance of saying, “You’re a 

ger conversation group? winner!”? 

Plan Prepare a bag with 25 beads (15 of Using index cards: Write “Stop” on 33 cards and Use a six-sided die to determine 
one color and 10 of another) or “Don’t” on 15 cards. the outcome for each person’s 
25 identical slips of paper (15 labeled | ¢ Using playing cards: Remove jokers, specialty bottle of soda. 

“M” and 10 labeled “F”). Mix well. cards, the ace of spades, and three 2s from the deck. | 6 = wins a prize 
Then without looking, remove 8 J, Q, K, A = miss rest area 1 to 5 = no prize 
beads/slips from the bag. 2 through 10 = stop at rest area 
Shuffle well and deal two piles of 24 cards—the Roll the die seven times, once for 
cell-phone group and the passenger group. each person. 
Record the number of female pilots | Record the number of drivers who miss the rest area | Record the number of people who 
selected. in the cell-phone group. win a prize. 

Do Have each student in class do this | Have each pair of students repeat the process Have each student perform 
several times. several times. several repetitions. 

Conclude In 100 repetitions, 18 yielded 5 or | In 300 repetitions of the random assignment, there Out of 125 total repetitions of the 


more female pilots. So about 18% 
of the time, a fair lottery would 
choose at least 5 female pilots to 
become captains. It seems plau- 
sible that the company carried out 
a fair lottery. 


were only 2 times when 12 or more drivers who 
missed the rest area ended up in the cell-phone 
group. That’s less than 1% of the time! It doesn’t 
seem plausible that the random assignment is the 
explanation for the difference between the groups. 


simulation, there were 15 times 
when three or more of the seven 
people won a prize. So our estimate 
of the probability is 15/125, or 
about 12%. It seems plausible that 
the company is telling the truth. 


So far, we have used physical devices for our simulations. Using random 
numbers from Table D or technology is another option, as the following examples 


illustrate. 


Golden Ticket Parking Lottery 


Simulations with Table D 


Ata local high school, 95 students have permission to park on campus. Each month, 
the student council holds a “golden ticket parking lottery” at a school assembly. The 
two lucky winners are given reserved parking spots next to the school’s main entrance. 
Last month, the winning tickets were drawn by a student council member from the 
AP® Statistics class. When both golden tickets went to members of that same class, 
some people thought the lottery had been rigged. There are 28 students in the AP® 
Statistics class, all of whom are eligible to park on campus. Design and carry out a 
simulation to decide whether it’s plausible that the lottery was carried out fairly. 


STATE: What's the probability that.a fair lottery would result in two winners from the AP® Statistics 


class? 


PLAN: We'lluse Table D to simulate choosing the golden ticket lottery winners. Because there are 
95 eligible students in the lottery, welll label the students in the AP® Statistics class from 01 to 28, 


AP® EXAM TIP On the 
AP® exam, you may be 
asked to describe how you 
will perform a simulation 
using rows of random 


digits. If so, provide a clear 
enough description of your 
simulation process for the 
reader to get the same 
results you did from only 
your written explanation. 
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and the remaining students from 29 to 95. Numbers from 96 to OO will be skipped. Moving left to right 
across the row, we'll look at pairs of digits until we come across two different labels from 01 to 95. The 
two students with these labels will win the prime parking spaces. We will record the number of winners 
from the AP® Statistics class. 


DO: Let’s perform many repetitions of our simulation. We'll use Table D starting at line 139. The 
digits from that row are shown below. We have drawn vertical bars to separate the pairs of digits. 
Underneath each pair, we have marked a-V if the chosen student is in the AP® Statistics class, X if 
the student is not in the class, and “skip” ifthe number isn't between 01 and 95 or if that student 
was already selected. (Note that if the consecutive “70” labels had beenin the same repetition, we 
would have skipped the second one.) We also recorded the number of students in the AP® Statistics 
class for each repetition. 


Rep 1 | Rep2 | Rep3 | Rep4 | Rep5d | Rep6 | Rep7 | Reps | Rep9Y | Rep 10 

55158 | 89194 | 04170 | 70184 | 10198143) 56135 | 69134 | 48139 | 45117 | 19112 

xx|xx| ax | xX x \skipx| xx | xx | xx | xv | vv 
0 0 1 0 1 0 0 0 1 2 


In the first 10 repetitions, there was one time when the two winners were both from the AP® Statis- 
tics class. But 10 isn’t many repetitions of the simulation. Continuing where we left off, 


Rep11 |Rep 12|Rep 13|Rep 14|Rep 15|Rep 16) Rep 17 Rep 18 Rep 19 |Rep 20 
97151132 | 58113 | 04184 | 51144 | 72132 | 18119 | 40100136 | 00124128 | 96176173 | 59164 


skip X X|X VJ 4% X|X x|x X|%W% |x skip x|SkiD YW WIskip x x] x x 
0 1 1 0 0 2 0 2 0 0 


So after 20 repetitions, there have been 3 times when both winners were in the AP® Statistics 
class. If we keep going for 30 more repetitions (to bring our total to 50), we find 28 more “No” and 2 
more “Yes” results. All totaled, that’s 5 “Yes” and 45 “No” results. 

CONCLUDE: Inour simulation of a fair lottery, both winners came from the AP® Statistics class 
in 10% of the repetitions. So about 1 in every 10 times the student council holds the golden ticket 
lottery, this will happen just by chance. It seems plausible that the lottery was conducted fairly. 


For Practice Try Exercise 


In the previous example, we could have saved a little time by using 
randInt (1,95) repeatedly instead of Table D (so we wouldn’t have to worry 
about numbers 96 to 00). We'll take this alternate approach in the next example. 


NASCAR Cards and Cereal Boxes 


Simulations with technology 


In an attempt to increase sales, a breakfast cereal company decides to offer a NAS- 
CAR promotion. Each box of cereal will contain a collectible card featuring one 
of these NASCAR drivers: Jeff Gordon, Dale Earnhardt, Jr., Tony Stewart, Danica 
Patrick, or Jimmie Johnson. The company says that each of the 5 cards is equally 
likely to appear in any box of cereal. A NASCAR fan decides to keep buying boxes 
of the cereal until she has all 5 drivers’ cards. She is surprised when it takes her 
23 boxes to get the full set of cards. Should she be surprised? Design and carry out 
a simulation to help answer this question. 
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STATE: Whatis the probability that it will take 23 or more boxes to get a full set of 5 
NASCAR collectible cards? 


PLAN: We need five numbers to represent the five possible cards. Let's let 1 = Jeff 
Gordon; 2 = Dale Earnhardt, Jr.; 3 = Tony Stewart; 4 = Danica Patrick; and 5 = Jimmie 
Johnson. We'll use randInt(1,5) to simulate buying one box of cereal and looking at which 
card is inside. Because we want a full set of cards, we'll keep pressing Enter until we get 
all five of the labels from 1 to 5. We'll record the number of boxes that we had to open. 


DO: It’s time to perform many repetitions of the simulation. Here are our first few 


results: 

Rep1: 352152354 boxes Rep2: 5125141412224453 16boxes 
Rep3: 5552412153 10boxes Rep4: 435351115315452 15boxes 
Rep5: 3322124334223332334225 22boxes 


The Fathom dotplot shows the number of boxes we had to buy in 50 repetitions of the 
simulation. 


NASCAR cereal problem _{ Dot Piot i 


CONCLUDE: We never had to buy more than 22 boxes to get the full set of NASCAR drivers’ 
cards in 50 repetitions of our simulation. So our estimate of the probability that it takes 23 or 
more boxes to get a full set is roughly O. The NASCAR fan should be surprised by how many boxes she 
had to buy. 


For Practice Try Exercise 


In the golden ticket lottery example, we ignored repeated numbers from 01 to 
95 within a given repetition. That’s because the chance process involved sampling 
students without replacement. In the NASCAR example, we allowed repeated 
numbers from | to 5 in a given repetition. That’s because we are selecting a small 
number of cards from a very large population of cards in thousands of cereal box- 
es. So the probability of getting, say, a Danica Patrick card in the next box of cereal 
is still very close to 1/5 even if we have already selected a Danica Patrick card. 


What don’t these simulations tell us? For the golden ticket parking 
THINK 1 3 ; 
ottery, we concluded that it’s plausible the drawing was done fairly. Does that 
ABOUT IT mean the lottery was conducted fairly? Not necessarily. All we did was estimate 
that the probability of getting two winners from the AP® Statistics class was about 
10% if the drawing was fair. So the result isn’t unlikely enough to convince us that 
the lottery was rigged. What about the cereal box simulation? It took our NASCAR 
fan 23 boxes to complete the set of 5 cards. Does that mean the company didn’t 
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tell the truth about how the cards were distributed? Not necessarily. Our simula- 
tion says that it’s very unlikely for someone to have to buy 23 boxes to get a full 
set if each card is equally likely to appear in a box of cereal. The evidence sug- 
gests that the company’s statement is incorrect. It’s still possible, however, that the 
NASCAR fan was just very unlucky. 


3 


CHECK YOUR UNDERSTANDING 


1. Refer to the golden ticket parking lottery example. At the following month’s school 
assembly, the two lucky winners were once again members of the AP® Statistics class. 
This raised suspicions about how the lottery was being conducted. How would you 
modify the simulation in the example to estimate the probability of getting two win- 
ners from the AP® Statistics class in back-to-back months just by chance? 


2. Refer to the NASCAR and breakfast cereal example. What if the cereal company 
decided to make it harder to get some drivers’ cards than others? For instance, suppose 
the chance that each card appears in a box of the cereal is Jeff Gordon, 10%; Dale 
Earnhardt, Jr., 30%; Tony Stewart, 20%; Danica Patrick, 25%; and Jimmie Johnson, 15%. 
How would you modify the simulation in the example to estimate the chance that a fan 
would have to buy 23 or more boxes to get the full set? 


Summary 


e <A chance process has outcomes that we cannot predict but that nonethe- 
less have a regular distribution in very many repetitions. The law of large 
numbers says that the proportion of times that a particular outcome occurs 
in many repetitions will approach a single number. This long-run relative 
frequency of a chance outcome is its probability. A probability is a number 
between 0 (never occurs) and | (always occurs). 


e Probabilities describe only what happens in the long run. Short runs of ran- 
dom phenomena like tossing coins or shooting a basketball often don’t look 
random to us because they do not show the regularity that emerges only in 
very many repetitions. 


e A simulation is an imitation of chance behavior, most often carried out with 
random numbers. To perform a simulation, follow the four-step process: 


Sie 4 STATE: Ask a question of interest about some chance process. 
. PLAN: Describe how to use a chance device to imitate one repetition of the 
process. Tell what you will record at the end of each repetition. 


DO: Perform many repetitions of the simulation. 


CONCLUDE: Use the results of your simulation to answer the question 
of interest. 
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CHAPTER 5 


Exercises 


Liar, liar! Sometimes police use a lie detector (also 
known as a polygraph) to help determine whether 

a suspect is telling the truth. A lie detector test isn’t 
foolproof —sometimes it suggests that a person is 
lying when he or she is actually telling the truth (a 
“false positive”). Other times, the test says that the 
suspect is being truthful when the person is actually 
lying (a “false negative”). For one brand of polygraph 
machine, the probability of a false positive is 0.08. 


Interpret this probability as a long-run relative frequency. 


Which is a more serious error in this case: a false 
positive or a false negative? Justify your answer. 


Mammograms Many women choose to have annual 
mammograms to screen for breast cancer after age 
40. A mammogram isn’t foolproof. Sometimes the 
test suggests that a woman has breast cancer when 
she really doesn’t (a “false positive”). Other times 

the test says that a woman doesn’t have breast cancer 
when she actually does (a “false negative”). Suppose 
the false negative rate for a mammogram is 0.10. 


Interpret this probability as a long-run relative frequency. 


Which is a more serious error in this case: a false 
positive or a false negative? Justify your answer. 


Genetics Suppose a married man and woman 
both carry a gene for cystic fibrosis but don’t have 
the disease themselves. According to the laws of 
genetics, the probability that their first child will 
develop cystic fibrosis is 0.25. 


Explain what this probability means. 


If the couple has 4 children, is one of them guaran- 
teed to get cystic fibrosis? Explain. 


Texas hold ’em In the popular ‘Texas hold ’em vari- 
ety of poker, players make their best five-card poker 
hand by combining the two cards they are dealt with 
three of five cards available to all players. You read in 
a book on poker that if you hold a pair (two cards of 
the same rank) in your hand, the probability of get- 
ting four of a kind is 88/1000. 


Explain what this probability means. 


If you play 1000 such hands, will you get four of a 
kind in exactly 88 of them? Explain. 


Spinning a quarter With your forefinger, hold a 
new quarter (with a state featured on the reverse) 
upright, on its edge, on a hard surface. Then flick it 
with your other forefinger so that it spins for some 
time before it falls and comes to rest. Spin the coin a 
total of 25 times, and record the results. 
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What’s your estimate for the probability of heads? Why? 
Explain how you could get an even better estimate. 


Nickels falling over You may feel it’s obvious that 
the probability of a head in tossing a coin is about 
1/2 because the coin has two faces. Such opinions 
are not always correct. Stand a nickel on edge on a 
hard, flat surface. Pound the surface with your hand 
so that the nickel falls over. Do this 25 times, and 
record the results. 


What’s your estimate for the probability that the coin 
falls heads up? Why? 


Explain how you could get an even better estimate. 


Free throws ‘The figure below shows the results of a 
virtual basketball player shooting several free throws. 
Explain what this graph says about chance behavior 
in the short run and long run. 
—______—_——— 
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Shots 


Keep on tossing ‘The figure below shows the results 
of two different sets of 5000 coin tosses. Explain what 
this graph says about chance behavior in the short 
run and the long run. 
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Due for a hit A very good professional baseball 
player gets a hit about 35% of the time over an entire 
season. After the player failed to hit safely in six 
straight at-bats, a ‘T'V commentator said, “He is due 
for a hit by the law of averages.” Is that right? Why? 


Cold weather coming ATV weather man, predict- 
ing a colder-than-normal winter, said, “First, in 
looking at the past few winters, there has been a lack 
of really cold weather. Even though we are not sup- 
posed to use the law of averages, we are due.” Do you 
think that “due by the law of averages” makes sense 
in talking about the weather? Why or why not? 


Playing “Pick 4” The Pick 4 games in many state 
lotteries announce a four-digit winning number 
each day. You can think of the winning number as a 
four-digit group from a table of random digits. You 
win (or share) the jackpot if your choice matches 
the winning number. The winnings are divided 
among all players who matched the winning num- 
ber. That suggests a way to get an edge. 


The winning number might be, for example, either 
2873 or 9999. Explain why these two outcomes have 
exactly the same probability. 


If you asked many people whether 2873 or 9999 

is more likely to be the randomly chosen winning 
number, most would favor one of them. Use the 
information in this section to say which one and to 
explain why. How might this affect the four-digit 
number you would choose? 


An unenlightened gambler 


A gambler knows that red and black are equally 
likely to occur on each spin of a roulette wheel. He 
observes five consecutive reds occur and bets heavily 
on black at the next spin. Asked why, he explains that 
black is “due by the law of averages.” Explain to the 
gambler what is wrong with this reasoning. 


After hearing you explain why red and black are still 
equally likely after five reds on the roulette wheel, 
the gambler moves to a poker game. He is dealt five 
straight red cards. He remembers what you said and 
assumes that the next card dealt in the same hand 

is equally likely to be red or black. Is the gambler 
right or wrong, and why? 


Free throws A basketball player has probability 
0.75 of making a free throw. Explain how you 
would use each chance device to simulate one 
free throw by the player. 


A standard deck of playing cards 
‘Table D of random digits 


A calculator or computer’s random integer generator 
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Stoplight On her drive to work every day, Ilana 
passes through an intersection with a traffic light. 
The light has probability 1/3 of being green when 
she gets to the intersection. Explain how you would 
use each chance device to simulate whether the light 
is red or green on a given day. 


A six-sided die 
Table D of random digits 
A calculator or computer’s random integer generator 


Simulation blunders Explain what’s wrong with 
each of the following simulation designs. 


A roulette wheel has 38 colored slots—18 red, 18 
black, and 2 green. To simulate one spin of the 
wheel, let numbers 00 to 18 represent red, 19 to 37 
represent black, and 38 to 40 represent green. 


About 10% of U.S. adults are left-handed. ‘To simu- 
late randomly selecting one adult at a time until 
you find a left-hander, use two digits. Let 00 to 09 
represent being left-handed and 10 to 99 represent 
being right-handed. Move across a row in Table D, 
two digits at a time, skipping any numbers that have 
already appeared, until you find a number between 
00 and 09. Record the number of people selected. 


Simulation blunders Explain what’s wrong with 
each of the following simulation designs. 


According to the Centers for Disease Control and 
Prevention, about 36% of U.S. adults were obese in 
2012. To simulate choosing 8 adults at random and 
seeing how many are obese, we could use two digits. 
Let 00 to 35 represent obese and 36 to 99 represent 
not obese. Move across a row in Table D, two digits 
at a time, until you find 8 distinct numbers (no re- 
peats). Record the number of obese people selected. 


Assume that the probability of a newborn being 

a boy is 0.5. To simulate choosing a random sample 
of 9 babies who were born at a local hospital today 
and observing their gender, use one digit. Use 
randInt (0,9) on your calculator to determine 
how many babies in the sample are male. 


Is this valid? Determine whether each of the follow- 
ing simulation designs is valid. Justify your answer. 


According to a recent poll, 75% of American adults 
regularly recycle. To simulate choosing a random 
sample of 100 U.S. adults and seeing how many of 
them recycle, roll a 4-sided die 100 times. A result of 
1, 2, or 3 means the person recycles; a + means that 
the person doesn’t recycle. 


An archer hits the center of the target with 60% of 
her shots. ‘To simulate having her shoot 10 times, use 
a coin. Flip the coin once for each of the 10 shots. If 
it lands heads, then she hits the center of the target. 
If the coin lands tails, she doesn’t. 
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Is this valid? Determine whether each of the follow- 
ing simulation designs is valid. Justify your answer. 


According to a recent survey, 50% of people aged 
13 and older in the United States are addicted to 
texting. To simulate choosing a random sample of 
20 people in this population and seeing how many 
of them are addicted to texting, use a deck of cards. 
Shuffle the deck well, and then draw one card at a 
time. A red card means that person is addicted to 
texting; a black card means he isn’t. Continue until 
you have drawn 20 cards (without replacement) for 
the sample. 


A tennis player gets 95% of his serves in play during 
practice (that is, the ball doesn’t go out of bounds). 
To simulate the player hitting 5 serves, look at 5 
pairs of digits going across a row in Table D. If 

the number is between 00 and 94, the serve is in; 
numbers between 95 and 99 indicate that the serve 
is out. 


Airport security ‘The Transportation Security 
Administration (‘TSA) is responsible for airport safety. 
On some flights, TSA officers randomly select passen- 
gers for an extra security check prior to boarding. One 
such flight had 76 passengers— 12 in first class and 64 
in coach class. Some passengers were surprised when 
none of the 10 passengers chosen for screening were 
seated in first class. We can use a simulation to see if 
this result is likely to happen by chance. 


State the question of interest using the language of 
probability. 


How would you use random digits to imitate one repeti- 
tion of the process? What variable would you measure? 


Use the line of random digits below to perform one 
repetition. Copy these digits onto your paper. Mark 
directly on or above them to show how you deter- 
mined the outcomes of the chance process. 


71487 09984 29077 14863 61683 47052 62224 51025 


(d) 


20. 


In 100 repetitions of the simulation, there were 15 times 
when none of the 10 passengers chosen was seated in 
first class. What conclusion would you draw? 


Scrabble In the game of Scrabble, each player begins 
by drawing 7 tiles from a bag containing 100 tiles. 
There are 42 vowels, 56 consonants, and 2 blank tiles 
in the bag. Cait chooses her 7 tiles and is surprised to 
discover that all of them are vowels. We can use a simu- 
lation to see if this result is likely to happen by chance. 


State the question of interest using the language of 
probability. 


How would you use random digits to imitate one repeti- 
tion of the process? What variable would you measure? 


(c) 


PROBABILITY: WHAT ARE THE CHANCES? 


Use the line of random digits below to perform one 
repetition. Copy these digits onto your paper. Mark 
directly on or above them to show how you deter- 
mined the outcomes of the chance process. 


00694 05977 19664 65441 20903 62371 22725 53340 


(d) 


Pale 


PB 


In 1000 repetitions of the simulation, there were 2 
times when all 7 tiles were vowels. What conclusion 
would you draw? 


The birthday problem What's the probability 
that in a randomly selected group of 30 unrelated 
people, at least two have the same birthday? Let’s 
make two assumptions to simplify the problem. 
First, we'll ignore the possibility of a February 

29 birthday. Second, we assume that a randomly 
chosen person is equally likely to be born on 
each of the remaining 365 days of the year. 


How would you use random digits to imitate one 
repetition of the process? What variable would you 
measure? 


Use technology to perform 5 repetitions. Record the 
outcome of each repetition. 


Would you be surprised to learn that the theoretical 
probability is 0.71? Why or why not? 


. Monty Hall problem In Parade magazine, a reader 


posed the following question to Marilyn vos Savant 
and the “Ask Marilyn” column: 


Suppose you're on a game show, and youre given 
the choice of three doors. Behind one door is a car, 
behind the others, goats. You pick a door, say #1, 
and the host, who knows what’s behind the doors, 
opens another door, say #3, which has a goat. He 
says to you, “Do you want to pick door #2?” Is it to 
your advantage to switch your choice of doors?* 


The game show in question was Let’s Make a Deal and 
the host was Monty Hall. Here’s the first part of Mari- 
lyn’s response: “Yes; you should switch. The first door 
has a 1/3 chance of winning, but the second door has a 
2/3 chance.” Thousands of readers wrote to Marilyn to 
disagree with her answer. But she held her ground. 


Use an online Let’s Make a Deal applet to perform 
at least 50 repetitions of the simulation. Record 
whether you stay or switch (try to do each about half 
the time) and the outcome of each repetition. 


Do you agree with Marilyn or her readers? Explain. 


Recycling Do most teens recycle? To find out, an 
AP® Statistics class asked an SRS of 100 students at 
their school whether they regularly recycle. In the 
sample, 55 students said that they recycle. Is this 
convincing evidence that more than half of the stu- 
dents at the school would say they regularly recycle? 
The Fathom dotplot below shows the results of 


(a) 


(b) 


Dee 


STEP A 
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taking 200 SRSs of 100 students from a population 
in which the true proportion who recycle is 0.50. 
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Explain why the sample result does not give convinc- 
ing evidence that more than half of the school’s 
students recycle. 


Suppose instead that 63 students in the class’s sample 
had said “Yes.” Explain why this result would give strong 
evidence that a majority of the school’s students recycle. 


Brushing teeth, wasting water? A recent study 
reported that fewer than half of young adults turn off 
the water while brushing their teeth. Is the same true 
for teenagers? ‘To find out, a group of statistics stu- 
dents asked an SRS of 60 students at their school if 
they usually brush with the water off. In the sample, 
27 students said “No.” The Fathom dotplot below 
shows the results of taking 200 SRSs of 60 students 
from a population in which the true proportion who 
brush with the water off is 0.50. 
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Proportion who say “No” 


Explain why the sample result does not give convinc- 
ing evidence that fewer than half of the school’s 
students brush their teeth with the water off. 


Suppose instead that 18 students in the class’s sample 
had said “No.” Explain why this result would give 
strong evidence that fewer than 50% of the school’s 
students brush their teeth with the water off. 


Color-blind men About 7% of men in the United 
States have some form of red-green color blindness. 
Suppose we randomly select 4 U.S. adult males. What's 
the probability that at least one of them is red-green 
color-blind? Design and carry out a simulation to 
answer this question. Follow the four-step process. 


26. Lotto In the United Kingdom’s Lotto game, a 
player picks six numbers from | to 49 for each 
ticket. Rosemary bought one ticket for herself. She 
had the lottery computer randomly select the six 
numbers. When the six winning numbers were 
drawn, Rosemary was surprised to find that none 
of these numbers appeared on the Lotto ticket she 
had bought. Should she be? Design and carry out 
a simulation to answer this question. Follow the 
four-step process. 


Color-blind men Refer to Exercise 25. Suppose we 
randomly select one U.S. adult male at a time until 
we find one who is red-green color-blind. Should we 
be surprised if it takes us 20 or more men? Design and 
carry out a simulation to answer this question. Follow 
the four-step process. 


peer 
A 


28. Scrabble Refer to Exercise 20. About 3% of the 
"A time, the first player in Scrabble can “bingo” by 
playing all 7 tiles on the first turn. Should we be sur- 
prised if it takes 30 or more games for this to happen? 
Design and carry out a simulation to answer this 
question. Follow the four-step process. 


Random assignment Researchers recruited 20 
volunteers—8 men and 12 women —to take part in 
an experiment. They randomly assigned the subjects 
into two groups of 10 people each. To their surprise, 
6 of the 8 men were randomly assigned to the same 
treatment. Should they be surprised? Design and 
carry out a simulation to estimate the probability that 
the random assignment puts 6 or more men in the 
same group. Follow the four-step process. 


STEP 2). 
4 


30. ‘Taking the train According to New Jersey Transit, 
the 8:00 a.m. weekday train from Princeton to New 
York City has a 90% chance of arriving on time. ‘To 
test this claim, an auditor chooses 6 weekdays at 
random during a month to ride this train. The train 
arrives late on 2 of those days. Does the auditor have 
convincing evidence that the company’s claim isn’t 
true? Design and carry out a simulation to estimate 
the probability that a train with a 90% chance of 
arriving on time each day would be late on 2 or more 
of 6 days. Follow the four-step process. 


Multiple choice: Select the best answer for Exercises 31 
to 36. 


31. You read in a book about bridge that the probability 
that each of the four players is dealt exactly one ace 
is about 0.11. This means that 


(a) in every 100 bridge deals, each player has one ace 
exactly 11 times. 


(b) in | million bridge deals, the number of deals on 
which each player has one ace will be exactly 110,000. 
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(a) 
(b) 
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in a very large number of bridge deals, the percent of 
deals on which each player has one ace will be very 
close to 11%. 


in a very large number of bridge deals, the average 


number of aces in a hand will be very close to 0.11. 


If each player gets an ace in only 2 of the first 50 
deals, then each player should get an ace in more 


than 11% of the next 50 deals. 


If I toss a fair coin five times and the outcomes are 
TTTTT, then the probability that tails appears on 
the next toss is 

0.5. (c) greater than 0.5. (e) 1. 


less than 0.5. (d) 0. 


Exercises 33 to 35 refer to the following setting. A basketball 
player claims to make 47% of her shots from the field. We 
want to simulate the player taking sets of 10 shots, assum- 
ing that her claim is true. 


B33 


‘To simulate the number of makes in 10 shot at- 
tempts, you would perform the simulation as follows: 


Use 10 random one-digit numbers, where 0-4 are a 
nake and 5—9 are a miss. 


=| 


se 10 random two-digit numbers, where 00-46 are 
make and 47-99 are a miss. 


U 
al 
Use 10 random two-digit numbers, where 00-47 are 
a make and 48-99 are a miss. 

U 


se +7 random one-digit numbers, where 0 is a 
make and 1-9 are a miss. 


Use 47 random two-digit numbers, where 00-46 are 
a make and 47-99 are a miss. 


‘Twenty-five repetitions of the simulation were per- 
formed. The simulated number of makes in each set 
of 10 shots was recorded on the dotplot below. What 
is the approximate probability that a 47% shooter 
makes 5 or more shots in 10 attempts? 


eke 4 5 6 7 
Number of Made Shots 
5/10 (b) 3/10 (c) 12/25 (d) 3/25 (e) 47/100 


Suppose this player attempts 10 shots in a game and 
only makes 3 of them. Does this provide convincing 
evidence that she is less than a 47% shooter? 

Yes, because 3/10 (30%) is less than 47%. 


Yes, because she never made 47% of her shots in the 
simulation. 


No, because it is plausible that she would make 3 or 
fewer shots by chance alone. 


Descriptive Statistics: 
Waiting n 


No 


PROBABILITY: WHAT ARE THE CHANCES? 


No, because the simulation was only repeated 25 times. 


No, because the distribution is approximately 
symmetric. 

Ten percent of U.S. households contain 5 or more 
people. You want to simulate choosing a household 
at random and recording “Yes” if it contains 5 or 
more people. Which of these are correct assignments 
of digits for this simulation? 

Odd = Yes; Even = No 

0 = Yes; 1-9 = No 

0-5 = Yes; 6-9 = No 

0-4 = Yes; 5-9 = No 

None of these 

Are you feeling stressed? (4.1) A Gallup Poll asked 
whether people experienced stress “a lot of the day 
yesterday.” About 41 percent said they did. Gallup’s 
report said, “Results are based on telephone inter- 
views conducted ... Jan. 1—Dec. 31, 2012, with a ran- 


dom sample of 353,564 adults aged 18 and older.” 
Identify the population and the sample. 


Explain how undercoverage could lead to bias in this 
survey. 


Waiting to park (1.3, 4.2) Do drivers take longer to 
leave their parking spaces when someone is waiting? 
Researchers hung out in a parking lot and collected 
some data. The graphs and numerical summaries 
below display information about how long it took 
drivers to exit their spaces. 

Write a few sentences comparing these distributions. 
Can we conclude that having someone waiting 
causes drivers to leave their spaces more slowly? Why 
or why not? 


90 4 
80 4 
70 + 


60 7 


Time (seconds) 


50 7 


40 4 


30 44 


Someone waiting? 


Time 


Mean StDev Minimum Q, Median Q; Maximum 


20 44.42 14.10 33.76 35.61 39.56 48.48 84.92 


Yes 20 54.11 14.39 41.61 43.41 47.14 66.44 85.97 
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ae: Probability Rules 


WHAT YOU WILL LEARN __ By the end of the section, you will be able to: 
e Describe a probability model for a chance process. e Use a two-way table or Venn diagram to model a chance 


e Use basic probability rules, including the complement process and calculate probabilities involving two events. 
rule and the addition rule for mutually exclusive events. e Use the general addition rule to calculate probabilities. 


The idea of probability rests on the fact that chance behavior is predictable in the 
long run. In Section 5.1, we used simulation to imitate chance behavior. Do we 
always need to repeat a chance process many times to determine the probability 
of a particular outcome? Fortunately, the answer is no. 


Probability Models 


In Chapter 2, we saw that a Normal density curve could be used to model some 
distributions of data. In Chapter 3, we modeled linear relationships between two 
quantitative variables with a least-squares regression line. Now we’re ready to 
develop a model for chance behavior. 

Let’s start with a very simple chance process: tossing a coin once. When we toss 
a coin, we can’t know the outcome in advance. What do we know? We are willing 
to say that the outcome will be either heads or tails. We believe that each of these 
outcomes has probability 1/2. This description of coin tossing has two parts: 


e A list of possible outcomes (the sample space S) 
e A probability for each outcome 


Such a description is the basis for a probability model. Here is the basic vocabu- 
lary we use. 


DEFINITION: Sample space, probability model 
The sample space S of a chance process is the set of all possible outcomes. 


A probability model is a description of some chance process that consists of two 
parts: a sample space S and a probability for each outcome. 


A sample space S can be very simple or very complex. When we toss a coin 
once, there are only two possible outcomes, heads and tails. We can write the 
sample space using set notation as S = {H, T}. When Gallup draws a random 
sample of 1523 adults and asks a survey question, the sample space contains all 
possible sets of responses from 1523 of the 235 million adults in the country. 
This S is extremely large. Each member of S lists the answers from one possible 
sample, which explains the term sample space. 

Let’s look at how to set up a probability model in a familiar setting—rolling a 
pair of dice. 
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Roll the Dice 
Building a probability model 


Many board games involve rolling dice. Imagine rolling two fair, six-sided dice— 
one that’s red and one that’s green. 


PROBLEM: Give a probability model for this chance process. 


SOLUTION: There are 36 possible outcomes when we roll two dice and record the number of spots 
showing on the up-faces. Figure 5.2 displays these outcomes. They make up the sample space 9. If the 
dice are perfectly balanced, all 36 outcomes will be equally likely. That is, each of the 36 outcomes will 
come up on one-thirty-sixth of all rolls in the long run. So each outcome has probability 1/36. 


FIGURE 5.2 The 36 possible outcomes in rolling two dice. If the dice are carefully made, all of 
these outcomes have the same probability. 


For Practice Try Exercise 


A probability model does more than just assign a probability to each outcome. It 
allows us to find the probability of any collection of outcomes, which we call an event. 


DEFINITION: Event 


An event is any collection of outcomes from some chance process. That is, an event 
is a subset of the sample space. Events are usually designated by capital letters, like 
A, B, C, and so on. 


If A is any event, we write its probability as P(A). In the dice-rolling example, 
suppose we define event A as “sum is 5.” What’s P(A), the probability that event A 
occurs? ‘There are four outcomes in event A: 


Bi) BY BW BU 


Because each of these outcomes has probability 1/36, P(A) = 4/36. Now consider 
event B: sum is not 5. To find P(B), we could list all the outcomes that make up 
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event B, but that would take a while. Fortunately, there’s an easier way. Of the 36 
equally likely outcomes in Figure 5.2, event A (sum is 5) occurs in 4 of them. So 
event A does not occur in 32 of these outcomes. Then P(B) = P(sum isn’t 5) = 
P(not A) = 32/36. Notice that P(A) + P(B) = 1. 

Let’s consider one more event, which we'll call C: sum is 6. The outcomes in 
event C are 


BL BY BY BY Be 
So P(C) = 5/36. What’s the probability that we get a sum of 5 or 6, that is, P(A 


or C)? Because these two events have no outcomes in common, we can add the 
probabilities of the individual events: 


P(sum is 5 or sum is 6) = P(sum is 5) + P(sum is 6) = 4/36 + 5/36 = 9/36 
In other words, P(A or C) = P(A) + P(C). 


Basic Rules of Probability 


Our dice-rolling scenario revealed some basic rules that any probability model 
must obey: 


¢ = The probability of any event is a number between 0 and 1. The probability of an event 
is the long-run proportion of repetitions on which that event occurs. Any proportion 
is a number between 0 and 1, so any probability is also a number between 0 and 1. 
An event with probability 0 never occurs, and an event with probability 1 occurs 
on every trial. An event with probability 0.5 occurs in half the trials in the long run. 

e All possible outcomes together must have probabilities that add up to 1. Be- 
cause some outcome must occur on every trial, the sum of the probabilities 
for all possible outcomes must be exactly 1. 

e [fall outcomes in the sample space are equally likely, the probability that event 
A occurs can be found using the formula 


number of outcomes corresponding to event A 


P(A) = 


total number of outcomes in sample space 


e The probability that an event does not occur is | minus the probability that the 
event does occur. If an event occurs in (say) 70% of all trials, it fails to occur in 
the other 30%. The probability that an event occurs and the probability that it 
does not occur always add to 100%, or 1. (This explains why P(sum isn’t 5) = 
1 — P(sum is 5) in the dice-rolling example.) We refer to the event “not A” as 
the complement of A and denote it by A®. 


e If two events have no outcomes in common, the probability that one or the 
other occurs is the sum of their individual probabilities. If one event occurs 
in 40% of all trials, a different event occurs in 25% of all trials, and the two 
can never occur together, then one or the other occurs on 65% of all trials 
because 40% + 25% = 65%. When two events have no outcomes in com- 
mon, we refer to them as mutually exclusive or disjoint. 


DEFINITION: Mutually exclusive (disjoint) 


Two events A and B are mutually exclusive (disjoint) if they have no outcomes in 
common and so can never occur together—that is, if P(A and B) = 0. 
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We can summarize the basic probability rules more concisely in symbolic form. 


BASIC PROBABILITY RULES 


e For any event A, 0 = P(A) = 1. 
e IfS is the sample space in a probability model, P(S) = 1. 
e Inthe case of equally likely outcomes, 


number of outcomes corresponding to event A 


P(A) = 


total number of outcomes in sample space 
¢ Complement rule: P(A°) = 1 — P(A). 


e Addition rule for mutually exclusive events: If A and B are mutually 
exclusive, P(A or B) = P(A) + P(B). 


The earlier dice-rolling example involved equally likely outcomes. Here’s an 
. example that illustrates use of the basic probability rules when the outcomes of a 
) chance process are not equally likely. 


Distance Learning 
Applying probability rules 


Distance-learning courses are rapidly gaining popularity among college students. 
Randomly select an undergraduate student who is taking a distance-learning 
course for credit, and record the student’s age. Here is the probability model:° 


Age group (yr): 18 to 23 24 to 29 30 to 39 40 or over 
Probability: 0.57 0.17 0.14 0.12 


PROBLEM: 

(a) Show that this is a legitimate probability model. 

(b) Find the probability that the chosen student is not in the traditional college age group (18 to 
23 years). 


SOLUTION: 
(a) The probability of each outcome is a number between O and 1, and the probabilities of all the pos- 
sible outcomes add to 1, 50 this is a legitimate probability model. 


(b) There are two ways to find this probability. By the complement rule, 
P(not 18 to 23 years) = 1 — P(18 to 23 years)= 1 — 0.57 = 0.43 
That is, if 57% of distance learners are 18 to 23 years old, then the remaining 43% are not in this 
age group. 
Using the addition rule for mutually exclusive events, 
P(not 18 to 23 years) = P(24 to 29 years) + P(30 to 39 years) + P(40 years or over) 
=0.174+ 0.14 + 0.12 =0.43 
There is a43% chance that the chosen student is not in the traditional college age group. 


For Practice Try Exercise 


Section 5.2 Probability Rules 44,309 


CHECK YOUR UNDERSTANDING 


Choose an American adult at random. Define two events: 


A = the person has a cholesterol level of 240 milligrams per deciliter of blood (mg/dl) or 
above (high cholesterol) 

B = the person has a cholesterol level of 200 to 239 mg/dl (borderline high cholesterol) 

According to the American Heart Association, P(A) = 0.16 and P(B) = 0.29. 

1. Explain why events A and B are mutually exclusive. 

2. Say in plain language what the event “A or B” is. What is P(A or B)? 


3. IfC is the event that the person chosen has normal cholesterol (below 200 mg/dl), 
what’s P(C)? 


Two-Way Tables, Probability, 
and the General Addition Rule 


When we're trying to find probabilities involving two events, a two-way table can 
display the sample space in a way that makes probability calculations easier. 


fe 


Who Has Pierced Ears? 
Two-way tables and probability 


Students in a college statistics class wanted to find out how common it is for young 
adults to have their ears pierced. They recorded data on two variables—gender and 
whether the student had a pierced ear—for all 178 people in the class. The two- 
way table below displays the data. 


Gender 
Pierced Ears? Male Female Total 
Yes 19 84 103 
No 71 4 75 
Total 90 88 178 


PROBLEM: Suppose we choose a student from the class at random. Find the probability that 
the student 


(a) has pierced ears. 

(b) is male and has pierced ears. 

(c) is male or has pierced ears. 

SOLUTION: We'lldefine events A: is male and B: has pierced ears. 


(a) Because each of the 178 students in the class is equally likely to be chosen, and there are 103 
students with pierced ears, P(pierced ears) = P(B) = 103/178. 

(b) We want to find P(male and pierced ears), that is, P(A and B). Looking at the intersection of 

the “Male” column and “Yes” row, we see that there are 19 males with pierced ears. So P(male and 
pierced ears) = P(Aand B) = 19/178. 

(c) This time, we're interested in P(male or pierced ears), that is, P(A or B). (Note the mathematical use 
of the word orhere—the person could be a male or have pierced ears or both.) From the two-way table, 
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we see that there are 90 males inthe class, so P(A) = 90/178. Can we just add P(A) to F{B) to get 
the correct answer? No! These two events are not mutually exclusive, because there are 19 males with 
pierced ears. (Ifwe did add the two probabilities, we'd get 90/178 + 103/178 = 193/178, whichis 
clearly wrong, because the probability is bigger than 1!) From the two-way table, we see that there are 
19+ 71 + 64 = 174 students who are male orhave pierced ears. So P(A or B) = 174/178. 


For Practice Try Exercise 


When we found the probability of 
getting a male with pierced ears in 

the example, we could have described 
this as either P(A and B) or Band A). 
Why? Because “A and B” describes the 
same event as “Band A.” Likewise, 
P(A or B) is the same as P(B or A). 
Don’t get so caught up in the notation 
that you lose sight of what's really 
happening! 


FIGURE 5.3 Two-way table showing 
events A and B from the pierced- 
ears example. These events are 

not mutually exclusive, so we can’t 
find P(A or B) by just adding the 
probabilities of the two events. 


The previous example revealed two important facts about finding the prob- 
ability P(A or B) when the two events are not mutually exclusive. First, the use 
of the word “or” in probability questions is different from that in everyday life. If 
someone says, “I'll either watch a movie or go to the football game,” that usually 
means they'll do one thing or the other, but not both. In statistics, “A or B” could 
mean one or the other or both. Second, we can’t use the addition rule for mutu- 
ally exclusive events unless two events have no outcomes in common. 

If events A and B are not mutually exclusive, they can occur together. The 
probability that one or the other occurs is then /ess than the sum of their prob- 
abilities. As Figure 5.3 illustrates, outcomes common to both are counted twice 
when we add probabilities. 


We can fix the double-counting problem illustrated in the two-way table by 
subtracting the probability P(A and B) from the sum. That is, 


P(A or B) = P(A) + P(B) — P(A and B) 
This result is known as the general addition rule. Let’s check that it works for 
the pierced-ears example: 
P(A or B) = P(A) + P(B) — P(A and B) 
= 90/178 + 103/178 — 19/178 
= 174/178 


This matches our earlier result. 


GENERAL ADDITION RULE FOR TWO EVENTS 


If A and B are any two events resulting from some chance process, then 


P(A or B) = P(A) + P(B) — P(A and B) 


Sometimes it’s easier to designate 
events with letters that relate to the 


context, like F for “face card” and H for 


“heart.” 


FIGURE 5.4 Venn diagrams 
showing: (a) event A and its 
complement A° and (b) mutually 


exclusive (disjoint) events A and B. 


Here’s a way to keep the symbols 
straight: U for union; M for 
intersection. 


FIGURE 5.5 Venn diagrams 
showing (a) the intersection and 
(b) the union of events A and B. 
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on, 


What happens if we use the general addition rule for two mutually exclu- 


sive events A and B? In that case, P(A and B) = 0, and the formula reduces 
to P(A or B) = P(A) + P(B). In other words, the addition rule for mutually 
exclusive events is just a special case of the general addition rule. 


You might be wondering whether there is also a rule for finding P(A and B). 


There is, but it’s not quite as intuitive. Stay tuned for that later. 


CHECK YOUR UNDERSTANDING 


A standard deck of playing cards (with jokers removed) consists of 52 cards in four suits— 
clubs, diamonds, hearts, and spades. Each suit has 13 cards, with denominations ace, 2, 3, 
4,5, 6,7, 8,9, 10, jack, queen, and king. The jack, queen, and king are referred to as “face 
cards.” Imagine that we shuffle the deck thoroughly and deal one card. Let’s define events 
F: getting a face card and H: getting a heart. 


1. Make a two-way table that displays the sample space. 

2. Find P(F and H). 

3. Explain why P(F or H) # P(F) + P(H). Then use the general addition rule to find 
P(F or H). 


Venn Diagrams and Probability 


We have already seen that two-way tables can be used to illustrate the sample 
space of a chance process involving two events. So can Venn diagrams. Because 
Venn diagrams have uses in other branches of mathematics, some standard vo- 
cabulary and notation have been developed. 


We introduced the complement of an event earlier. In Figure 5.4(a), the com- 
plement A® contains exactly the outcomes that are not in A. 


The events A and B in Figure 5.4(b) are mutually exclusive (disjoint) because 
they do not overlap; that is, they have no outcomes in common. 


(a) (b) 


Figure 5.5(a) shows the event “A and B.” You can see why this event is also 
called the intersection of A and B. The corresponding notation is AM B. 


The event “A or B” is shown in Figure 5.5(b). This event is also known as the 
union of A and B. The corresponding notation is A U B. 


ANB 
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FIGURE 5.6 Completed Venn 
diagram for the large group of 
college students. The circles rep- 
resent the two events A: is male 
and B: has pierced ears. 


Who Has Pierced Ears? 


Understanding Venn diagrams 


In the preceding example, we looked at data from a survey on gender and ear 
piercings for a large group of college students. The chance process came from se- 
lecting a student in the class at random. Our events of interest were A: is male and 
B: has pierced ears. Here is the two-way table that summarizes the sample space: 


Gender 
Pierced Ears? Male Female 
Yes 19 84 
No 71 4 


How would we construct a Venn diagram that displays the information in the two- 
way table? 


There are four distinct regions in the Venn diagram shown in Figure 5.6. These 
regions correspond to the four cells in the two-way table. We can describe this cor- 
respondence in tabular form as follows: 


Region in Venn diagram In words In symbols Count 
In the intersection of two circles Male and pierced ears ANB 
Inside circle A, outside circle B Male and no pierced ears An Be 
Inside circle B, outside circle A Female and pierced ears A OB 
Outside both circles Female and no pierced ears ASO Bo 


We have added the appropriate counts of students to the four regions in Figure 5.6. 


With this new notation, we can rewrite the general addition rule in symbols as 
P(A U B) = P(A) + P(B) — P(AN B) 
This Venn diagram shows why the formula works in the pierced-ears example. 


Sample space 


Outcomes here are 
double-counted by 


P(A) + P(B) Event B 
pierced ears 
Event A P(B) = 103/178 
male 


P(A) = 90/178 


male and pierced ears 
P(A NM B) = 19/178 


Event A and B | 


The following example ties all this together. 
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Who Reads the Paper? 


Venn diagrams, two-way tables, and probability 


In an apartment complex, 40% of residents read USA Today. Only 25% read 
the New York Times. Five percent of residents read both papers. Suppose we 
select a resident of the apartment complex at random and record which of 
the two papers the person reads. 


PROBLEM: 

(a) Make a two-way table that displays the sample space of this chance process. 
(b) Construct a Venn diagram to represent the outcomes of this chance process. 
(c) Find the probability that the person reads at least one of the two papers. 

(d) Find the probability that the person doesn't read either paper. 


SOLUTION: We'lldefine events A: reads USA Today and B: reads New York Times. From the problem 
statement, we know that P(A) = 0.40, A.B) = 0.25, and P(A M B) = 0.05. 


(a) We can enter the value 0.40 as the total for the “Yes” column, 0.25 as the total for the “Yes” 
row, 0.05 in the “Yes, Yes” cell, and 1 as the grand total in the two-way table shown here. This gives 
us enough information to fill in the empty cells of the table, starting with the missing row total for 
“No” (1 — 0.25 = 0.75) and the missing column total for “No” (1 — 0.40 = 0.60). Ina similar 
way, we can determine the missing number in the “Yes” row (0.25 — 0.05 = 0.20) and the “Yes” 
column (0.40 — 0.05 = 0.35). That leaves 0.40 for the “No, No” cell. 


Reads USA Today? 
A: reads B: reads Reads New York Times Yes No Total 
USA Today New York Times Yes 0.05 0.20 0.25 
No 0.35 0.40 0.75 
Total 0.40 0.60 1.00 


(b) Figure 5.7 shows the Venn diagram that corresponds to the completed two- 
way table from (a). 

(c) Ifthe randomly selected person reads at least one of the two papers, then he or 
she reads USA Today, the New York Times, or both papers. But that’s the same as the 
event A U B, From the two-way table, the Venn diagram, or the general addition rule, 
FIGURE 5.7 Venn diagram we have 


showing the residents of an AU B) = PIA) + =—PAN 
apartment complex who A: read i" 4) a Aleit oe ts) )aan | B) 
USA Today and B: read the New = 0.40 + 0.25 — 0.05 = 0.60 


York Times. So there's a 60% chance that the randomly selected resident reads at least one of the two papers. 
(d) From the two-way table or Venn diagram, P(reads neither paper) = P(A’ M BY) = 0.40. 


For Practice Try Exercise 


AP® EXAM TIP Many probability problems involve simple computations that you can do on 
your calculator. It may be tempting to just write down your final answer without showing the 
supporting work. Don’t do it! A “naked answer,” even if it’s correct, will usually earn you no 
credit on a free-response question. 


314 CHAPTER 5 PROBABILITY: WHAT ARE THE CHANCES? 


In the previous example, the event “reads neither paper” is the complement of 
the event “reads at least one of the papers.” To solve part (d) of the problem, we 
could have used our answer from (c) and the complement rule: 


P(reads neither paper) = 1 — P(reads at least one paper) = 1 — 0.60 = 0.40 


As you'll see in Section 5.3, the fact that “none” is the opposite of “at least one” 
comes in handy for a variety of probability questions. 


Section 5.2. Summary 


e A probability model describes chance behavior by listing the possible outcomes 
in the sample space S and giving the probability that each outcome occurs. 


e An event is a subset of the possible outcomes in the sample space. To find the 
probability that an event A happens, we can rely on some basic probability rules: 


© ForanyeventaA, 0 = PA) = 1, 
e P(S) = 1, where S = the sample space 
e If all outcomes in the sample space are equally likely, 


number of outcomes corresponding to event A 


P(A) = 
) total number of outcomes in sample space 

© Complement rule: P(AC) = 1 — P(A), where A© is the complement of 
event A; that is, the event that A does not happen. 

e Addition rule for mutually exclusive events: Events A and B are mutually 
exclusive (disjoint) if they have no outcomes in common. If A and B are 
disjoint, P(A or B) = P(A) + P(B). 

e A two-way table or a Venn diagram can be used to display the sample space for 
a chance process. ‘Two-way tables and Venn diagrams can also be used to find 
probabilities involving events A and B, like the union (A U B) and intersection 
(A.M B). The event A U B (“A or B”) consists of all outcomes in event A, event B, or 
both. The event AN B (“A and B”) consists of outcomes in both A and B. 


e ‘The general addition rule can be used to find P(A or B): 
P(A or B) = P(A U B) = P(A) + P(B) — P(AN B) 


Section 5.2 Exercises 


39. Role-playing games Computer games in which the (a) List the sample space for rolling the die twice (spots 
playing g puter g ple sp g P 
1) 306| players take the roles of characters are very popular. ‘They showing on first and second rolls). 
& go back to earlier tabletop games such as Dungeons & (b) What is the assignment of probabilities to outcomes 
Dragons. These games use many different types of dice. in this sample space? Assume that the die is perfectly 


A four-sided die has faces with 1, 2, 3, and 4 spots. balanced. 


42. 


43. 


44, 


8%. 
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. Tossing coins Imagine tossing a fair coin 3 times. 


What is the sample space for this chance process? 


What is the assignment of probabilities to outcomes 
in this sample space? 


Role-playing games Refer to Exercise 39. Define 
event A: sum is 5. Find P(A). 


Tossing coins Refer to Exercise 40. Define event B: 
get more heads than tails. Find P(B). 


Probability models? In each of the following situ- 
ations, state whether or not the given assignment of 
probabilities to individual outcomes is legitimate, 
that is, satisfies the rules of probability. If not, give 
specific reasons for your answer. 


Roll a 6-sided die and record the count of spots on 
the up-face: P(1) = 0, P(2) = 1/6, P(3) = 1/3, P(4) = 
1/3, P(5) = 1/6, P(6) = 0. 


Choose a college student at random and record gen- 
der and enrollment status: P(female full-time) = 0.56, 
P(male full-time) = 0.44, P(female part-time) = 0.24, 
P(male parttime) = 0.17. 


Deal a card from a shuffled deck: P(clubs) = 12/52, 
P(diamonds) = 12/52, P(hearts) = 12/52, P(spades) = 
16/52. 


Rolling a die The following figure displays several pos- 
sible probability models for rolling a die. Some of the 
models are not legitimate. ‘That is, they do not obey the 
rules. Which are legitimate and which are not? In the 
case of the illegitimate models, explain what is wrong. 


Probability 
Outcome Modell Model2 Model3 Model4 
1/7 1/3 1/3 1 
ea 1/7 1/6 1/6 1 
["-.| 1/7 1/6 1/6 2 
I: 3| 1/7 0 1/6 1 
[Z| 1/7 1/6 1/6 1 
El 1/7 1/6 1/6 2 


Blood types All human blood can be typed as one of 


O, A, B, or AB, but the distribution of the types varies 
a bit with race. Here is the distribution of the blood 
type of a randomly chosen black American: 


Blood type: 0 A B AB 
Probability: 0.49 0.27 


0.20 ? 


46. 


48. 
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What is the probability of type AB blood? Why? 


What is the probability that the person chosen does 
not have type AB blood? 


Maria has type B blood. She can safely receive blood 
transfusions from people with blood types O and B. 
What is the probability that a randomly chosen black 
American can donate blood to Maria? 


Languages in Canada Canada has two official 
languages, English and French. Choose a Canadian at 
random and ask, “What is your mother tongue?” Here 
is the distribution of responses, combining many sepa- 
rate languages from the broad Asia/Pacific region:’ 


Language: 
Probability: 0.63 0.22 0.06 ? 


English French Asian/Pacific Other 


What probability should replace “?” in the distribu- 
tion? Why? 


What is the probability that a Canadian’s mother 
tongue is not English? 


What is the probability that a Canadian’s mother 
tongue is a language other than English or French? 


. Education among young adults Choose a young 


adult (aged 25 to 29) at random. The probability 

is 0.13 that the person chosen did not complete 
high school, 0.29 that the person has a high school 
diploma but no further education, and 0.30 that the 
person has at least a bachelor’s degree. 


What must be the probability that a randomly cho- 
sen young adult has some education beyond high 
school but does not have a bachelor’s degree? Why? 


What is the probability that a randomly chosen young 
adult has at least a high school education? Which rule 
of probability did you use to find the answer? 


Preparing for the GMAT A company that offers courses 
to prepare students for the Graduate Management 
Admission ‘Test (GMAT) has the following information 
about its customers: 20% are currently undergraduate 
students in business; 15% are undergraduate students in 
other fields of study; 60% are college graduates who are 
currently employed; and 5% are college graduates who 
are not employed. Choose a customer at random. 


What's the probability that the customer is currently 
an undergraduate? Which rule of probability did you 
use to find the answer? 

What’s the probability that the customer is not an 
undergraduate business student? Which rule of prob- 
ability did you use to find the answer? 


. Who eats breakfast? Students in an urban school 


were curious about how many children regularly eat 
breakfast. They conducted a survey, asking, “Do you 
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eat breakfast on a regular basis?” All 595 students in 
the school responded to the survey. The resulting 
data are shown in the two-way table below.* 


Male Female Total 
Eats breakfast regularly 190 110 300 
Doesn't eat breakfast regularly 130 165 295 


Total 


320 275 595 


If we select a student from the school at random, 
what is the probability that the student is 


a female? 

someone who eats breakfast regularly? 
a female and eats breakfast regularly? 
a female or eats breakfast regularly? 


Sampling senators ‘The two-way table below describes 
the members of the U.S Senate in a recent year. 


Male Female 
Democrats 47 18 
Republicans 36 4 


If we select a U.S. senator at random, what’s the 
probability that the senator is 


a Democrat? 

a female? 

a female and a Democrat? 
a female or a Democrat? 


Roulette An American roulette wheel has 38 slots 
with numbers | through 36, 0, and 00, as shown in the 
figure. Of the numbered slots, 18 are red, 18 are black, 
and 2—the 0 and 00—are green. When the wheel is 
spun, a metal ball is dropped onto the middle of the 


wheel. If the wheel is balanced, the ball is equally likely 


to settle in any of the numbered slots. Imagine spinning 
a fair wheel once. Define events B: ball lands in a black 
slot, and E: ball lands in an even-numbered slot. (‘Treat 
0 and 00 as even numbers.) 


56. 
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Make a two-way table that displays the sample space 
in terms of events B and E. 

Find P(B) and P(E). 

Describe the event “B and E” in words. ‘Then find 
P(B and E). 

Explain why P(B or FE) # P(B) + P(E). Then use the 
general addition rule to compute P(B or E). 


Playing cards Shuffle a standard deck of playing 
cards and deal one card. Define events J: getting a 
jack, and R: getting a red card. 


Construct a two-way table that describes the sample 
space in terms of events J and R. 


Find P(J) and P(R). 


Describe the event “J and R” in words. ‘Then find 
P(J and R). 


Explain why P(J or R) # P(J) + P(R). Then use the 
general addition rule to compute P(J or R). 


. Who eats breakfast? Refer to Exercise 49. 


Construct a Venn diagram that models the chance 
process using events B: eats breakfast regularly, and 
M: is male. 


Find P(B U M). Interpret this value in context. 
Find P(BS N M°). Interpret this value in context. 
Sampling senators Refer to Exercise 50. 


Construct a Venn diagram that models the chance 
process using events R: is a Republican, and 
F: is female. 


Find P(R U F). Interpret this value in context. 
Find P(R© N F°). Interpret this value in context. 


Facebook versus YouTube A recent survey suggests 
that 85% of college students have posted a profile on 
Facebook, 73% use YouTube regularly, and 66% do 

both. Suppose we select a college student at random. 


Make a two-way table for this chance process. 
Construct a Venn diagram to represent this setting. 


Consider the event that the randomly selected col- 
lege student has posted a profile on Facebook or uses 
YouTube regularly. Write this event in symbolic form 
based on your Venn diagram in part (b). 


Find the probability of the event described in part (c). 
Explain your method. 


Mac or PC? A recent census at a major university re- 
vealed that 40% of its students mainly used Macintosh 
computers (Macs). The rest mainly used PCs. At the 
time of the census, 67% of the school’s students were 
undergraduates. The rest were graduate students. 


In the census, 23% of respondents were graduate 
students who said that they used PCs as their main 
computers. Suppose we select a student at random 
from among those who were part of the census. 


(a) Make a two-way table for this chance process. 
(b) Construct a Venn diagram to represent this setting. 


(c) Consider the event that the randomly selected stu- 
dent is a graduate student and uses a Mac. Write this 
event in symbolic form based on your Venn diagram 
in part (b). 


(d) Find the probability of the event described in part (c). 
Explain your method. 


Multiple choice: Select the best answer for Exercises 57 to 60. 


57. In government data, a household consists of all occu- 
pants of a dwelling unit. Choose an American house- 
hold at random and count the number of people it 
contains. Here is the assignment of probabilities for 
the outcome: 


Number of persons: 1 2 5 4 5 6 7+ 
Probability: 0.25 0.32 72? 272 0.07 0.03 0.01 


The probability of finding 3 people in a household is 
the same as the probability of finding 4 people. These 
probabilities are marked ??? in the table of the distri- 
bution. The probability that a household contains 3 
people must be 


(a) 0.68.  (b) 0.32. (d) 0.08. 


(e) between 0 and 1, and we can say no more. 


(c) 0.16. 


58. Ina sample of 275 students, 20 say they are vegetar- 
ians. Of the vegetarians, 9 eat both fish and eggs, 
3 eat eggs but not fish, and 7 eat neither. Choose one 
of the vegetarians at random. What is the probability 
that the chosen student eats fish or eggs? 


(a) 9/20 (c) 22/20 (e) 22/275 
(b) 13/20 (d) 9/275 


Exercises 59 and 60 refer to the following setting. The 
casino game craps is based on rolling two dice. Here is the 
assignment of probabilities to the sum of the numbers on 
the up-faces when two dice are rolled: 


Outcomes 2 3 4 5 6 7 8 9 10 11 12 
Probability: 1/36 2/36 3/36 4/36 5/36 6/36 5/36 4/36 3/36 2/36 1/36 


59. The most common bet in craps is the “pass line.” A 
pass line bettor wins immediately if either a 7 or an 
11 comes up on the first roll. This is called a natural. 
What is the probability of a natural? 


2/36 (c) 8/36 — (e) 20/36 
6/36 (d) 12/36 


ge & 
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Ifa player rolls a 2, 3, or 12, it is called craps. What 
is the probability of getting craps or an even sum on 
one roll of the dice? 


4/36 (c) 20/36 
18/36 (d) 22/36 


Crawl before you walk (3.2) At what age do babies 
learn to crawl? Does it take longer to learn in the 
winter, when babies are often bundled in clothes that 
restrict their movement? Perhaps there might even 
be an association between babies’ crawling age and 
the average temperature during the month they first 
try to crawl (around six months after birth). Data 
were collected from parents who brought their ba- 
bies to the University of Denver Infant Study Center 
to participate in one of a number of studies. Parents 
reported the birth month and the age at which their 
child was first able to creep or crawl a distance of 4 
feet within one minute. Information was obtained on 
414 infants (208 boys and 206 girls). Crawling age is 
given in weeks, and average temperature (in °F) is 
given for the month that is six months after the birth 
month.” 


(e) 32/36 


Average Average 
Birth month crawling age temperature 
January 29.84 66 
February Sie? 73 
March 29.70 72 
April 31.84 63 
May 28.58 52 
June 31.44 39 
July 33.64 33 
August 32.82 30 
September 33.83 33 
October SR) 37 
November 33.38 48 
December BO OnO2. Oil 


Analyze the relationship between average crawling 
age and average temperature. What do you conclude 
about when babies learn to crawl? 


. Treating low bone density (4.2) Fractures of the 


spine are common and serious among women with 
advanced osteoporosis (low mineral density in the 
bones). Can taking strontium ranelate help? A large 
medical trial assigned 1649 women to take either 
strontium ranelate or a placebo each day. All of 

the subjects had osteoporosis and had had at least 
one fracture. All were taking calcium supplements 
and receiving standard medical care. The response 
variables were measurements of bone density and 
counts of new fractures over three years. ‘The subjects 
were treated at 10 medical centers in 10 different 
countries.!° Outline an appropriate design for this 
experiment. Explain why this is the proper design. 
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- 53 Conditional Probability 


and Independence 


WHAT YOU WILL LEARN __ By the end of the section, you will be able to: 


Calculate and interpret conditional probabilities. e Determine if two events are independent. 
Use the general multiplication rule to calculate e When appropriate, use the multiplication rule for 


probabilities. independent events to compute probabilities. 


Use tree diagrams to model a chance process and 
calculate probabilities involving two or more events. 


The probability we assign to an event can change if we know that some other 
event has occurred. This idea is the key to many applications of probability. 


What Is Conditional Probability? 


Let’s return to the setting of the pierced-ears example (page 309). Earlier, we used the 
two-way table below to find probabilities involving events A: is male and B: has pierced 
ears for a randomly selected student. Here is a summary of our previous results: 

P(A) = P(male) = 90/178 P(A M B) = P(male and pierced ears) = 19/178 
P(B) = P(pierced ears) = 103/178 P(A U B) = P(male or pierced ears) = 174/178 


Gender 
Pierced Ears Male Female Total 
Yes 19 84 103 
No 71 4 75 
Total 90 88 178 


Now let’s turn our attention to some other interesting probability questions. 


Who Has Pierced Ears? 
The idea of conditional probability 


1. If we know that a randomly selected student has pierced ears, what is the 
probability that the student is male? There are a total of 103 students in the class 
with pierced ears. We can restrict our attention to this group, since we are told 
that the chosen student has pierced ears. Because there are 19 males among the 
103 students with pierced ears, the desired probability is 


P(is male given has pierced ears) = 19/103, or about 18.4% 


2. If we know that a randomly selected student is male, what’s the probability 
that the student has pierced ears? This time, our attention is focused on the 
males in the class. Because 19 of the 90 males in the class have pierced ears, 


Phas pierced ears given is male) = 19/90, or about 21.1% 


These two questions sound alike, but they’re asking about two very different things. 
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A probability like “the probability that a randomly selected student is male given 
that the student has pierced ears” is known as a conditional probability. The name 
comes from the fact that we are trying to find the probability that one event will hap- 
pen under the condition that some other event is already known to have occurred. 
We often use the phrase “given that” to signal the condition. There’s even a special 
notation to indicate a conditional probability. In the example above, we would write 
P(is male | has pierced ears), where the | means “given that” or “under the condition 
that.” Because we already defined the events A: is male and B: has pierced ears, we 


could write the conditional probability as P(A | B) = 19/103. 


Let’s look more closely at how conditional probabilities are calculated. From 
the two-way table below, we see that 


Number of students who are male and have pierced ears 19 


P(male | pierced = 
irnale | prested eats) Number of students with pierced ears 103 


Gender 


Pierced Ears? Male Female Total 


Yes 19 84 103 = P(male | pierced ears) = 19/103 
No 71 4 75 
Total 90 88 178 


What if we focus on probabilities instead of numbers of students? Notice that 


19 
P(male and pierced ears) 178 19. Peeiele d 
P(pierced ears) 7 103 ~ es sd eater 

178 


P(AMB) 
P(B) 


computing a conditional probability. 


In symbols, P(A | B) = . This observation leads to a general formula for 


CALCULATING CONDITIONAL PROBABILITIES 
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Here’s an example that illustrates the use of this formula in a familiar setting. 


e 


Who Reads the Paper? 


Conditional probability formula 


On page 313, we classified the residents of a large apartment complex based on 
the events A: reads USA Today, and B: reads the New York Times. The com- 
pleted Venn diagram is reproduced here. 


Reads both 
papers 


B: reads 


A: reads 
New York Times 


USA Today 


AP® EXAM TIP You can PROBLEM: What's the probability that a randomly selected resident who reads USA Today also 


Imes? 
write Statamonts like reads the New York Timesé 


P(BI A) if events A and Bare SOLUTION: Because we're given that the randomly chosen resident reads USA Today, we want to 
defined clearly, or you can find P(reads New York Times | reads USA Today), or P(B| A). By the conditional probability formula, 


use a verbal equivalent, such P(BM A) 
as P(reads New York Times P(B| A) = AA) 
| reads USA Today). Use the = = 
approach that makes the Because P(BM A) = 0.05 and P(A) = 0.40, we have 
0.05 
most sense to you. P(B| A) = = 0.125 
0.40 
There’s a 12.5% chance that a randomly selected resident who reads USA Today also reads the 
New York Times. 
For Practice Try Exercise 
THINK Is there a connection between conditional probability and 


the conditional distributions of Chapter 1? Of course! For the col- 
ABOUT IT lege statistics class that we discussed earlier, Figure 5.8 shows the conditional 
distribution of ear-piercing status for each gender. Above, we found that P(pierced 
ears | male) = 19/90, or about 21%. Note that P(pierced ears | female) = 84/88, 
or about 95%. You can see these values displayed in the “Yes” bars of Figure 5.8. 


[Pierced ears? Yes [J Pierced ears? No 


100 5 


x 4- 
20 4 
FIGURE 5.8 Conditional distribution o- 
of ear-piercing status for each gen- Male Female 
der in a large college statistics class. Gender 


O83 
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/ CHECK YOUR UNDERSTANDING 


Students at the University of New Harmony received 10,000 course grades last semester. 
The two-way table below breaks down these grades by which school of the university 
taught the course. The schools are Liberal Arts, Engineering and Physical Sciences, and 
Health and Human Services. 


Grade Level 
School A B Below B 
Liberal Arts 2,142 1,890 2,268 
Engineering and Physical Sciences 368 432 800 
Health and Human Services 882 630 588 


(This table is based closely on grade distributions at an actual university, simplified a bit 
for clarity.)'! 

College grades tend to be lower in engineering and the physical sciences (EPS) than 
in liberal arts and social sciences (which includes Health and Human Services). Choose a 
University of New Harmony course grade at random. Consider the two events E: the grade 
comes from an EPS course, and L: the grade is lower than a B. 


1. Find P(L). Interpret this probability in context. 


2. Find P(E | L) and P(L | E). Which of these conditional probabilities tells you whether 
this college’s EPS students tend to earn lower grades than students in liberal arts and 
social sciences? Explain. 


The General Multiplication Rule 
and Tree Diagrams 


Suppose that A and B are two events resulting from the same chance process. We 
can find the probability P(A or B) with the general addition rule: 


P(A or B) = P(A UB) = P(A) + P(B) — P(AMB) 


How do we find the probability that both events happen, P(A and B)? 
Start with the conditional probability formula 


P(BMA) 
P(A) 


The numerator, P(B MA), is the probability we want because P(B and A) is the 
same as P(A and B). Multiply both sides of the above equation by P(A) to get 


P(A): P(B| A) = P(BNA) = P(ANB) = P(A and B) 


This formula is known as the general multiplication rule. 


P(B| A) = 


GENERAL MULTIPLICATION RULE 
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In words, this rule says that for both of two events to occur, first one must occur, 
and then given that the first event has occurred, the second must occur. This 
is just common sense expressed in the language of probability, as the following 
example illustrates. 


Teens with Online Profiles 


Using the general multiplication rule 


The Pew Internet and American Life Project find that 93% of teenagers (ages 
12 to 17) use the Internet, and that 55% of online teens have posted a profile on a 
social-networking site.'” 


PROBLEM: Find the probability that a randomly selected teen uses the Internet and has posted 
a profile. Show your work. 

SOLUTION: Weknow that Plonline) = 0.93 and P(profile | online) = 0.55. Use the general 
multiplication rule: 


P(online and have profile) = Plonline) - P(profile | online) 
= (0.93)(0.55) = 0.5115 


There is about a 51% chance that a randomly selected teen uses the Internet and has posted a profile 
ona Social-networking site. 


For Practice Try Exercise 


The general multiplication rule is especially useful when a chance process 
involves a sequence of outcomes. In such cases, we can use a tree diagram to 
display the sample space. 


Serve It Up! 


Tree diagrams and the general multiplication rule 


Tennis great Roger Federer made 63% of his first serves in the 2011 season. When 
Federer made his first serve, he won 78% of the points. When Federer missed his 
first serve and had to serve again, he won only 57% of the points.'* Suppose we 
randomly choose a point on which Federer served. 


Figure 5.9 on the facing page shows a tree diagram for this chance process. There 
are only two possible outcomes on Federer’s first serve, a make or a miss. The 
first set of branches in the tree diagram displays these outcomes with their prob- 
abilities. The second set of branches shows the two possible results of the point for 
Federer—win or lose—and the chance of each result based on the outcome of the 
first serve. Note that the probabilities on the second set of branches are conditional 
probabilities, like P(win point | make first serve) = 0.78. 


FIGURE 5.9 A tree diagram 
displaying the sample space of 
randomly choosing a point on 
which Roger Federer served. 
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P(makes first-serve 
0.78 Wins point and wins point) 
~~ = (0.63) X (0.78) = 0.4914 
Makes 
—_ 
0.63 0.22 Doesn’t 
win point 


P(makes first-serve 
and doesn’t win point) 
= (0.63) X (0.22) = 0.1386 


———— P(misses first-serve 
Wins point and wins point) 


0.37 0.57 = (0.37) x (0.57) = 0.2109 


Misses 
first-serve : ; 
P(misses first-serve 


and doesn’t win point) 
= (0.37) X (0.43) = 0.1591 


0.43 Doesn’t 
win point 


What’s the probability that Federer makes the first serve and wins the point? From 
the tree diagram, Federer makes 63% of his first serves. Of this 63%, he wins 


the point 78% of the time. Because 78% of 63% = (0.63)(0.78) = 0.4914, Federer 


makes his first serve and wins the point about 49.14% of the time. 


The previous calculation amounts to multiplying probabilities along the branches 
of the tree diagram. Why does this work? The general multiplication rule provides 
the answer: 


P(make first serve and win point) = P(make first serve) - P(win point | make first serve) 


= (0.63)(0.78) = 0.4914 


When Federer is serving, what’s the probability that he wins the point? From the 
tree diagram, there are two ways Federer can win the point. He can make the first 
serve and win the point, or he can miss the first serve and win the point. Because 
these outcomes are mutually exclusive, 


P(win point) = P(makes first serve and wins point) + P(misses first serve and wins point) 


Some people use a result known as 
Bayes’s theorem to solve probability 
questions that require “going 
backward” in a tree diagram, like the 
one in this example. To be honest, 
Bayes’s theorem is just a complicated 
formula for computing conditional 
probabilities. For that reason, we won’t 
introduce it. 


= 0.4914 + 0.2109 = 0.7023 


Federer wins about 70% of the points when he is serving. 


Some interesting conditional probability questions involve “going in reverse” 
on a tree diagram. Here’s one related to the previous example. Suppose you are 
watching a recording of one of Federer’s matches from 2011 and he is serving in 
the current game. You get distracted before seeing his first serve but look up in 
time to see Federer win the point. How likely is it that he missed his first serve? 

To find this probability, we start with the result of the point, which is displayed 
on the second set of branches in Figure 5.9, and ask about the outcome of the 
serve, which is shown on the first set of branches. We can use the information from 
the tree diagram and the conditional probability to do the required calculation: 


P(nissed first serve and wins point) 


P(missed first serve | wins point) = , 
P(wins point) 


= 0.2109 _ 9.2109 
0.4914 + 0.2109 0.7023 


Given that Federer won the point, there is about a 30% chance that he missed his 
first serve. 


= 0.3003 
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Here is another example where we need to reverse the conditioning. 


Who Visits YouTube? 


Tree diagrams and conditional probability 


Video-sharing sites, led by YouTube, are popular destinations on the Internet. Let’s 
look only at adult Internet users, aged 18 and over. About 27% of adult Internet us- 
ers are 18 to 29 years old, another 45% are 30 to 49 years old, and the remaining 
28% are 50 and over. The Pew Internet and American Life Project finds 
that 70% of Internet users aged 18 to 29 have visited a video-sharing site, 
along with 51% of those aged 30 to 49 and 26% of those 50 or older. Do 
most Internet users visit YouTube and similar sites? 


PROBLEM: Suppose we select an adult Internet user at random. 
(a) Draw a tree diagram to represent this situation. 
(b) Find the probability that this person has visited a video-sharing site. Show your work. 


(c) Given that this person has visited a video-sharing site, find the probability that he or she is aged 
18 to 29. Show your work. 


SOLUTION: 


Visits video-sharing 
oe sites (a) The tree diagram in Figure 5.10 organizes the given information. 


(b) There are three disjoint paths to “visits video-sharing sites,’ one for each 


18 tO 29 Ses 
Doesn't Visit eee 
of the three age groups. These paths are colored red in Figure 5.10. Because the 
02 
0.45 


7 three paths are disjoint, the probability that an adult Internet user visits video- 
visits video-sharing| | sharing sites is the sum of their probabilities: 


0.51 | sites 
—= |30 to 49 a. visits video-sharing sites) = (0.27)(0.70) + (0.45)(0.51) + (0.28)(0.26) 


= 0.1890 + 0.2295 + 0.0728 = 0.4913 
video-sharing Sites 


eal About 49% of all adult Internet users have visited a video-sharing site. 
vee vied 
eas —_ aus (c) Use the tree diagram and the definition of conditional probability: 
= ig : P(18 to 29 and visits video-sharing site) 
Doesn't Visit P(18 to 29 | visits video-sharing site) = a —— 
video-sharing sites P(visits video-sharing site) 
0.1890 
= = 0.3847 
FIGURE 5.10 Tree diagram for use of the Internet 0.4913 


and video-sharing sites such as YouTube. The three — Given that the person visits video-sharing sites, there is about a 38.5% chance 
disjoint paths to the outcome that an adult Internet — that he or she is aged 18 to 29. 
user visits video-sharing sites are colored red. 


For Practice Try Exercise 


One of the most important applications of tree diagrams and conditional prob- 
ability is in the area of drug and disease testing. 
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Mammograms 
Conditional probability in real life 


Many women choose to have annual mammograms to screen for breast cancer after 
age 40. A mammogram isn’t foolproof. Sometimes, the test suggests that a woman has 
breast cancer when she really doesn’t (a “false positive”). Other times, the test says that 
a woman doesn’t have breast cancer when she actually does (a “false negative”). 


Suppose that we know the following information about breast cancer and mam- 
mograms in a particular region: 
¢ One percent of the women aged 40 or over in this region have breast cancer. 
e¢ For women who have breast cancer, the probability of a negative mammo- 
gram is 0.03. 
For women who don’t have breast cancer, the probability of a positive mam- 
mogram is 0.06. 


PROBLEM: Arandomly selected woman aged 40 or over from this region 


Positive 
ca Ee wee mammogram| | tests positive for breast cancer in a mammogram. Find the probability that 


she actually has breast cancer. Show your work. 


Negati 
a Se SOLUTION: The tree diagram in Figure 5.11 summarizes the situation. 


Because 1% of women in this region have breast cancer, 99% don’t. Of those 


06 Positive women who do have breast cancer, 3% would test negative ona mammogram. 
ee oll manne grain The remaining 97% would (correctly) test positive. Among the women who 


Beeasr catia | Negative don't have breast cancer, 6% would test positive ona mammogram. The 
0.94 9 
mammogram 


remaining 94% would (correctly) test negative. 


FIGURE 5.11 Tree diagram 

showing whether or not a 

woman has breast cancer and 

the likelihood of her receiving a 
positive or anegative test result , 
from a mammogram. 


We want to find P(breast cancer | positive mammogram). By the conditional probability formula, 
P(breast cancer and positive mammogram) 


P(breast cancer | positive mammogram) = Apositi 
positive mammogram 


To find P(breast cancer and positive mammogram), we use the general multiplication rule along 
with the information displayed in the tree diagram: 

Abreast cancer and positive mammogram) = P(breast cancer) - P(positive mammogram | breast cancer) 

= (0.01)(0.97) = 0.0097 
To find P(positive mammogram), we need to calculate the probability that a randomly selected wom- 
an aged 40 or over from this region gets a positive mammogram. There are two ways this can happen: 
(1) ifthe woman has breast cancer and the test result is positive, and (2) ifthe woman doesn't have 
cancer, but the mammogram gives a false positive. From the tree diagram, the desired probability is 
P(positive mammogram) = (0.01)(0.97) + (0.99)(0.06) = 0.0691 


Using these two results, we can find the conditional probability: 


P(breast cancer and positive mammogram) 0.0097 _ aan 


P(breast cancer | positive mammogram) = Ppositi ) 
positive mammogram 


Given that a randomly selected woman from the region has a positive mammogram, there is only 
about a 14% chance that she actually has breast cancer! 


For Practice Try Exercise 
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Are you surprised by the final result of the example? Most people are. Sometimes 
a two-way table that includes counts is more convincing. ‘To make calculations 
simple, we'll suppose that there were exactly 10,000 women aged 40 or over in this 
region, and that exactly 100 have breast cancer (that’s 1% of the women). 

How many of those 100 would have a positive mammogram? It would be 97% 
of 100, or 97 of them. That leaves 3 who would test negative. How many of the 
9900 women who don’t have breast cancer would get a positive mammogram? Six 
percent of them, or (0.06)(9900) = 594 women. The remaining 9900 — 594 = 9306 
would test negative. In total, 97 + 594 = 691 women would have positive mam- 
mograms and 3 + 9306 = 9309 women would have negative mammograms. ‘This 
information is summarized in the two-way table below: 


Has breast cancer? 


Yes No Total 
Mammogram Positive 97 594 691 
result Negative 3 9306 9309 
Total 100 9900 10,000 


Given that a randomly selected woman has a positive mammogram, the 
two-way table shows that the conditional probability P(breast cancer | positive 
mammogram) = 97/691 = 0.14. 

This example illustrates an important fact when considering proposals for wide- 
spread testing for serious diseases or illegal drug use: if the condition being tested 
is uncommon in the population, many positives will be false positives. The best 
remedy is to retest any individual who tests positive. 


CHECK YOUR UNDERSTANDING 


A computer company makes desktop and laptop computers at factories in three states— 
California, ‘Texas, and New York. The California factory produces 40% of the company’s 
computers, the Texas factory makes 25%, and the remaining 35% are manufactured in 
New York. Of the computers made in California, 75% are laptops. Of those made in ‘Texas 
and New York, 70% and 50%, respectively, are laptops. All computers are first shipped to 
a distribution center in Missouri before being sent out to stores. Suppose we select a com- 
puter at random from the distribution center. '* 


1. Construct a tree diagram to represent this situation. 
2. Find the probability that the computer is a laptop. Show your work. 
3. Given that a laptop is selected, what is the probability that it was made in California? 


Conditional Probability 
and Independence 


Suppose you toss a fair coin twice. Define events A: first toss is a head, and B: sec- 
ond toss is a head. We know that P(A) = 1/2 and P(B) = 1/2. What’s P(B | A)? It’s 
the conditional probability that the second toss is a head given that the first toss was 
a head. The coin has no memory, so P(B | A) = 1/2. In this case, P(B | A) = P(B). 
Knowing that the first toss was a head does not change the probability that the sec- 
ond toss is a head. 
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Let’s contrast the coin-toss scenario with our earlier pierced-ears example. In 
that case, the chance process involved randomly selecting a student from a col- 
lege statistics class. The events of interest were A: is male, and B: has pierced ears. 
We already found that P(A) = 90/178, P(B) = 103/178, and P(B | A) = 19/90. 


If we know that the chosen student is male, the probability 


sane that he has pierced ears is 19/90 = 0.211. This conditional 
Pierced Ears Male Female Total probability is very different from the unconditional probabil- 
Yes 19 84 103 ity P(B) = 103/178 = 0.579 that a randomly selected student 
No 71 4 75 from the class has pierced ears. 
Total 90 88 178 To recap, P(B | A) = P(B) for events A and B in the coin- 


toss setting. For the pierced-ears scenario, however, P(B | A) # 
P(B). When knowledge that one event has happened does not change the likeli- 
hood that another event will happen, we say that the two events are independent. 


DEFINITION: Independent events 


Two events A and B are independent if the occurrence of one event does not change 
the probability that the other event will happen. In other words, events A and Bare 
independent if P(A| B) = P(A) and AjB| A) = PAB). 


Determining whether two events related to the same chance process are inde- 
pendent requires us to compute probabilities. Here’s an example that shows what 
we mean. 


Lefties Down Under 
Checking for independence 


Is there a relationship between gender and handedness? ‘To find out, we used 
CensusAtSchool’s Random Data Selector to choose an SRS of 100 Australian high 
school students who completed a survey. The two-way table displays data on the 
gender and dominant hand of each student. 


Gender 
Dominant Hand Male Female Total 
Right 39 51 90 
Left 7 3 10 
Total 46 54 100 


PROBLEM: Are the events “male” and “left-handed” independent? Justify your answer. 


SOLUTION: To check whether the two events are independent, we want to find out if knowing that 
one event has happened changes the probability that the other event occurs. Suppose we are told 
that the chosen student is male. From the two-way table, P(left-handed | male) = 7/46 = 0.152. 
The unconditional probability P(left-handed) = 10/100 = 0.10. These two probabilities are close, 
but they're not equal. So the events “male” and “left-handed” are not independent. Knowing that the 
student is male increases the probability that the student is left-handed. 


For Practice Try Exercise 
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You might have thought, “Surely In the preceding example, we could have also compared P(male | left-handed) 
Ss ihe ce aa with P(male). Of the 10 left-handed students in the sample, 7 were male. So 
andiletichandad®aea hound tba P(male | left-handed) = 7/10 = 0.70. We can see from the two-way table that 
independent.” As the example shows, | P(male) = 46/100 = 0.46. Once again, the two probabilities are not equal. 


you can’t use your intuition to check Knowing that a person is left-handed makes it more likely that the person is 
whether events are independent. To male. 
be sure, you have to calculate some 
probabilities. 
THINK Is there a connection between independence of events and 


association between two variables? In the previous example, we 
ABOUT IT found that the events “male” and “left-handed” were not independent. Does that 
mean there actually is a relationship between the variables gender and handed- 
ness in the larger population? Maybe or maybe not. If there is no association 
between the variables, it would be surprising to choose a random sample of 100 
students for which P(left-handed | male) was exactly equal to P(left-handed). But 
these two probabilities should be close to equal if there’s no association between 
the variables. How close is close? You'll have to wait a few chapters to find out. 


or —_$ 


CHECK YOUR UNDERSTANDING 


For each chance process below, determine whether the events are independent. Justify 
your answer. 

1. Shuffle a standard deck of cards, and turn over the top card. Put it back in the deck, 
shuffle again, and turn over the top card. Define events A: first card is a heart, and B: 
second card is a heart. 


2. Shuffle a standard deck of cards, and turn over the top two cards, one at a time. 
Gender Define events A: first card is a heart, and B: second card is a heart. 


Handedness Female Male 3. The 28 students in Mr. Tabor’s AP® Statistics class completed a brief survey. One of 
Left 3 1 the questions asked whether each student was right- or left-handed. The two-way table 
summarizes the class data. Choose a student from the class at random. ‘The events of 
interest are “female” and “right-handed.” 


Right 18 6 


Independence: A Special Multiplication Rule 


What happens to the general multiplication rule in the special case when events A 
and B are independent? In that case, P(B | A) = P(B). We can simplify the general 
multiplication rule as follows: 


P(AM B) = P(A) - (BIA) 
= PUA) < PUB) 
This result is known as the multiplication rule for independent events. 
Cee SS eee 
DEFINITION: Multiplication rule for independent events 
lf A and B are independent events, then the probability that A and B both occur is 
P(AN B) = P(A): P(B) 
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Note that this rule only applies to independent events. Let’s look at an example 
that uses the multiplication rule for independent events to analyze an important 
historical event. 


The Challenger Disaster 


Independence and the multiplication rule 


On January 28, 1986, Space Shuttle Challenger exploded on takeoff. All seven 
crew members were killed. Following the disaster, scientists and statisticians helped 
analyze what went wrong. They determined that the failure of O-ring joints in the 
shuttle’s booster rockets was to blame. Under the cold conditions that day, experts 
estimated that the probability that an individual O-ring joint would function prop- 
erly was 0.977. But there were six of these O-ring joints, and all six had to function 
properly for the shuttle to launch safely. 


PROBLEM: Assuming that 0-ring joints succeed or fail independently, find the probability that 
the shuttle would launch safely under similar conditions. 


SOLUTION: For the shuttle to launch safely, all six O-ring joints need to function properly. The 
chance that this happens is given by 


P( joint 1 OK and joint 2 OK and joint 3 OK and joint 4 OK and joint 5 OK and joint 6 OK) 
By the multiplication rule for independent events, this probability is 
P(joint 1 OK). P(joint 2 OK). P(joint 3 OK) - P(joint 4 OK) - P(joint 5 OK) - P( joint 6 OK) 
= (0.977)(0.977)(0.977)(0.977)(0.977)(0.977) = 0.67 


There’s an 87% chance that the shuttle would launch safely under similar conditions (anda 13% 
chance that it wouldn't). 


Note: As a result of the statistical analysis following the Challenger disaster, NASA 
made important safety changes to the design of the shuttle’s booster rockets. 


For Practice Try Exercise 89] 


The next example uses the fact that “at least one” and “none” are opposites. 


Rapid HIV Testing 


Finding the probability of “at least one” 


Many people who come to clinics to be tested for HIV, the virus that causes 
AIDS, don’t come back to learn the test results. Clinics now use “rapid HIV 
tests” that give a result while the client waits. In a clinic in Malawi, for example, 
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use of rapid tests increased the percent of clients who learned their test results 


from 69% to 99.7%. 


The trade-off for fast results is that rapid tests are less accurate than slower labora- 
tory tests. Applied to people who have no HIV antibodies, one rapid test has prob- 
ability about 0.004 of producing a false positive (that is, of falsely indicating that 
antibodies are present). 


PROBLEM: Ifaclinic tests 200 randomly selected people who are free of HIV antibodies, what 
is the chance that at least one false positive will occur? 


SOLUTION: Itis reasonable to assume that the test results for different individuals are inde- 
pendent. We have 200 independent events, each with probability 0.004. “At least one” combines 
many possible outcomes. It will be easier to use the fact that 


Plat least one positive) = 1 — P(no positives) 


We'll find F{no positives) first. The probability of a negative result for any one personis 1 — 0.004 = 0.996. 
Tofind the probability that all 200 people tested have negative results, use the multiplication rule for indepen- 
dent events: 
P(no positives) = P(all 200 negative) 
= (0.996)(0.996).. . (0.996) 
= 0.9967 = 0.4486 


The probability we want is therefore 


P(at least one positive) = 1 — 0.4486 = 0.5514 


There is more than a 50% chance that at least 1 of the 200 people will test positive for HIV, even 
though no one has the virus. 


For Practice Try Exercise 


The multiplication rule P(A and B) = P(A) - P(B) holds if A and Bare 
independent but not otherwise. The addition rule P(A or B) = P(A) + P(B) rr) 
holds if A and B are mutually exclusive but not otherwise. Resist the tempta- 

tion to use these simple rules when the conditions that justify them are not 

present. 


©2008 by King Features Syndicate, inc. 
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Section 5.3 Conditional Probability and Independence 24331 


Sudden Infant Death 


Condemned by independence 


Assuming independence when it isn’t true can lead to disaster. Several mothers in 
England were convicted of murder simply because two of their children had died 
in their cribs with no visible cause. An “expert witness” for the prosecution said 
that the probability of an unexplained crib death in a nonsmoking middle-class 
family is 1/8500. He then multiplied 1/8500 by 1/8500 to claim that there is only 
a |-in-72-million chance that two children in the same family could have died 
naturally. This is nonsense: it assumes that crib deaths are independent, and data 
suggest that they are not. Some common genetic or environmental cause, not 
murder, probably explains the deaths. 


THINK Is there a connection between mutually exclusive and 
independent? Let's start with a new chance process. Choose a U.S. adult 

ABOUT IT at random. Define event A: the person is male, and event B: the person is preg- 
nant. It’s fairly clear that these two events are mutually exclusive (can’t happen 

together)! What about independence? If you know that event A has occurred, 

does this change the probability that event B happens? Of course! If we know 

the person is male, then the chance that the person is pregnant is 0. Because 

P(B | A) # P(B), the two events are not independent. Two mutually exclusive 

events can never be independent, because if one event happens, the other event is 


guaranteed not to happen. 


CHECK YOUR UNDERSTANDING 


1. During World War II, the British found that the probability that a bomber is lost 
through enemy action on a mission over occupied Europe was 0.05. Assuming that 
missions are independent, find the probability that a bomber returned safely from 20 
missions. 

2. Government data show that 8% of adults are full-time college students and that 30% 


of adults are age 55 or older. Because (0.08)(0.30) = 0.024, can we conclude that about 
2.4% of adults are college students 55 or older? Why or why not? 
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Calculated Risks 


The chapter-opening Case Study on page 287 described drug-testing programs 
for high school athletes. Suppose that 16% of the high school athletes in a 
large school district have taken a banned substance. The drug test used by this 
district has a false positive rate of 5% and a false negative rate of 10%. Use what 
you have learned in this chapter to help answer the following questions about 
the district’s drug-testing program. Show your method clearly. 


What’s the probability that a randomly chosen athlete tests posi- 
tive for banned substances? 

If two athletes are randomly selected, what’s the probability that at 
least one of them tests positive? 

If a randomly chosen athlete tests positive, what’s the probability 
that the student did not take a banned substance? Based on your 
answer, do you think that an athlete who tests positive should be 
suspended from athletic competition for a year? Why or why not? 
Ifa randomly chosen athlete tests negative, what’s the probability 
that the student took a banned substance? Explain why it makes 
sense for the drug-testing process to be designed so that this prob- 
ability is less than the one you found in Question 3. 

The district decides to immediately retest any athlete who tests 
positive. Assume that the results of an athlete’s two tests are inde- 
pendent. Find the probability that a student who gets a positive 
result on both tests actually took a banned substance. Based on 
your answer, do you think that an athlete who tests positive twice 
should be suspended from athletic competition for a year? Why 
or why not? 


Section 5.3 Summary 


e If one event has happened, the chance that another event will happen is a 
conditional probability. The notation P(B | A) represents the probability 
that event B occurs given that event A has occurred. 


e You can calculate conditional probabilities with the conditional probability 
formula 


P(AMB) 


P(A | B) = PB) 


63. 
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e The general multiplication rule states that the probability of events A and B 
occurring together is 


P(A and B) = P(ANMB) = P(A) - P(B| A) 


e¢ When chance behavior involves a sequence of outcomes, a tree diagram can 
be used to describe the sample space. Tree diagrams can also help in finding 
the probability that two or more events occur together. We simply multiply 
along the branches that correspond to the outcomes of interest. 


e¢ When knowing that one event has occurred does not change the probability 
that another event happens, we say that the two events are independent. For 
independent events A and B, P(A | B) = P(A) and P(B | A) = P(B). If two 
events A and B are mutually exclusive (disjoint), they cannot be independent. 


e In the special case of independent events, the multiplication rule becomes 


P(A and B) = P(AN B) = P(A) - P(B) 


Exercises 


Get rich A survey of 4826 randomly selected young 
adults (aged 19 to 25) asked, “What do you think 

are the chances you will have much more than a 
middle-class income at age 30?” The two-way table 
shows the responses.'® Choose a survey respondent at 
random. 


Gender 
Opinion Female Male Total 
Almost no chance 96 98 194 
Some chance but probably not 426 286 712 
A 50-50 chance 696 720 1416 
A good chance 663 758 1421 
Almost certain 486 597 1083 
Total 2367 2459 4826 


Given that the person selected is male, what's the 
probability that he answered “almost certain”? 


If the person selected said “some chance but probably 
not,” what’s the probability that the person is female? 


A Titanic disaster In 1912 the luxury liner ‘Titanic, 
on its first voyage across the Atlantic, struck an 
iceberg and sank. Some passengers got off the ship 
in lifeboats, but many died. The two-way table gives 
information about adult passengers who lived and 
who died, by class of travel. Suppose we choose an 
adult passenger at random. 


Survival Status 


Class of Travel Survived Died 
First class 197 122 
Second class 94 167 
Third class 151 476 


Given that the person selected was in first class, 
what’s the probability that he or she survived? 


If the person selected survived, what’s the probability 
that he or she was a third-class passenger? 


Sampling senators The two-way table describes the 
members of the U.S. Senate in a recent year. Sup- 
pose we select a senator at random. Consider events 
D: is a democrat, and F: is female. 


Male Female 
Democrats 47 13 
Republicans 36 4 


Find P(D | F). Explain what this value means. 
Find P(F | D). Explain what this value means. 


. Who eats breakfast? The following two-way table 


describes the 595 students who responded to a school 
survey about eating breakfast. Suppose we select a 
student at random. Consider events B: eats breakfast 
regularly, and M: is male. 
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Male Female Total 
Eats breakfast regularly 190 110 300 
Doesn't eat breakfast regularly 130 165 295 
Total 320 275 595 

(a) Find P(B | M). Explain what this value means. 

(b) Find P(M | B). Explain what this value means. 

67. Foreign-language study Choose a student in grades 
9 to 12 at random and ask if he or she is studying a 
language other than English. Here is the distribution 
of results: 

Language: Spanish French German Allothers None 

Probability: 0.26 0.09 0.03 0.03 0.59 

(a) What’s the probability that the student is studying a 
language other than English? 

(b) What is the conditional probability that a student 
is studying Spanish given that he or she is studying 
some language other than English? 

68. Income tax returns Here is the distribution of the 
adjusted gross income (in thousands of dollars) reported 
on individual federal income tax returns in a recent year: 

Income: <5) 25-49 50-99 100-499 =500 
Probability: 0.431 0.248 =0.215 0.100 0.006 

(a) What is the probability that a randomly chosen return 
shows an adjusted gross income of $50,000 or more? 

(b) Given that a return shows an income of at least 
$50,000, what is the conditional probability that the 
income is at least $100,000? 

69. ‘Tall people and basketball players Select an adult 
at random. Define events T’: person is over 6 feet 
tall, and B: person is a professional basketball player. 
Rank the following probabilities from smallest to 
largest. Justify your answer. 

PUD) 7 PB) RB) 8 |) 
70. ‘Teachers and college degrees Select an adult at ran- 
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dom. Define events A: person has earned a college 
degree, and T: person’s career is teaching. Rank the 
following probabilities from smallest to largest. Justify 
your answer. 


P(A) P(T) P(A|T)  P(T|A) 


Facebook versus YouTube A recent survey suggests 
that 85% of college students have posted a profile on 
Facebook, 73% use YouTube regularly, and 66% do 
both. Suppose we select a college student at random 
and learn that the student has a profile on Facebook. 
Find the probability that the student uses YouTube 
regularly. Show your work. 
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Mac or PC? A recent census at a major university 
revealed that 40% of its students mainly used Ma- 
cintosh computers (Macs). The rest mainly used 
PCs. At the time of the census, 67% of the school’s 
students were undergraduates. The rest were gradu- 
ate students. In the census, 23% of the respondents 
were graduate students who said that they used PCs 
as their primary computers. Suppose we select a 
student at random from among those who were part 
of the census and learn that the student mainly uses 
a PC. Find the probability that this person is a gradu- 
ate student. Show your work. 


. Free downloads? Illegal music downloading 


has become a big problem: 29% of Internet users 
download music files, and 67% of downloaders say 
they don’t care if the music is copyrighted.'’ What 
percent of Internet users download music and don’t 
care if it’s copyrighted? Write the information given 
in terms of probabilities, and use the general multi- 
plication rule. 


At the gym Suppose that 10% of adults belong to 
health clubs, and 40% of these health club members 
go to the club at least twice a week. What percent of 
all adults go to a health club at least twice a week? 
Write the information given in terms of probabilities, 
and use the general multiplication rule. 


Box of chocolates According to Forrest Gump, “Life 
is like a box of chocolates. You never know what 
you're gonna get.” Suppose a candy maker offers a 
special “Gump box” with 20 chocolate candies that 
look the same. In fact, 14 of the candies have soft 
centers and 6 have hard centers. Choose 2 of the 
candies from a Gump box at random. 


Draw a tree diagram that shows the sample space of 
this chance process. 


Find the probability that one of the chocolates has a 
soft center and the other one doesn’t. 


Inspecting switches A shipment contains 10,000 
switches. Of these, 1000 are bad. An inspector draws 
2 switches at random, one after the other. 


Draw a tree diagram that shows the sample space of 
this chance process. 


Find the probability that both switches are defective. 


Fill er up! In a recent month, 88% of automobile 
drivers filled their vehicles with regular gasoline, 
2% purchased midgrade gas, and 10% bought pre- 
mium eee Of those who bought regular gas, 28% 
paid with a credit card; of customers who bought 
midgrade and premium gas, 34% and 42%, respec- 
tively, paid with a credit card. Suppose we select a 
customer at random. 
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Draw a tree diagram to represent this situation. 


Find the probability that the customer paid with a 
credit card. Show your work. 


Given that the customer paid with a credit card, find 
the probability that she bought premium gas. Show 
your work. 


Urban voters ‘The voters in a large city are 40% 
white, 40% black, and 20% Hispanic. (Hispanics may 
be of any race in official statistics, but here we are 
speaking of political blocks.) A mayoral candidate an- 
ticipates attracting 30% of the white vote, 90% of the 
black vote, and 50% of the Hispanic vote. Suppose 
we select a voter at random. 


Draw a tree diagram to represent this situation. 


Find the probability that this voter votes for the may- 
oral candidate. Show your work. 


Given that the chosen voter plans to vote for the 
candidate, find the probability that the voter is black. 
Show your work. 


Lactose intolerance Lactose intolerance causes 
difficulty in digesting dairy products that contain lac- 
tose (milk sugar). It is particularly common among 
people of African and Asian ancestry. In the United 
States (ignoring other groups and people who con- 
sider themselves to belong to more than one race), 
82% of the population is white, 14% is black, and 4% 
is Asian. Moreover, 15% of whites, 70% of blacks, 
and 90% of Asians are lactose intolerant.!? Suppose 
we select a U.S. person at random. 


What is the probability that the person is lactose 
intolerant? Show your work. 


Given that the person is lactose intolerant, find the 
probability that he or she is Asian. Show your work. 


Fundraising by telephone ‘Tree diagrams can or- 
ganize problems having more than two stages. The 
figure at top right shows probabilities eu a charity 
calling potential donors by telephone.”? Each 
person called is either a recent donor, a past donor, 
or a new prospect. At the next stage, the person 
called either does or does not pledge to contribute, 
with conditional probabilities that depend on the 
donor class to which the person belongs. Finally, 
those who make a pledge either do or don’t actually 
make a contribution. Suppose we randomly select a 
person who is called by the charity. 


What is the probability that the person contributed to 
the charity? Show your work. 


Given that the person contributed, find the probabil- 
ity that he or she is a recent donor. Show your work. 
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Contribute 
Pledge 
Recent 
donor 


No pledge 


0.5 Contribute 
Pledge 
0.3 Past 
donor 
0.7 No pledge 
0.2 : 
Contribute 
Pledge = 
<a Not 
ae 
No pledge 


HIV testing Enzyme immunoassay (EIA) tests 
are used to screen blood specimens for the pres- 
ence of antibodies to HIV, the virus that causes 
AIDS. Antibodies indicate the presence of the 
virus. The test is quite accurate but is not always 
correct. Here are approximate probabilities of 
positive and negative EIA outcomes when the 
blood tested does and does not actually contain 
antibodies to HIV:7! 


Test Result 
Truth aE = 
Antibodies present 0.9985 0.0015 
Antibodies absent 0.006 0.994 


Suppose that 1% of a large population carries anti- 
bodies to HIV in their blood. We choose a person 
from this population at random. Given that the EIA 
test is positive, find the probability that the person 
has the antibody. Show your work. 


Testing the test Are false positives too common 

in some medical tests? Researchers conducted an 
experiment involving 250 patients with a medical 
condition and 750 other patients who did not have 
the medical condition. ‘The medical technicians who 
were reading the test results were unaware that they 
were subjects in an experiment. 


Technicians correctly identified 240 of the 

250 patients with the condition. They also identi- 
fied 50 of the healthy patients as having the condi- 
tion. What were the false positive and false negative 
rates for the test? 


Given that a patient got a positive test result, what 
is the probability that the patient actually had the 
medical condition? Show your work. 
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Get rich Refer to Exercise 63. 
Find P(“a good chance” | female). 
Find P(“a good chance’). 


Use your answers to (a) and (b) to determine whether 
the events “a good chance” and “female” are inde- 
pendent. Explain your reasoning. 


A Titanic disaster Refer to Exercise 64. 
Find P(survived | second class). 
Find P(survived). 


Use your answers to (a) and (b) to determine whether 
the events “survived” and “second class” are indepen- 
dent. Explain your reasoning. 


Sampling senators Refer to Exercise 65. Are events 
D and F independent? Justify your answer. 


Who eats breakfast? Refer to Exercise 66. Are 
events B and M independent? Justify your answer. 


Rolling dice Suppose you roll two fair, six-sided 
dice—one red and one green. Are the events “sum 
is 7” and “green die shows a 4” independent? Justify 
your answer. 


Rolling dice Suppose you roll two fair, six-sided 
dice—one red and one green. Are the events “sum 
is 8” and “green die shows a 4” independent? Justify 
your answer. 


Bright lights? A string of Christmas lights contains 
20 lights. ‘The lights are wired in series, so that if any 
light fails, the whole string will go dark. Each light 
has probability 0.02 of failing during a 3-year period. 
The lights fail independently of each other. Find the 
probability that the string of lights will remain bright 
for 3 years. 


Common names ‘lhe Census Bureau says that the 
10 most common names in the United States are 

(in order) Smith, Johnson, Williams, Brown, Jones, 
Miller, Davis, Garcia, Rodriguez, and Wilson. These 
names account for 9.6% of all U.S. residents. Out 

of curiosity, you look at the authors of the textbooks 
for your current courses. There are 9 authors in all. 
Would you be surprised if none of the names of these 
authors were among the 10 most common? (Assume 
that authors’ names are independent and follow the 
same probability distribution as the names of all 
residents.) 


Universal blood donors People with type O-nega- 
tive blood are universal donors. ‘That is, any patient 
can receive a transfusion of O-negative blood. Only 
7.2% of the American population have O-negative 
blood. If we choose 10 Americans at random who 
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gave blood, what is the probability that at least 1 of 
them is a universal donor? 


Lost Internet sites Internet sites often vanish or 
move, so that references to them can’t be followed. 
In fact, 13% of Internet sites referenced in major 
scientific journals are lost within two years after 
publication.” If we randomly select seven Internet 
references, from scientific journals, what is the 
probability that at least one of them doesn’t work two 
years later? 


Late shows Some TV shows begin after their sched- 
uled times when earlier programs run late. Accord- 
ing to a network’s records, about 3% of its shows start 
late. ‘To find the probability that three consecutive 
shows on this network start on time, can we multiply 


(0.97)(0.97)(0.97)2 Why or why not? 


Late flights An airline reports that 85% of its flights 
arrive on time. To find the probability that its next 
four flights into LaGuardia Airport all arrive on time, 
can we multiply (0.85)(0.85)(0.85)(0.85)? Why or 


why not? 


The geometric distributions You are tossing a 

pair of fair, six-sided dice in a board game. Tosses 

are independent. You land in a danger zone that 
requires you to roll doubles (both faces showing the 
same number of spots) before you are allowed to play 
again. How long will you wait to play again? 


What is the probability of rolling doubles on a single 
toss of the dice? (If you need review, the possible 
outcomes appear in Figure 5.2 (page 306). All 36 
outcomes are equally likely.) 


What is the probability that you do not roll doubles 
on the first toss, but you do on the second toss? 


What is the probability that the first two tosses are 
not doubles and the third toss is doubles? This is the 
probability that the first doubles occurs on the third 
toss. 


Now you see the pattern. What is the probability 
that the first doubles occurs on the fourth toss? 
On the fifth toss? Give the general result: what 
is the probability that the first doubles occurs on 
the kth toss? 


The probability of a flush A poker player holds 
a flush when all 5 cards in the hand belong to 
the same suit. We will find the probability of a 
flush when 5 cards are dealt. Remember that a 
deck contains 52 cards, 13 of each suit, and that 
when the deck is well shuffled, each card dealt 
is equally likely to be any of those that remain in 
the deck. 
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(a) We will concentrate on spades. What is the prob- 
ability that the first card dealt is a spade? What is 
the conditional probability that the second card is a 
spade given that the first is a spade? 


(b) Continue to count the remaining cards to find the 
conditional probabilities of a spade on the third, the 
fourth, and the fifth card given in each case that all 
previous cards are spades. 


(c) ‘The probability of being dealt 5 spades is the product 
of the five probabilities you have found. Why? What 
is this probability? 


(d) ‘The probability of being dealt 5 hearts or 5 diamonds 
or 5 clubs is the same as the probability of being 
dealt 5 spades. What is the probability of being dealt 
a flush? 


Multiple choice: Select the best answer for Exercises 

97 to. 99. 

97. An athlete suspected of using steroids is given two 
tests that operate independently of each other. ‘Test 
A has probability 0.9 of being positive if steroids 
have been used. ‘Test B has probability 0.8 of being 
positive if steroids have been used. What is the prob- 
ability that neither test is positive if steroids have 
been used? 


fey (c) 0.02 (e) 0.08 
(b) 0.38 (d) 0.28 


98. In an effort to find the source of an outbreak of food 
poisoning at a conference, a team of medical detec- 
tives carried out a study. ‘They examined all 50 peo- 
ple who had food poisoning and a random sample of 
200 people attending the conference who didn’t get 
food poisoning. The detectives found that 40% of the 
people with food poisoning went to a cocktail party 
on the second night of the conference, while only 
10% of the people in the random sample attended 
the same party. Which of the following statements 
is appropriate for describing the 40% of people who 
went to the party? (Let F = got food poisoning and 
A = attended party.) 


(a) P(F|A) = 0.40 (d) P(AC|F) = 0.40 
(b) P(A| FC) = 0.40 (ec) P(A|F) = 0.40 
(c) P(F| AC) = 0.40 


99. Suppose a loaded die has the following probability 


model: 


Outcome: 1 2 3 4 5 6 
Probability: 0.3 0.1 0.1 0.1 0.1 0.3 


If this die is thrown and the top face shows an 
odd number, what is the probability that the die 
shows a 1? 


0.10 (d) 0.50 
0.17 (ce) 0.60 
0.30 


Exercises 100 and 101 refer to the following setting. Your 
body mass index (BMI) is your weight in kilograms 
divided by the square of your height in meters. Online 
BMI calculators allow you to enter weight in pounds 
and height in inches. High BMI is a common but 
controversial indicator of being overweight or obese. A 
study by the National Center for Health Statistics found 
that the BMI of American young women (ages 20 to 29) 
is approximately Normal with mean 26.8 and standard 
deviation 7.4.7? 


100. BMI (2.2) People with BMI less than 18.5 are 
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often classed as “underweight.” What percent of 
young women are underweight by this criterion? 
Sketch and shade the area of interest under a 
Normal curve. 


BMI (5.2) Suppose we select two American young 
women in this age group at random. Find the 
probability that at least one of them is classified as 
underweight. Show your work. 


. Life at work (1.1) The University of Chicago’s 


General Social Survey asked a representative sam- 
ple of adults this question: “Which of the following 
statements best describes how your daily work is or- 
ganized? (1) I am free to decide how my daily work 
is organized. (2) I can decide how my daily work is 
organized, within certain limits. (3) I am not free 
to decide how my daily work is organized.” Here is 
a two-way table of the responses for three levels of 
education:”* 


Highest Degree Completed 
Response Less than High School HighSchool Bachelor’s 
1 31 161 81 
2 49 269 85 
3 47 112 14 


Do these data suggest that there is an association 
between level of education and freedom to or- 
ganize one’s work in the adult population? Give 
appropriate evidence to support your answer. 
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Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam (c) Describe how to use a table of random digits to 
free response questions. Your task is to generate a complete, con- estimate the probability that 2 or fewer of the 
cise response in 15 minutes. 4 randomly selected students completed the 
assignment. 

Directions: Show all your work. Indicate clearly the methods (d) 
you use, because you will be scored on the correctness of your 

methods as well as on the accuracy and completeness of your 

results and explanations. 


Complete three repetitions of your simulation us- 
ing the random digits below and use the results to 
estimate the probability described in part (c). 


A statistics teacher has 40 students in his class, 23 females 12975 13258 13048 45144 72321 81940 00360 02428 
and 17 males. At the beginning of class on a Monday, the 96767 35964 23822 96012 94951 65194 50842 55372 


teacher planned to spend time reviewing an assignment due 

that day. Unknown to the teacher, only 19 of the females and 37609 59057 66967 83401 60705 02384 90597 93600 

11 of the males had completed the assignment. The teacher 

plans to randomly select students to do problems from the After you finish, you can view two example solutions on the book’s 

assignment on the whiteboard. Web site (www.whfreeman.com/tps5e). Determine whether you 
(a) What is the probability that a randomly selected think each solution is “complete,” “substantial,” “developing,” 

student has completed the assignment? or “minimal.” If the solution is not complete, what improvements 

would you suggest to the student who wrote it? Finally, your teach- 

er will provide you with a scoring rubric. Score your response and 

note what, if anything, you would do differently to improve your 

own score. 


(b) Are the events “selecting a female” and “selecting a 
student who completed the assignment” indepen- 
dent? Justify your answer. 

Suppose that the teacher randomly selects 4 students to 

do a problem on the whiteboard and only 2 of the students 
had completed the assignment. 


Chapter Review 


Section 5.1: Randomness, Probability, and Simulation a simulation, use the familiar four-step process: state the 
In this section, you learned about the law of large num- question of interest, plan how to use a chance device to 
bers and the idea of probability. The law of large num- imitate a process, do many repetitions, and make a conclu- 
bers says that when you repeat a chance process many, sion based on the results. If you are using random digits to 
many times, the relative frequency of an outcome will perform your simulation, be sure to consider whether or 
approach a single number. This single number is called not digits can be repeated within each trial. 


the probability of the outcome—how often we expect 


the outcome to occur in a very large number of repeti- Section 5.2: Probability Rules 


tions of the chance process. Make sure to remember In this section, you learned that chance behavior can be de- 
the “large” part of the law of large numbers. Although scribed by a probability model. Probability models have two 
clear patterns emerge in a large number of repetitions, parts, a list of possible outcomes (the sample space) and a 
we shouldn’t expect such regularity in a small number probability for each outcome. The probability of each out- 
of repetitions. come in a probability model must be between 0 and 1, and 

Simulations are powerful tools that we can use to imitate the probabilities of all the outcomes in the sample space 


chance processes and estimate probabilities. ‘To perform must add to 1. 


An event is a subset of the possible outcomes of a chance 
process. ‘he complement rule says that the probability that 
an event occurs is 1 minus the probability that the event 
doesn’t occur. In symbols, the complement rule says that 
P(E) = 1 — P(E®). Given two events A and B from some 
chance process, use the general addition rule to find the 
probability that event A or event B occurs: 


P(A or B) = P(A UB) = P(A) + P(B) — P(AM B) 


If the events A and B have no outcomes in common, 
use the addition rule for mutually exclusive events: 
P(A UB) = P(A) + P(B). 

Finally, you learned how to use two-way tables and Venn 
diagrams to display the sample space for a chance process 
involving two events. Using a two-way table or a Venn dia- 
gram is a helpful way to organize information and calculate 
probabilities involving the union (A or B) and the intersec- 
tion (A and B) of two events. 


Section 5.3: Conditional Probability and Independence 


In this section, you learned that a conditional probability 
describes the probability of an event occurring given that an- 
other event is known to have already occurred. ‘To calculate 


What Did You Learn? 


Learning Objective 


Section 


the probability that event A occurs given that event B has 
occurred, use the formula 
P(AMB) _ P(Aand B) 


PONE) = pe) PB) 


‘Two-way tables and tree diagrams are useful ways to organize 
the information provided in a conditional probability prob- 
lem. Two-way tables are best when the problem describes 
the number or proportion of cases with certain characteris- 
tics. Tree diagrams are best when the problem provides the 
conditional probabilities of different events or describes a 
sequence of events. 

Use the general multiplication rule for calculating the 
probability that event A and event B both occur: 


P(A and B) = P(AM B) = P(A): P(B | A) 


If knowing that event B occurs doesn’t change the probabil- 
ity that event A occurs, then events A and B are independent. 
‘That is, events A and B are independent if P(A | B) = P(A). If 
events A and B are independent, use the multiplication rule 
for independent events to find the probability that events A 
and B both occur: P(AM B) = P(A) - P(B). 


Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Interpret probability as a long-run relative frequency. 


Bl 291, 292, 293, 294, 295 Rd.1 


Use simulation to model chance behavior. 


Ohl 296, 297 R8.2 


Determine a probability model for a chance process. 


O2 306 R95.3, R5.10 


Use basic probability rules, including the complement rule and the 
addition rule for mutually exclusive events. 


o:2 308 R9.4, R5.10 


Use a two-way table or Venn diagram to model a chance process 
and calculate probabilities involving two events. 


Rd.4, R5.5, 


O:2 309, 312, 313 R5.7, R5.8 


Use the general addition rule to calculate probabilities. 


2 313 Ro.4, R5.5 


Calculate and interpret conditional probabilities. 


5.3 318, 320 R9.6, R5.8 


Use the general multiplication rule to calculate probabilities. 


5.3 322 R5.6 


Use tree diagrams to model a chance process and calculate prob- 
abilities involving two or more events. 


5.3 322, 324, 325 RO.6 


Determine whether two events are independent. 


5.3 327 RO.7, R5.8 


When appropriate, use the multiplication rule for independent 
events to compute probabilities. 


5.3 329, 331 R9.9, R5.10 
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Chapter 5 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R5.1 Rainy days The TV weatherman says, “There’s a 
30% chance of rain tomorrow.” Explain what this 
statement means. 


R5.2 Click it or else From police records, it has been 
determined that 15% of drivers stopped for routine 
license checks are not wearing seat belts. Ifa police 
officer stops 10 vehicles, how likely is it that two 
consecutive drivers won't be wearing their seat belts? 


— 
~ 
—_ 


Describe the design of a simulation to estimate this 
probability. Explain clearly how you will use the 
partial table of random digits below to carry out 
your simulation. 

Carry out three repetitions of the simulation. Copy 
the random digits below onto your paper. ‘Then mark 
on or directly above the table to show your results. 


S 


29077, Se 14803 OlO83 47052 O20 274 e025 
S052 80s SIZ Wallko svi 7oll 
27102 =56027 =55892 33063 = 41842 = 81868 
2133 0) to 10) 2 LO OO eum 176) moll OG 


R5.3 Weird dice Nonstandard dice can produce interest- 
ing distributions of outcomes. Suppose you have 
two balanced, six-sided dice. Die A has faces with 2, 
2, 2, 2, 6, and 6 spots. Die B has faces with 1, 1, 1, 
5, 5, and 5 spots. Imagine that you roll both dice at 
the same time. 


(a) Find a probability model for the difference (Die A — 
Die B) in the total number of spots on the up-faces. 

(b) Which die is more likely to roll a higher number? 
Justify your answer. 

R5.4 Race and ethnicity ‘he Census Bureau allows each 
person to choose from a long list of races. That is, in 
the eyes of the Census Bureau, you belong to what- 
ever race you say you belong to. Hispanic (also called 


Latino) is a separate category. Hispanics may be of any 
race. If we choose a resident of the United States at 
random, the Census Bureau gives these probabilities:” 


Hispanic Not Hispanic 


Asian 0.001 0.044 
Black 0.006 0.124 
White 0.139 0.674 
Other 0.003 0.009 


(a) Verify that this is a legitimate assignment of 
probabilities. 

(b) What is the probability that a randomly chosen 
American is Hispanic? 

(c) Non-Hispanic whites are the historical majority in 
the United States. What is the probability that a 
randomly chosen American is not a member of this 
group? 

(d) Explain why P(white or Hispanic) # P(white) + 
P(Hispanic). Then find P(white or Hispanic). 

R5.5 In 2012, fans at Arizona Diamondbacks home 
games would win 3 free tacos from ‘Taco Bell if the 
Diamondbacks scored 6 or more runs. In the 2012 
season, the Diamondbacks won 41 of their 81 home 
games and gave away free tacos in 30 of their 81 
home games. In 26 of the games, the Diamond- 
backs won and gave away free tacos. Choose a 
Diamondbacks home game at random. 


(a) Make a Venn diagram to model this chance process. 

(b) What is the probability that the Diamondbacks lost 
and did not give away free tacos? 

(c) What is the probability that the Diamondbacks won 


the game or fans got free tacos? 


R5.6 Steroids A company has developed a drug test to 
detect steroid use by athletes. The test is accurate 
95% of the time when an athlete has taken steroids. 
It is 97% accurate when an athlete hasn’t taken 
steroids. Suppose that the drug test will be used in 
a population of athletes in which 10% have actually 


taken steroids. Let’s choose an athlete at random 
and administer the drug test. 


(a) Make a tree diagram showing the sample space of 
this chance process. 

(b) What’s the probability that the randomly selected 
athlete tests positive? Show your work. 

(c) Suppose that the chosen athlete tests positive. 
What's the probability that he or she actually used 
steroids? Show your work. 


R5.7 Mike’s pizza You work at Mike’s pizza shop. You 
have the following information about the 7 pizzas 
in the oven: 3 of the 7 have thick crust and 2 of 
the 3 thick crust pizzas have mushrooms. Of the 
remaining 4 pizzas, 2 have mushrooms. Choose a 
pizza at random from the oven. 


(a) Make a two-way table to model this chance process. 
(b) Are the events “getting a thick-crust pizza” and “get- 


ting a pizza with mushrooms” independent? Explain. 


(c) You add an eighth pizza to the oven. This pizza has 
thick crust with only cheese. Now are the events 
“getting a thick-crust pizza” and “getting a pizza 
with mushrooms” independent? Explain. 

R5.8 Deer and pine seedlings As suburban gardeners 
know, deer will eat almost anything green. Ina 
study of pine seedlings at an environmental center 
in Ohio, researchers noted how deer damage varied 
with how much of the seedling was covered by 
thorny undergrowth: 


Deer Damage 
Thorny Cover Yes No 
None 60 181 
<al/a) 76 158 
1/3 to 2/3 44 177 
>2/3 29 176 


(a) What is the probability that a randomly selected 
seedling was damaged by deer? 


(b) What are the conditional probabilities that a ran- 
domly selected seedling was damaged, given each 
level of cover? 


(c) Does knowing about the amount of thorny cover on 
a seedling change the probability of deer damage? 
Justify your answer. 
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R5.9 A random walk on Wall Street? The “random 
walk” theory of stock prices holds that price move- 
ments in disjoint time periods are independent of 
each other. Suppose that we record only whether 
the price is up or down each year, and that the 
probability that our portfolio rises in price in any 
one year is 0.65. (This probability is approximately 
correct for a portfolio containing equal dollar 
amounts of all common stocks listed on the New 
York Stock Exchange.) 


(a) What is the probability that our portfolio goes up 
for three consecutive years? 


(b) What is the probability that the portfolio’s value 
moves in the same direction (either up or down) 
for three consecutive years? 


R5.10 Blood types Each of us has an ABO blood type, 
which describes whether two characteristics called 
A and B are present. Every human being has two 
blood type alleles (gene forms), one inherited from 
our mother and one from our father. Each of these 
alleles can be A, B, or O. The two that we inherit 
determine our blood type. The table shows what 
our blood type is for each combination of two al- 
leles. We inherit each of a parent’s two alleles with 
probability 0.5. We inherit independently from our 
mother and father. 


Alleles inherited Blood type 
AandA A 
AandB AB 
A and 0 A 
BandB B 
B and O B 
O and O 6) 


— 
fo 
— 


Hannah and Jacob both have alleles A and B. 
Diagram the sample space that shows the alleles 
that their next child could receive. Then give the 
possible blood types that this child could have, 
along with the probability for each blood type. 


(b) Jennifer has alleles A and O. Jose has alleles A and 
B. They have two children. What is the probability 
that at least one of the two children has blood type 
B? Show your method. 
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PROBABILITY: WHAT ARE THE CHANCES? 


Chapter 5 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


T5.1 Dr. Stats plans to toss a fair coin 10,000 times in the 
hope that it will lead him to a deeper understanding 
of the laws of probability. Which of the following state- 


ments is true? 


(a) [tis unlikely that Dr. Stats will get more than 5000 
heads. 

(b) Whenever Dr. Stats gets a string of 15 tails in a row, it 
becomes more likely that the next toss will be a head. 

(c) The fraction of tosses resulting in heads should be 
exactly 1/2. 

(d) ‘The chance that the 100th toss will be a head de- 
pends somewhat on the results of the first 99 tosses. 

(e) Itis likely that Dr. Stats will get about 50% heads. 


T5.2 China has 1.2 billion people. Marketers want to know 
which international brands they have heard of. 
A large study showed that 62% of all Chinese adults 
have heard of Coca-Cola. You want to simulate choos- 
ing a Chinese at random and asking if he or she has 
heard of Coca-Cola. One correct way to assign random 
digits to simulate the answer is: 


(a) One digit simulates one person’s answer; odd means 
“Yes” and even means “No.” 

(b) One digit simulates one person’s answer; 0 to 6 mean 
“Yes” and 7 to 9 mean “No. ” 

(c) One digit simulates the result; 0 to 9 tells how many 
in the sample said “Yes.” 

(d) ‘Two digits simulate one person’s answer; 00 to 61 
mean “Yes” and 62 to 99 mean “No. ” 

(e) ‘Two digits simulate one person’s answer; 00 to 62 
mean “Yes” and 63 to 99 mean “No. ” 

T5.3 Choose an American household at random and record 
the number of vehicles they own. Here is the prob- 
ability model if we ignore the few households that own 
more than 5 cars: 


Number of cars: 0 1 2 SC 4 5 
Probability: O09 OSGi OS Oe On Ch OLO SOLO? 


A housing company builds houses with two-car garages. 
What percent of households have more cars than the 
garage can hold? 

ay? (by 13% (e) 20% (di45Z. “(e) 552 

15.4 Computer voice recognition software is getting 

better. Some companies claim that their software 
correctly recognizes 98% of all words spoken by a 
trained user. To simulate recognizing a single word 


when the probability of being correct is 0.98, let two 
digits simulate one word; 00 to 97 mean “correct.” 
The program recognizes words (or not) indepen- 
dently. ‘To simulate the program’s performance on 10 
words, use these random digits: 


60970 70024 17868 29843 61790 90656 87964 


The number of words recognized correctly out of the 

10 is 
A Go js 
Questions T5.5 to T5.7 refer to the following setting. 
One thousand students at a city high school were 
classified according to both GPA and whether or not 
they consistently skipped classes. The two-way table 
below summarizes the data. Suppose that we choose a 
student from the school at random. 


GPA 
Skipped Classes <2.0 2.0-3.0 >3.0 
Many 80 25 5 
Few 175 450 265 
15.5 What is the probability that a student has a GPA 
under 2.0? 
(a) 0.227 (b) 0.255 (c) 0.450 (d) 0.475 (e) 0.506 


15.6 What is the probability that a student has a GPA 


under 2.0 or has skipped many classes? 
(a) 0.080 (b) 0.281 (c) 0.285 (d) 0.365 (e) 0.727 


15.7 What is the probability that a student has a GPA under 
2.0 given that he or she has skipped many classes? 


(a) 0.080 (b) 0.281 (c) 0.285 (d) 0.314 (e) 0.727 


15.8 For events A and B related to the same chance pro- 
cess, which of the following statements is true? 


(a) IfA and B are mutually exclusive, then they must be 
independent. 

(b) IfA and B are independent, then they must be mutu- 
ally exclusive. 

(c) IfA and B are not mutually exclusive, then they must 
be independent. 

(d) IfA and B are not independent, then they must be 
mutually exclusive. 

(e) IfA and B are independent, then they cannot be 
mutually exclusive. 


T5.9 Choose an American adult at random. The probabil- 


ity that you choose a woman is 0.52. The probability 
that the person you choose has never married is 0.25. 
The probability that you choose a woman who has 
never married is 0.11. The probability that the person 
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T5.10 A deck of playing cards has 52 cards, of which 12 


(a) 0.001 


are face cards. If you shuffle the deck well and turn 
over the top 3 cards, one after the other, what's the 
probability that all 3 are face cards? 


(c) 0.010 (e) 0.02 


you choose is either a woman or has never been mar- 
ried (or both) is therefore about 


(a) 0.77. (b) 0.66. (c) 0.44. (d) 0.38. 


(b) 0.005 — (d) 0.012 


(e) 0.13. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


15.11 Your teacher has invented a “fair” dice game to 
play. Here’s how it works. Your teacher will roll one 
fair eight-sided die, and you will roll a fair six-sided 
die. Each player rolls once, and the winner is the 
person with the higher number. In case of a tie, nei- 
ther player wins. The table shows the sample space 


15.13 Researchers are interested in the relationship be- 
tween cigarette smoking and lung cancer. Suppose 
an adult male is randomly selected from a particular 
population. The following table shows the probabil- 
ities of some events related to this chance process: 


of this chance process. Event Probability 
Smokes 0.25 
Teacher Rolls Smokes and gets cancer 0.08 
Does not smoke and does not get cancer 0.71 


You Roll 1 2 3 4 5 6 7 8 


(a) Find the probability that the individual gets cancer 
given that he is a smoker. Show your work. 

(b) Find the probability that the individual smokes or 
gets cancer. Show your work. 

(c) ‘Two adult males are selected at random. Find the 
probability that at least one of the two gets cancer. 
Show your work. 


(a) Let A be the event “your teacher wins.” Find P(A). 
(b) Let B be the event “you get a 3 on your first roll.” 
Find P(A U B). 
(c) Are events A and B independent? Justify your answer. 
15.12 Three machines—A, B, and C—are used to produce 


1 
2 
3 
4 
5 
6 
) 
) 15.14 Based on previous records, 17% of the vehicles passing 
through a tollbooth have out-ofstate plates. A bored 


tollbooth worker decides to pass the time by counting 
how many vehicles pass through until he sees two with 


a large quantity of identical parts at a factory. Machine 
A produces 60% of the parts, while Machines B and C 
produce 30% and 10% of the parts, respectively. His- 
torical records indicate that 10% of the parts produced 
by Machine A are defective, compared with 30% for 
Machine B and 40% for Machine C. 


out-of-state plates.”” 


Describe the design of a simulation to estimate the 
average number of vehicles it takes to find two with 
out-of-state plates. Explain clearly how you will use 
the partial table of random digits below to carry out 
your simulation. 


Perform three repetitions of the simulation you 
described in part (a). Copy the random digits below 
onto your paper. Then mark on or directly above 
the table to show your results. 


(a) Draw a tree diagram to represent this chance (b) 
process. 

(b) If we choose a part produced by one of these three 
machines, what’s the probability that it’s defective? 
Show your work. 


TOS 9203 UG 449 05059 Secs 31830 

(c) Ifa part is inspected and found to be defective, 53115 84469 94868 57967 05811 84514 
which machine is most likely to have produced it? Bal77 OO75S7 IFES W5582 SISMOG Hl435 
Give appropriate evidence to support your answer. FSOUN W300G ©3395 S504) I5ooo OosS% 
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Random 
Variables 


A Jury of Your Peers? 


Are accused criminals in the United States entitled to a “jury of their peers”? Sort of. The Sixth Amend- 
ment to the U.S. Constitution begins, “In all criminal prosecutions, the accused shall enjoy the right to 
a speedy and public trial, by an impartial jury of the State and district wherein the crime shall have been 
committed....” There is no mention of a “jury of your peers” in the Constitution or any of its amend- 
ments. However, an 1879 U.S. Supreme Court decision said that a jury should be chosen from a group 
“composed of the peers or equals [of the accused]; that is, of his neighbors, fellows, associates, persons 
having the same legal status in society as he holds.”! 

To meet the Sixth Amendment requirement of impartiality, most courts start by randomly selecting a 
large jury pool from the citizens who live in the court’s jurisdiction. The jurors for a given trial are then 
chosen from the jury pool in a process known as voir dire. Each prospective juror answers a set of ques- 
tions posed by the judge and the lawyers for both the prosecution and the defense. Depending on their 
answers, prospective jurors are excluded or seated on the jury. 

In one case that made it all the way to the Supreme Court, a defense lawyer in Michigan challenged 
the process of selecting the jury pool in the trial of his accused client. Here are the facts: 


e About 7.28% of the citizens in the court’s jurisdiction were black. 
e The jury pool had between 60 and 100 members, only 3 of whom were black. 


Is it plausible that a jury pool with so few black citizens could be chosen just by chance? 
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ACTIVITY 


MATERIALS: 


3 small paper cups per 
student; enough tap water 
for 2 cups per student and 
enough bottled water for 1 
cup per student; 1 six-sided 
die and 1 index card per 
student 


Introduction 


Do you drink bottled water or tap water? According to a recent report in U.S. 
Mayor Newspaper, about 75% of people drink bottled water regularly. Some peo- 
ple do so because they believe bottled water is safer than tap water. (‘There’s little 
evidence to support this belief.) Others say they prefer the taste of bottled water. 
Can people really tell the difference? 


Bottled Water versus Tap Water 


This Activity will give you and your classmates a chance to discover whether or 
not you can taste the difference between bottled water and tap water. 


1. Before class begins, your teacher will prepare numbered stations with cups of 
water. You will be given an index card with a station number on it. 


2. Go to the corresponding station. Pick up three cups (labeled A, B, and C) 
and take them back to your seat. 

3. Your task is to determine which one of the three cups contains the bottled wa- 
ter. Drink all the water in Cup A first, then the water in Cup B, and finally the 
water in Cup C. Write down the letter of the cup that you think held the bottled 
water. Do not discuss your results with any of your classmates yet! 


4. While you taste, your teacher will make a chart on the board like this one: 


Stationnumber Bottled watercup? — Truth 


2 5. When you are told to do so, go to the board and record your station 


The ABC News program 20/20 
set up a blind taste test in which 
people were asked to rate four 
different brands of bottled water 
and New York City tap water 
without knowing which they 
were drinking. Can you guess the 
result? Tap water came out the 
clear winner in terms of taste. 


number and the letter of the cup you identified as containing bottled 
water. 


6. Your teacher will now reveal the truth about the cups of drinking water. How 
many students in the class identified the bottled water correctly? What percent 
of the class is this? 


7. Let’s assume that no one in your class can distinguish tap water from bottled 
water. In that case, students would just be guessing which cup of water tastes differ- 
ent. If so, what’s the probability that an individual student would guess correctly? 


8. How many correct identifications would you need to see to be convinced that 
the students in your class aren’t just guessing? With your classmates, design and 
carry out a simulation to answer this question. What do you conclude about your 
class’s ability to distinguish tap water from bottled water? 


When Mr. Bullard’s class did the preceding Activity, 13 out of 21 students made 
correct identifications. If we assume that the students in his class can’t tell tap 
water from bottled water, then each one is basically guessing, with a 1/3 chance 
of being correct. So we’d expect about one-third of his 21 students, that is, about 
7 students, to guess correctly. How likely is it that 13 or more of his 21 students 
would guess correctly? To answer this question without a simulation, we need a 
different kind of probability model from the ones we saw in Chapter 5. 
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Section 6.1 introduces the concept of a random variable, a numerical outcome 
of some chance process (like the 13 students who guessed correctly in Mr. Bullard’s 
class). Each random variable has a probability distribution that gives us information 
about the likelihood that a specific event happens (like 13 or more correct guesses 
out of 21) and about what’s expected to happen if the chance behavior is repeated 
many times. Section 6.2 examines the effect of transforming and combining ran- 
dom variables on the shape, center, and spread of their probability distributions. In 
Section 6.3, we'll look at two random variables with probability distributions that 
are used enough to have their own names—binomial and geometric. 


Discrete and Continuous 
Random Variables 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 
e Compute probabilities using the probability distribution e Calculate and interpret the standard deviation 


of a discrete random variable. of a discrete random variable. 


e Calculate and interpret the mean (expected value) e Compute probabilities using the probability distribution 
of a discrete random variable. of certain continuous random variables. 


A probability model describes the possible outcomes of a chance process and the 
likelihood that those outcomes will occur. For example, suppose we toss a fair 
coin 3 times. The sample space for this chance process is 


HHH HHT HTH THH ATT TH TTH TIT 
Because there are 8 equally likely outcomes, the probability is 1/8 ®) 
Berk 


for each possible outcome. Define the variable X = the number of 
heads obtained. The value of X will vary from one set of tosses to 
another but will always be one of the numbers 0, 1, 2, or 3. How 
likely is X to take each of those values? It will be easier to answer 

this question if we group the possible outcomes by the num- 

ber of heads obtained: 


X = 0: TTT 

A= (HTT. “THY ‘TT 

X=2:HHT HTH THH 

X = 3: HHH 
We can summarize the probability distribution of X 
as follows: 
Value: 0 1 2 3 


Probability: 1/8 3/8 3/8 1/8 


Figure 6.1 on next page shows the probability distribution of X in graphical form. 
Notice the symmetric shape. 
We can use the probability distribution to answer questions about the variable X. 
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3/8 5 


2/8 4 


Probability 


1/8 + 


FIGURE 6.1 Histogram of the 
probability distribution for 

X = number of heads in three 
tosses of a fair coin. 


RANDOM VARIABLES 


Value 


What's the probability that we get at least one head in three 
tosses of the coin? In symbols, we want to find P(X = 1). We 
could add probabilities to get the answer: 
P(X = 1) = P(X = 1) + P(X = 2) + P(X =3) 
= 1/8 + 3/8 + 3/8 = 7/8 
Or we could use the complement rule from Chapter 5: 
P(X =1)=1-PX<1)=1-P(X=0) 
=1-18=7/8 
A numerical variable that describes the outcomes of a 
chance process (like X in the coin-tossing scenario) is called 


a random variable. The probability model for a random vari- 
able is its probability distribution. 


DEFINITION: Random variable and probability distribution 


A random variable takes numerical values that describe the outcomes of some 
chance process. The probability distribution of a random variable gives its possible 
values and their probabilities. 


There are two main types of random variables, corresponding to two types of 
probability distributions: discrete and continuous. 


Discrete Random Variables 


We have learned several rules of probability but only one way of assigning prob- 
abilities to events: assign a probability to every individual outcome, then add these 
probabilities to find the probability of any event. This idea works well if we can 
find a way to list all possible outcomes. We will call random variables having 
probability assigned in this way discrete random variables.’ The probability dis- 
tribution for a discrete random variable must have outcome probabilities that are 
between 0 and | and that add up to 1. 


DISCRETE RANDOM VARIABLES AND THEIR 
PROBABILITY DISTRIBUTIONS 


A discrete random variable X takes a fixed set of possible values with gaps 
between. The probability distribution of a discrete random variable X lists 
the values x; and their probabilities ;: 


Value: X] x2 -X3 re 
Probability: p; p2 p3  ... 
The probabilities p; must satisfy two requirements: 
1. Every probability p; isa number between 0 and 1. 
2. The sum of the probabilities is 1: p) + p2 + p3 +--- = 1. 


To find the probability of any event, add the probabilities p; of the particular 
values x; that make up the event. 
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The variable X in the coin-tossing example is a discrete random variable. We 
can list the possible values of X as 0, 1, 2, 3. Note that there are gaps between these 
values on a number line. The corresponding probabilities are all between 0 and 
1, and their sum is 1/8 + 3/8 + 3/8 + 1/8 = 1. 

Here’s an example of a discrete random variable that involves something a bit 
more serious than tossing coins. 


Apgar Scores: Babies’ Health at Birth | 


Discrete random variables 


In 1952, Dr. Virginia Apgar suggested five criteria for measuring a baby’s 
health at birth: skin color, heart rate, muscle tone, breathing, and response 
when stimulated. She developed a 0-1-2 scale to rate a newborn on each 
of the five criteria. A baby’s Apgar score is the sum of the ratings on each 
of the five scales, which gives a whole-number value from 0 to 10. Apgar 
scores are still used today to evaluate the health of newborns. 


What Apgar scores are typical? To find out, researchers recorded the Apgar 
scores of over 2 million newborn babies in a single year.* Imagine selecting 
one of these newborns at random. (That’s our chance process.) Define the 
random variable X = Apgar score of a randomly selected baby one minute 
after birth. The table below gives the probability distribution for X. 


Value: 0 ] 2 3 4 5 6 7 8 9 10 
Probability: 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


PROBLEM: 
(a) Show that the probability distribution for Xis legitimate. 
(b) Makea histogram of the probability distribution. Describe what you see. 


(c) Doctors decided that Apgar scores of 7 or higher indicate a healthy baby. What's the prob- 
ability that a randomly selected baby is healthy? 


SOLUTION: 


(a) The probabilities are all between O and 1, and they add up to 1. So this 
is a legitimate probability distribution. 

(b) Figure 6.2 shows a histogram of the probability distribution of X. Shape: 
The graph is skewed to the left and single-peaked. A randomly selected new- 
born will most likely have an Apgar score on the high end of the scale, which 
means that the baby was fairly healthy at birth. Center: From the probability 
distribution, we see that the median is 8. Spread: Apgar scores vary from O 
to 10. But most newborns receive scores between 4 and 10. 


Probability 


3 4 S 6 


aes (c) The probability of choosing a healthy baby is P(X= 7). We can calcu- 


late this probability as follows: 
FIGURE 6.2 Histogram showing TNT) ah 0) SN 2) att) = 10) 
the probability distribution of = 0.099 + 0.319 + 0.437 + 0.053 = 0.908 


the random variable X= Apgar —_ That is, we'd have about a 91% chance of randomly choosing a healthy baby. 
score of a randomly selected 


newborn one minute after birth. For Practice Try Exercise 
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Although this procedure was later Note that the probability of randomly selecting a newborn whose Apgar «4 
Rane TaD Pp ak WoeGreliy score is greater than or equal to 7 is not the same as the probability that the 


APGAR also represents the five scales: >; ‘ ; gs 
Appearance, Pulse, Grimace, Activity, baby’s Apgar score is strictly greater than 7. The latter probability is 


and Respiration. P(X > 7) = P(X = 8) + P(X = 9) + P(X = 10) 


= 0.319 + 0.437 + 0.053 = 0.809 


The outcome X = 7 is included in “greater than or equal to” and is not included 
in “greater than.” Be sure to confirm the values of interest when dealing with 
discrete random variables. 


ao/cueck YOUR UNDERSTANDING 

North Carolina State University posts the grade distributions for its courses online.’ Stu- 
dents in Statistics 101 in a recent semester received 26% A’s, 42% B’s, 20% C’s, 10% D’s, 
and 2% F’s. Choose a Statistics 101 student at random. The student’s grade on a four-point 
scale (with A = 4) is a discrete random variable X with this probability distribution: 


Value of X: 0 ] 2 3 4 
Probability: 0.02 0.10 0.20 0.42 0.26 


1. Say in words what the meaning of P(X = 3) is. What is this probability? 
2. Write the event “the student got a grade worse than C” in terms of values of the 
random variable X. What is the probability of this event? 


3. Sketch a graph of the probability distribution. Describe what you see. 


Mean (Expected Value) of a Discrete 
Random Variable 


When we analyzed distributions of quantitative data in Chapter 1, we made it 
a point to discuss their shape, center, and spread. We'll follow the same strategy 
with probability distributions of random variables. You can use what you learned 
earlier to describe the shape of a probability distribution histogram. We’ve al- 
ready seen examples of symmetric (number of heads in three coin tosses) and left- 
skewed (Apgar score of a randomly chosen baby) probability distributions. What 
about center and spread? 

The mean x of a set of observations is their average. The mean px of a discrete 
random variable X is also an average of the possible values of X but with an im- 
portant change to take into account the fact that not all outcomes may be equally 
likely. A simple example will show what we need to do. 


Winning (and Losing) at Roulette 


Finding the mean of a discrete random variable 
On an American roulette wheel, there are 38 slots numbered | through 36, plus 0 
and 00. Half of the slots from | to 36 are red; the other half are black. Both the 0 


and 00 slots are green. Suppose that a player places a simple $1 bet on red. If the 
ball lands in a red slot, the player gets the original dollar back, plus an additional 
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dollar for winning the bet. If the ball lands in a different-colored slot, the 
player loses the dollar bet to the casino. 


Let’s define the random variable X = net gain from a single $1 bet on red. 
The possible values of X are —$1 and $1. (‘The player either gains a dollar 


as) G or loses a dollar.) What are the corresponding probabilities? The chance 
= ele that the ball lands in a red slot is 18/38. The chance that the ball lands in 


in a different-colored slot is 20/38. Here is the probability distribution of X: 


Value: -$1 $1 
Probability: 20/38 18/38 


What is the player’s average gain? The ordinary average of the two possible out- 
comes —$]1 and $1 is $0. But $0 isn’t the average winnings because the player is 
less likely to win $1 than to lose $1. In the long run, the player gains a dollar 18 
times in every 38 games played and loses a dollar on the remaining 20 of 38 bets. 
The player’s long-run average gain for this simple bet is 


i si(3) , sn) = 50.05 


You see that the player loses (and the casino gains) an average of five cents per $1 
bet in many, many plays of the game. 


If someone played several games of roulette, we would call the mean amount 
the person gained x. The mean in the previous example is a different quantity —it 
is the long-run average gain we'd expect if someone played roulette a very large 
number of times. For this reason, the mean of a random variable is often referred 
to as its expected value. Just as probabilities describe the proportion of times that 
an outcome occurs in many repetitions of a chance process, the mean of a discrete 
random variable describes the long-run average outcome. 

There are two ways of denoting the mean of a random variable X. We can use 
the notation jy, or we can write E(X), as in the “expected value of X.” In the rou- 
lette example, py = E(X) = —$0.05. 

The mean of any discrete random variable is found just as in the roulette ex- 
ample. It is an average of the possible outcomes, but a weighted average in which 
each outcome is weighted by its probability. Here (finally!) is the definition. 


EE a S| 


DEFINITION: Mean (expected value) of a discrete random variable 
Suppose that X is a discrete random variable with probability distribution 


Value: xX Xp X3 
Probability: Dy Po ps3 


To find the mean (expected value) of X, multiply each possible value by its probability, 
then add all the products: 


pix = EX) = XP, + XpP2 + Xgp3 +... 


= DXi 
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Let’s put the definition to use in calculating the mean of a familiar random 
variable. 


Apgar Scores: What’s Typical? 


Mean and expected value as an average 


In our earlier example, we defined the random variable X to be the Apgar score of 
a randomly selected baby. The table below gives the probability distribution for 
X once again. 


Value x;: 0 1 2 3 4 5 6 7 8 9 10 
Probability p;; 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


PROBLEM: Compute the mean of the random variable X. Interpret this value in context. 
SOLUTION: From the probability distribution for X, we see that 1 in every 1000 babies would 
have an Apgar score of O; 6 in every 1000 babies would have an Apgar score of 1; and 50 on. So the 
mean (expected value) of Xis 
by = E(X) = Lxp; 
= (0)(0.001) + (1)(0.006) + (2)(0.007) + -- - + (10)(0.053) = 6.128 
The mean Apgar score of a randomly selected newborn is 8.128. This is the average Apgar score of 


many, many randomly chosen babies. 


For Practice Try Exercise 9 | 


AP® EXAM TIP If the mean Notice that the mean Apgar score, 8.128, is not a possible value of the random 
variable X. It’s also not an integer. If you think of the mean as a long-run average 


of a random variable has a non- = ; 
over many repetitions, these facts shouldn’t bother you. 


integer value, but you report it 
as an integer, your answer will 
not get full credit. 


THE BEST WAY TO 
MAKE THIS DECISION 
IS BY CALCULATING 
THE EXPECTED VALUE 
OF EACH POSSIBLE 
OUTCOME. 


7-23-05 ©2005 Scott Adams, Inc./Dist. by UFS, Inc. 


www.dilbert.com _ scottadams@aol.com 


Standard Deviation (and Variance) 
of a Discrete Random Variable 


With the mean as our measure of center fora discrete random variable, it shouldn’t 
surprise you that we'll use the standard deviation as our measure of spread. 
In Chapter 1, we first defined the sample variance s? as the “typical” squared de- 
viation from the mean and then took the square root of the variance to get the 
sample standard deviation s,. The definition of the variance of a random variable 


oX is similar to the definition of the variance for a set of quantitative data. That is, 
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Recall that the formula for the sample the variance is an “average” of the squared deviation (x; — Lx)” of the values of the 
PAlanOe IS variable X from its mean jux. As with the mean, the average we use is a weighted 
¢_ 2O= *F average. Each outcome is weighted by its probability to take account of outcomes 
—————— a 
| that are not equally likely. To get the standard deviation of a random variable, 
we take the square root of the variance. Here are the details. 


De 


DEFINITION: Variance and standard deviation of a discrete random 
variable 


Suppose that X is a discrete random variable with probability distribution 


Value: x Xp Xe 
Probability: D; Do P3 


and that j., is the mean of X. The variance of X is 
Var(X) = of = (x1 — px)? Pi + (Xe — bx)? Do + (Xs — pix)? Pg +... 
= DG — px)’ 
The standard deviation of X, cy, is the square root of the variance. 


Oy= Vd — py)?P; 


‘The standard deviation of a random variable X is a measure of how much the 
values of the variable typically vary from the mean sux. Let’s compute the variance 
and standard deviation of a familiar discrete random variable. 


Apgar Scores: How Variable 
Are They? 


Calculating measures of spread 


In the last example, we calculated the mean Apgar score of a randomly chosen 
newborn to be px = 8.128. The table below gives the probability distribution for 
X one more time. 


Value x;: 0 l 2 3 4 5 6 7 8 9 10 
Probability p;; 0.001 0.006 0.007 0.008 0.012 0.020 0.038 0.099 0.319 0.437 0.053 


PROBLEM: Compute and interpret the standard deviation of the random variable X. 
SOLUTION: The formula forthe variance of Xis of = D(x; — /4y)p;. Plugging in values gives 
oy = (0 — 8.128)*(0.001) + (1 — 8.128)7(0.006) 
+ (2 — 8.128)?(0.007) + --- + (10 — 8.128)?(0.053) 
o; = 2.066 


The standard deviation of Xis cy = 2.066 = 1.437. A randomly selected baby’s Apgar score 
will typically differ from the mean (8.128) by about 1.4 units. 


For Practice Try Exercise 
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You can use your calculator to graph the probability distribution of a discrete 
random variable and to calculate measures of center and spread, as the following 
Technology Corner illustrates. 


ANALYZING RANDOM VARIABLES 
CORNER. ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s explore what the calculator can do using the random variable X = Apgar score of a randomly selected newborn. 


TI-83/84 TI-89 
1. Start by entering the values of the random variable in L1/list] and the corresponding probabilities in L2/list2. 


NORMAL FLOAT AUTO REAL RADIAN MP 


Pep eae a ELE COO 


1i1st1(11J=0 
MAIN RAD AUTO FUNC 


2. To graph a histogram of the probability distribution: 
¢ Set up a statistics plot with Xlist: L1/list] and Freq: L2/list2. 
e Adjust your window settings as follows: Xmin = — 1, Xmax = 11, Xscl = 1,Ymin = —0.1, Ymax = 0.5, Yscl = 0.1. 


e Press [GRAPH] (/@||F3]on the TI-89). 


NORMAL FLOAT AUTO REAL RADIAN MP fl 


Ploti:LasL2 


“mint 9. 


ws maxi 10. hi .437 


min=3 
max<16 n=.437 Malin RAD AUTO FUNC 


3. ‘To calculate the mean and standard deviation of the random variable, use one-variable statistics with the values in 
LI/list1 and the probabilities (relative frequencies) in L2/list2. 


e¢ OS 2.55 or later: In the dialog box, specify List! L1 © In the Statistics/List Editor, press [F4] (Calc) and 
and FreqList: L2. Then choose Calculate. Older OS: choose 1-Var Stats... Use the inputs List: 
Execute the command 1-Var Stats L1,L2. list] and es list2. 


NORMAL FLOAT AUTO REAL RADIAN MP a] 


x=8.128 
2x=8.128 
=x?=68.13 


Sx= 
ox=1. 437225104 
=1 
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gp/ cuec YOUR UNDERSTANDING 


The calculator command rand will 
generate a random number from 

0 to 1. Can you figure out how to 
modify the command to find a random 
number between, say, 1 and 3? 


A large auto dealership keeps track of sales made during each hour of the day. Let X = the 
number of cars sold during the first hour of business on a randomly selected Friday. Based 
on previous records, the probability distribution of X is as follows: 


Cars sold: 0 1 2 3 
Probability: 0.3 0.4 0.2 0.1 


1. Compute and interpret the mean of X. 
2. Compute and interpret the standard deviation of X. 


Continuous Random Variables 


When we use the table of random digits to select a digit between 0 and 9, the 
result is a discrete random variable (call it X). The probability model assigns prob- 
ability 1/10 to each of the 10 possible values of X. 

Suppose we want to choose a number at random between 0 and 1, allowing any 
number between 0 and | as the outcome (like 0.84522 or 0.1111119). Calculator 
and computer random number generators will do this. The sample space of this 
chance process is an entire interval of numbers: 


S = all numbers between 0 and 1 


Call the outcome of the random number generator Y for short. How can we find 
probabilities of events like P(0.3 = Y = 0.7)? As in the case of selecting a random 
digit, we would like all possible outcomes to be equally likely. But we cannot as- 
sign probabilities to each individual value of Y and then add them, because there 
are infinitely many possible values. 

In situations like this, we use a different way of assigning probabilities directly 
to events—as areas under a density curve. Recall from Chapter 2 that any density 
curve has area exactly 1 underneath it, corresponding to total probability 1. 


Random Numbers 


Density curves and probability distributions 


The random number generator will spread its output uniformly across the en- 
tire interval from 0 to | as we allow it to generate a long sequence of random 
numbers. The results of many trials are represented by the density curve of a 
uniform distribution. This density curve appears in purple in Figure 6.3 on 
the next page. It has height | over the interval from 0 to 1. The area under the 
density curve is 1, and the probability of any event is the area under the density 
curve and above the event in question. 
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As Figure 6.3 shows, the probability that the random 
number generator produces a number Y between 0.3 and 


0.7 is 


Density curve 


P0.3 = Y =0.7) = 04 
That’s because the area of the shaded rectangle is 
length x width = 04x 1 = 04 


Height = 1 < 


FIGURE 6.3 Assigning probabilities 
for generating a random number 
between 0 and 1. The probability 
of any interval of numbers is the 
area above the interval and under 
the density curve. The shaded area 
represents P(0.3 = Y= 0.7) 


0 0.3 0.7 HD 


Random number 


In many cases, discrete random Figure 6.3 shows the probability distribution of the random variable Y = ran- 
variables arise from counting dom number between 0 and 1. We call Y a continuous random variable because 


something—for instance, the number i+. values are not isolated numbers but rather an entire interval of numbers. 
of siblings that a randomly selected 


student has. Continuous random 

variables often arise from measuring 
something—for instance, the height 
or time to run a mile for a randomly DEFINITION: Continuous random variable 
selected student. 


A continuous random variable X takes all values in an interval of numbers. The prob- 
ability distribution of Xis described by a density curve. The probability of any event is 
the area under the density curve and above the values of X that make up the event. 


probabilities to intervals of outcomes rather than to individual outcomes. 

In fact, all continuous probability models assign probability 0 to every in- 
dividual outcome. Only intervals of values have positive probability. To see that 
this is true, consider a specific outcome from the random number generator of 
the previous example, such as P(Y = 0.7). The probability of this event is the area 
under the density curve that’s above the point 0.70000 ... on the horizontal axis. 
But this vertical line segment has no width, so the area is 0. For that reason, 


The probability distribution for a continuous random variable assigns 1 


P(0.3 <Y < 0.7) = P(0.3 = Y <0.7) = P(0.3 <Y<0.7) =04 


We can use any density curve to assign probabilities. The density curves that 
are most familiar to us are the Normal curves of Chapter 2. We learned how to 
find areas in any Normal distribution on page 118. Normal distributions can be 
probability distributions as well as models for data. The following example shows 
the connection between the two. 


AP® EXAM TIP When 
showing your work on a 
free response question, you 
must include more than a 
calculator command. Writing 
normalcdf (68,70, 
64,2.7) will notearn 
you full credit for a Normal 
calculation. At a minimum, 
you must indicate what each 
of those calculator inputs 
represents. Better yet, sketch 
and label a Normal curve to 
show what you’re finding. 


Section 6.1 Discrete and Continuous Random Variables 357 


Young Women’s Heights 
Normal probability distributions 


The heights of young women closely follow the Normal distribution with mean 
jt = 64 inches and standard deviation o = 2.7 inches. This is a distribution for a 
large set of data. Now choose one young woman at random. Call her height Y. If 
we repeat the random choice very many times, the distribution of values of Y is the 
same Normal distribution that describes the heights of all young women. Find the 
probability that the chosen woman is between 68 and 70 inches tall. 


PROBLEM: What's the probability that a randomly chosen young woman has height between 68 
and 70 inches? 


SOLUTION: 


Step 1: State the distribution and the values of interest. The height Yofa randomly 
chosen young woman has the N(64, 2.7) distribution. We want to find P(68 = Y= 70). Figure 

6.4 shows the distribution with the area of interest shaded and the mean, standard deviation, and 
boundary values labeled. 


Normal curve 
M = 64, 0=2.7 


Probability = 22? 


FIGURE 6.4 The probability 
that a randomly chosen young 
woman has height between 
68 and 70 inches as an area 


Height in inches 
under a Normal curve. 


Step 2: Perform calculations—show your work! The standardized scores for the two 


boundary values are 
68 — 64 70 — 64 
re AO ANA res LYLE 
Zak 2.7 


The random variable Z follows a standard Normal distribution, and the desired probability is 
P(1.48 = ZS 2.22). From Table A, we find that P(Z S 2.22) = 0.9868 and P(Z <= 1.48) = 
0.9306. So we have 
F148 = Z= 2.22) = FZ= 2.22) — AZ=148) 
= 0.9868 — 0.9306 = 0.0562 


Using technology: The command normalcdf (lower:68, upper:70, [4:64, 0:2.7) 
gives an area of 0.0561. 


Step 3: Answer the question. There's about a 5.6% chance that a randomly chosen young 
woman has a height between 68 and 70 inches. 


For Practice Try Exercise 
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RANDOM VARIABLES 


The calculation in the preceding example is the same as those we did in 
Chapter 2. Only the language of probability is new. 

What about the mean and standard deviation for continuous random variables? 
The probability distribution of a continuous random variable X is described by a 
density curve. Chapter 2 showed how to find the mean of the distribution: it is the 
point at which the area under the density curve would balance if it were made out 
of solid material. The mean lies at the center of symmetric density curves such as 
the Normal curves. We can locate the standard deviation of a Normal distribution 
from its inflection points. Exact calculation of the mean and standard deviation 
for most continuous random variables requires advanced mathematics.° 


Summary 


e A random variable takes numerical values determined by the outcome of a 
chance process. The probability distribution of a random variable X tells us 
what the possible values of X are and how probabilities are assigned to those 
values. There are two types of random variables: discrete and continuous. 


e A discrete random variable has a fixed set of possible values with gaps be- 
tween them. The probability distribution assigns each of these values a prob- 
ability between 0 and | such that the sum of all the probabilities is exactly 1. 
The probability of any event is the sum of the probabilities of all the values 
that make up the event. 


e Acontinuous random variable takes all values in some interval of numbers. 
A density curve describes the probability distribution of a continuous random 
variable. The probability of any event is the area under the curve above the 
values that make up the event. 


e The mean of a random variable py is the balance point of the probability 
distribution histogram or density curve. Because the mean is the long-run 
average value of the variable after many repetitions of the chance process, it 
is also known as the expected value of the random variable, E'(X). 


e IfX is a discrete random variable, the mean is the average of the values of X, 
each weighted by its probability: 
px = E(X) = Dc, = 26D AP Fay OA AP SYDR Po 6 « 


e The variance of a random variable ox is the “average” squared deviation 
of the values of the variable from their mean. The standard deviation ox is 
the square root of the variance. The standard deviation measures the typical 
distance of the values in the distribution from the mean. 


e For a discrete random variable X, the variance is 
OX = D(x — px)" pi = Oc. — pox)" + (2 — pux)"p2 + (3 — pex)"p3 + 0° 
and the standard deviation is 


One Woe — pbx)"; 
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TECHNOLOGY 
CORNER 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


11. Analyzing random variables on the calculator 


Exercises 


1. ‘Toss 4 times Suppose you toss a fair coin 4 times. Let 
X = the number of heads you get. 


(a) Find the probability distribution of X. 


(b) Make a histogram of the probability distribution. 
Describe what you see. 


(c) Find P(X S 3) and interpret the result. 


2.  Pair-a-dice Suppose you roll a pair of fair, six-sided dice. 
Let T = the sum of the spots showing on the up-faces. 


(a) Find the probability distribution of T. 


(b) Make a histogram of the probability distribution. 
Describe what you see. 


(c) Find P(T = 5) and interpret the result. 
3. Spell-checking Spell-checking software catches 


“nonword errors,” which result in a string of letters that 
is not a word, as when “the” is typed as “teh.” When 
undergraduates are asked to write a 250-word essay 
(without spell-checking), the number X of nonword 
errors has the following distribution: 


Value: 0 1 2 3} 4 
Probability: 0.1 0.2 0.3 0.3 0.1 


(a) Write the event “at least one nonword error” in terms 
ot X. What is the probability of this event? 


(b) Describe the event X S 2 in words. What is its 
probability? What is the probability that X < 2? 


4. Kids and toys In an experiment on the behavior of 
young children, each subject is placed in an area with 
five toys. Past experiments have shown that the prob- 
ability distribution of the number X of toys played 
with by a randomly selected subject is as follows: 


Number of toys x;: 0 1 2, 3 4 5 
Probability p;: OLOS MO NIGIENO 3 ORO S23 eat OME ae Oell 
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Write the event “plays with at most two toys” in terms 
of X. What is the probability of this event? 


Describe the event X > 3 in words. What is its 
probability? What is the probability that X = 3? 


Benford’s law Faked numbers in tax returns, invoices, 


aren't present in legitimate records. Some patterns, 
like too many round numbers, are obvious and easily 
avoided by a clever crook. Others are more subtle. It is 
a striking fact that the first digits of numbers in legiti- 
mate records often follow a model known as Benford’s 
law.’ Call the first digit of a randomly chosen record X 
for short. Benford’s law gives this probability model for 
X (note that a first digit can’t be 0): 


(a) 
(b) 
DB 
pgkee) or expense account claims often display patterns that 


First digit: l 2 3 at 5 6 7 8 9 
Probability: 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 


(a) Show that this is a legitimate probability distribution. 


(b) Make a histogram of the probability distribution. 
Describe what you see. 


(c) Describe the event X = 6 in words. What is P(X = 6)? 


(d) Express the event “first digit is at most 5” in terms of 
X. What is the probability of this event? 


6. Working out Choose a person aged 19 to 25 years at 
random and ask, “In the past seven days, how many 
times did you go to an exercise or fitness center or 
work out?” Call the response Y for short. Based on a 
large sample survey, here is a probability model for 
the answer you will get:* 


Days: 0 ] 2 3 4 5 6 7 
Probability: 0.68 0.05 0.07 0.08 0.05 0.04 0.01 0.02 


(a) Show that this is a legitimate probability distribution. 


(b) Make a histogram of the probability distribution. 
Describe what you see. 
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Describe the event Y < 7 in words. What is P(Y < 7)? 


Express the event “worked out at least once” in terms 
ot Y. What is the probability of this event? 


Benford’s law Refer to Exercise 5. The first digit of 
a randomly chosen expense account claim follows 
Benford’s law. Consider the events A = first digit is 7 
or greater and B = first digit is odd. 


What outcomes make up the event A? What is P(A)? 
What outcomes make up the event B? What is P(B)? 


What outcomes make up the event “A or B”? What 
is P(A or B)? Why is this probability not equal to 
P(A) + P(B)? 


Working out Refer to Exercise 6. Consider the 
events A = works out at least once and B = works 
out less than 5 times per week. 


What outcomes make up the event A? What is P(A)? 
What outcomes make up the event B? What is P(B)? 


What outcomes make up the event “A and B”? What 
is P(A and B)? Why is this probability not equal to 
P(A) - P(B)? 


Keno Keno is a favorite game in casinos, and simi- 
lar games are popular with the states that operate 
lotteries. Balls numbered | to 80 are tumbled ina 
machine as the bets are placed, then 20 of the balls 
are chosen at random. Players select numbers by 
marking a card. The simplest of the many wagers 
available is “Mark 1 Number.” Your payoff is $3 on 
a $1 bet if the number you select is one of those 
chosen. Because 20 of 80 numbers are chosen, your 
probability of winning is 20/80, or 0.25. Let X = the 
net amount you gain on a single play of the game. 


Make a table that shows the probability distribution of X. 


Compute the expected value of X. Explain what this 
result means for the player. 


Fire insurance Suppose a homeowner spends 
$300 for a home insurance policy that will pay out 
$200,000 if the home is destroyed by fire. Let Y = 
the profit made by the company on a single policy. 
From previous data, the probability that a home in 
this area will be destroyed by fire is 0.0002. 


Make a table that shows the probability distribution 
of Y. 


Compute the expected value of Y. Explain what this 
result means for the insurance company. 


Spell-checking Refer to Exercise 3. Calculate the 
mean of the random variable X and interpret this 
result in context. 
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12. Kids and toys Refer to Exercise +. Calculate the 
mean of the random variable X and interpret this 
result in context. 


13. Benford’s law and fraud A not-so-clever employee 
decided to fake his monthly expense report. He 
believed that the first digits of his expense amounts 
should be equally likely to be any of the numbers 
from | to 9. In that case, the first digit Y of a ran- 
domly selected expense amount would have the 
probability distribution shown in the histogram. 


0.4 4 


S 
bo 
! 


Probability 
=) 
N 
! 


0 il 2) 8} 4 5 6 a 8 9 
Outcomes 


(a) Explain why the mean of the random variable Y is 
located at the solid red line in the figure. 


(b) The first digits of randomly selected expense amounts 
actually follow Benford’s law (Exercise 5). According 
to Benford’s law, what’s the expected value of the first 
digit? Explain how this information could be used to 
detect a fake expense report. 


(c) What's P(Y > 6) in the above distribution? Ac- 
cording to Benford’s law, what proportion of first 
digits in the employee’s expense amounts should be 
greater than 6? How could this information be used 
to detect a fake expense report? 


14. Life insurance A life insurance company sells a 
term insurance policy to a 21-year-old male that pays 
$100,000 if the insured dies within the next 5 years. 
The probability that a randomly chosen male will 
die each year can be found in mortality tables. The 
company collects a premium of $250 each year as 
payment for the insurance. ‘The amount Y that the 
company earns on this policy is $250 per year, less 
the $100,000 that it must pay if the insured dies. 
Here is a partially completed table that shows infor- 
mation about risk of mortality and the values of 
Y = profit earned by the company: 


Age at death: 21 22 3) 24 25 26 or more 
Profit: $99,750 —$99,500 —$99,250 —$99,000 —$98,750 $1250 


Probability: 0.00183 0.00186 0.00189 0.00191 0.00193 


(a) Explain why the company suffers a loss of $98,750 
on such a policy if a client dies at age 25. 


(b) Find the missing probability. Show your work. 


(c) Calculate the mean py. Interpret this value in 
context. 
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15. Spell-checking Refer to Exercise 3. Calculate and 
1] 353 interpret the standard deviation of the random vari- 
& able X. Show your work. 


16. Kids and toys Refer to Exercise 4+. Calculate and in- 
terpret the standard deviation of the random variable 
X. Show your work. 


17. Benford’s law and fraud Refer to Exercise 13. It 
might also be possible to detect an employee’s fake 
expense records by looking at the variability in the 
first digits of those expense amounts. 


(a) Calculate the standard deviation oy. This gives us 
an idea of how much variation we’d expect in the 
employee’s expense records if he assumed that first 
digits from | to 9 were equally likely. 


(b) Now calculate the standard deviation of first digits that 
follow Benford’s law (Exercise 5). Would using stan- 
dard deviations be a good way to detect fraud? Explain. 


18. Life insurance 


(a) It would be quite risky for you to insure the life of a 
21-year-old friend under the terms of Exercise 14. 
There is a high probability that your friend would 
live and you would gain $1250 in premiums. But if 
he were to die, you would lose almost $100,000. Ex- 
plain carefully why selling insurance is not risky for 
an insurance company that insures many thousands 
of 21-year-old men. 


(b) ‘The risk of an investment is often measured by the 
standard deviation of the return on the investment. 
The more variable the return is, the riskier the invest- 
ment. We can measure the great risk of insuring a 
single person’s life in Exercise 14 by computing the 
standard deviation of the income Y that the insurer 
will receive. Find oy using the distribution and mean 
found in Exercise 14. 


19. Housing in San Jose How do rented housing units 
differ from units occupied by their owners? Here are 
the distributions of the number of rooms for owner- 
occupied units and renter-occupied units in San 
Jose, California:? 


Number of Rooms 
1 2 3 4 5 6 if 8 9 10 
Owned 0.003 0.002 0.023 0.104 0.210 0.224 0.197 0.149 0.053 0.035 
Rented 0.008 0.027 0.287 0.363 0.164 0.093 0.039 0.013 0.003 0.003 


Let X = the number of rooms in a randomly selected 
owner-occupied unit and Y = the number of rooms in a 
randomly chosen renter-occupied unit. 


(a) Make histograms suitable for comparing the prob- 
ability distributions of X and Y. Describe any differ- 
ences that you observe. 


(b) Find the mean number of rooms for both types of 
housing unit. Explain why this difference makes 
sense. 


(c) Find and interpret the standard deviations of both 
X and Y. 


20. Size of American households In government data, a 
household consists of all occupants of a dwelling unit, 
while a family consists of two or more persons who 
live together and are related by blood or marriage. So 
all families form households, but some households 
are not families. Here are the distributions of house- 
hold size and family size in the United States: 


Number of Persons 


1 2 3 4 5 6 7 
Household probability 0.25 0.32 017 0.15 0.07 0.03 0.01 
Family probability 0 042 023 0.21 0.09 0.03 0.02 


Let X = the number of people in a randomly selected 
US. household and Y = the number of people ina 
randomly chosen U.S. family. 


(a) Make histograms suitable for comparing the prob- 
ability distributions of X and Y. Describe any differ- 
ences that you observe. 


(b) Find the mean for each random variable. Explain 
why this difference makes sense. 


(c) Find and interpret the standard deviations of both 
X and Y. 


21. Random numbers Let X be a number between 
0 and 1 produced by a random number generator. 
Assuming that the random variable X has a uniform 
distribution, find the following probabilities: 


(a) P(X > 0.49) 
(b) P(X = 0.49) 
(c) P(0.19<X < 0.37 or 0.84 <X < 1.27) 


22. Random numbers Let Y be a number between 0 
and | produced by a random number generator. 
Assuming that the random variable Y has a uniform 
distribution, find the following probabilities: 


AY = 04) 

Py = 0) 

(oC) POLY = U5 orl 77 =¥ < 0:58) 

23. Running a mile A study of 12,000 able-bodied male 


pokey, students at the University of Illinois found that their 
& times for the mile run were approximately Normal 


with mean 7.11 minutes and standard deviation 0.74 
minute.!” Choose a student at random from this 
group and call his time for the mile Y. Find P(Y < 6) 
and interpret the result. 
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24. ITBS scores The Normal distribution with mean 
je = 6.8 and standard deviation o = 1.6 is a good 
description of the Lowa Test of Basic Skills (TBS) 
vocabulary scores of seventh-grade students in Gary, 
Indiana. Call the score of a randomly chosen student 
X for short. Find P(X = 9) and interpret the result. 


25. Ace! Professional tennis player Rafael Nadal hits the 
ball extremely hard. His first-serve speeds follow a 
Normal distribution with mean 115 miles per hour 
(mph) and standard deviation 6 mph. Choose one 
ot Nadal’s first serves at random. Let Y = its speed, 
measured in miles per hour. 


(a) Find P(Y > 120) and interpret the result. 
(b) What is P(Y = 120)? Explain. 


(c) Find the value of ¢ such that P(Y S$ c) = 0.15. Show 
your work. 


26. Pregnancy length The length of human pregnan- 
cies from conception to birth follows a Normal dis- 
tribution with mean 266 days and standard deviation 
16 days. Choose a pregnant woman at random. Let X 
= the length of her pregnancy. 


(a) Find P(X = 240) and interpret the result. 
(b) What is P(X > 240)? Explain. 


(c) Find the value of ¢ such that P(X = c) = 0.20. Show 
your work. 


Multiple choice: Select the best answer for Exercises 

27 to 30. 

Exercises 27 to 29 refer to the following setting. Choose 

an American household at random and let the random 
variable X be the number of cars (including SUVs and 
light trucks) they own. Here is the probability model if we 
ignore the few households that own more than 5 cars: 


Number of cars X: 0 ] 2 3 4 5 
Probability: O09 Wate Ws While WHO Wor 


27. What’s the expected number of cars in a randomly 
selected American household? 
(a) 1.00 (b) 1.75 (c) 1.84 (d) 2.00 (e) 2.50 


28. ‘The standard deviation of X is oy = 1.08. If many 
households were selected at random, which of the 
following would be the best interpretation of the 
value 1.08? 


(a) ‘The mean number of cars would be about 1.08. 


(b) ‘The number of cars would typically be about 1.08 


from the mean. 


(c) ‘The number of cars would be at most 1.08 from the 
mean. 


(d) ‘The number of cars would be within 1.08 from the 
mean about 68% of the time. 


(e) ‘The mean number of cars would be about 1.08 from 
the expected value. 


29. About what percentage of households have a number 
of cars within 2 standard deviations of the mean? 


(a) 68% (b) 71% (c) 93% (d) 95% (e) 98% 


30. Adeck of cards contains 52 cards, of which 4 are 
aces. You are offered the following wager: Draw one 
card at random from the deck. You win $10 if the 
card drawn is an ace. Otherwise, you lose $1. If you 
make this wager very many times, what will be the 
mean amount you win? 


a) About —$1, because you will lose most of the time. 


b) About $9, because you win $10 but lose only $1. 


Q 


( 

( 

(c) About —$0.15; that is, on average you lose about 15 cents. 
(d) About $0.77; that is, on average you win about 77 cents. 
( 


e) About $0, because the random draw gives you a fair bet. 


Exercises 31 to 34 refer to the following setting. Many chess 
masters and chess advocates believe that chess play devel- 
ops general intelligence, analytical skill, and the ability to 
concentrate. According to such beliefs, improved reading 
skills should result from study to improve chess-playing 
skills. ‘To investigate this belief, researchers conducted 

a study. All of the subjects in the study participated in a 
comprehensive chess program, and their reading perfor- 
mances were measured before and after the program. The 
graphs and numerical summaries below provide informa- 
tion on the subjects’ pretest scores, posttest scores, and the 
difference (post — pre) between these two scores. 


Descriptive Statistics: Pretest, Posttest, Post — pre 
Variable N Mean Median StDev Min Max Or Q3 
53 57.70 58.00 17.84 23.00 99.00 44.50 


Pretest TNC 1530) 


PosteesGy 53 6SmOeh 64500) 870 280099. CONS 000 7i5...0.0 


Post prem bore acc 3.00 13.02 -19.00 42.00 -3.50 14.00 


100 ae 


Reading performance 


Pretest Posttest Post — pre 
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31. Better readers? (1.3) Did students have higher read- 
a, ing scores after participating in the chess program? 
“4 Give appropriate statistical evidence to support your 

answer. 


Residual 


32. Chess and reading (4.3) Ifthe study found a statisti- 
ep, cally significant improvement in reading scores, 
could you conclude that playing chess causes an 
increase in reading skills? Justify your answer. 


Some graphical and numerical information about the cca tik 

relationship between pretest and posttest scores is provided 

nao Regression Analysis: Posttest versus Pretest 
Predictor Coef SE Coef T P 
Constant IT at ShNT) By a iti) 3.04 0.004 
Pretest 0.78301 0.09758 8.02 0.000 
S = 12.55 R-Sq = 55.8% R-Sq(adj) = 54.9% 


33. Predicting posttest scores (3.2) What is the equa- 
@ tion of the linear regression model relating posttest 
ra and pretest scores? Define any variables used. 

34. How well does it fit? (3.2) Discuss what s, r’, and 
2b = 2S me we @ ee plot tell you about this linear regression 

Pre-test ' 


Post-test 


Transforming and Combining 
Random Variables 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Describe the effects of transforming a random variable e Find probabilities involving the sum or difference of 
by adding or subtracting a constant and multiplying or independent Normal random variables. 


dividing by a constant. 


e Find the mean and standard deviation of the sum or 
difference of independent random variables. 


In Section 6.1, we looked at several examples of random variables and their proba- 
bility distributions. We also saw that the mean juy and standard deviation ox give us 
important information about a random variable. For instance, for X = the amount 
gained on a single $1 bet on red in a game of roulette, we already showed that 
Lux = —$0.05. You can verify that the standard deviation is cx = $1.00. That is, a 
player can expect to lose an average of 5 cents per $1 bet if he plays many games. 
But if he plays only a few games, his actual gain could be much better or worse 
than this expected value. 
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Would the player be better off playing one game of roulette with a $2 bet on 
red or playing two games and betting $1 on red each time? To find out, we need 
to compare the probability distributions of the random variables Y = gain from a 
$2 bet and T = total gain from two $1 bets. Which random variable (if either) has 
the higher expected gain in the long run? Which has the larger variability? By the 
end of this section, you'll be able to answer questions like these. 


Linear Transformations 


In Chapter 2, we studied the effects of transformations on the shape, center, and 
spread of a distribution of data. Recall what we discovered: 


1. Adding (or subtracting) a constant: Adding the same positive number a to 
(subtracting a from) each observation: 
e Adds a to (subtracts a from) measures of center and location (mean, me- 
dian, quartiles, percentiles). 
¢ Does not change shape or measures of spread (range, JOR, standard deviation). 
2. Multiplying (or dividing) each observation by the same positive number b: 
e¢ Multiplies (divides) measures of center and location (mean, median, 
quartiles, percentiles) by b. 
e Multiplies (divides) measures of spread (range, IOR, standard deviation) by b. 
¢ Does not change the shape of the distribution. 
How are the probability distributions of random variables affected by similar 


transformations to the values of the variable? For reasons that will be clear later, 
we'll start by considering multiplication (or division) by a constant. 


Effect of multiplying or dividing by a constant Let’s start with a 
simple example of a discrete random variable. Pete’s Jeep Tours offers a popular 
half-day trip in a tourist area. There must be at least 2 passengers for the trip to 
run, and the vehicle will hold up to 6 passengers. The number of passengers X on 
a randomly selected day has the following probability distribution. 


No. of passengers x;: 2 3 4 5 6 
Probability p;: 0.15 0.25 0.35 0.20 0.05 
an Figure 6.5 shows a histogram of the probability distribution. 
ada Using what we learned in Section 6.1, the mean of X is 
0.25 
= vane bx = Dx; Pp; = (2)(0.15) + (3)(0.25) + (4)(0.35) 
2 ar + (5)(0.20) + (6)(0.05) = 3.75 
* 0.10 — That is, Pete expects an average of 3.75 passengers per trip. 
The variance of X is given by 
0.05 4 
0.00 4 o¢ = D(x; — px)’p; = (2 — 3.75)7(0.15) + 3 — 3.75)7(0.25) 
CF es +++++ (6 — 3.75)*(0.05) = 1.1875 
Number of passengers (X) 
FIGURE 6.5 The probability So the standard deviation of X is 
distribution of the random variable ox = V1.1875 = 1.0897 


X = the number of passengers 
on Pete’s trip on a randomly 
chosen day. 


On a randomly selected day, the number of people on a trip typically differs from 
the mean by about 1.09 passengers. 
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Pete’s Jeep Tours 


Multiplying a random variable by a constant 


Pete charges $150 per passenger. Let C = the total amount 
of money that Pete collects on a randomly selected trip. 
Because the amount of money Pete collects is just $150 times 
the number of passengers, we can write C = 150X. From the 
probability distribution of X, we can see that the chance of 
having two people (X = 2) on the trip is 0.15. In that case, 
C = (150)(2) = 300. So one possible value of C is $300, and its 
corresponding probability is 0.15. If X = 3, then C = (150)(3) = 
450, and the corresponding probability is 0.25. Thus, the prob- 
ability distribution of C is 


Total collected c;: 300 450 600 750 900 
Probability p;: 0.15 0.25 0.35 0.20 0.05 


Figure 6.6 is a histogram of this probability distribution. 


S 
n 


The mean of C is pig = De;p; = (300)(0.15) + (450)(0.25) 
+... + (900)(0.05) = 562.50. 


=) 
is) 


On average, Pete will collect a total of $562.50 from the 
half-day trip. The variance of C is 


Probability 
oO 
N 


S 
a 


oe = D(cei — pc)’: 
450 600750900 = (300 — 562.50)2(0.15) + (450 — 562.50)2(0.25) 


So 
= 


Money collected (C) 


ie = 2 = 
FIGURE 6.6 The probability distribution of the random a 
variable C = the amount of money Pete collects from his So the standard deviation of C is o¢ = V26,718.75 
trip on a randomly chosen day. = $16346. 


In the previous example, the random variable C was obtained by multiplying 
the values of our earlier random variable X by 150. To understand the effect of 
multiplying by a constant, let’s compare the probability distributions of the ran- 
dom variables X and C. 


Shape: The two probability distributions have the same shape. 

Center: The mean of X is py = 3.75. The mean of C is wc = 562.50, which is 
(150)(3.75). That is, wo = 150 py. 

Spread: The standard deviation of X is ox = 1.0897. The standard deviation of 
C is oc = 163.46, which is (150)(1.0897). That is, 7¢ = 150ox. 


Let’s summarize what we’ve learned so far about transforming a random 
variable. 
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EFFECT ON A RANDOM VARIABLE OF 
MULTIPLYING (OR DIVIDING) BY A CONSTANT 


Multiplying (or dividing) each value of a random variable by a positive 
number b: 


e Multiplies (divides) measures of center and location (mean, median, 
quartiles, percentiles) by b. 

¢ Multiplies (divides) measures of spread (range, JOR, standard devia- 
tion) by b. 

¢ Does not change the shape of the distribution. 


As with data, if we multiply a random variable by a negative constant b, our 
common measures of spread are multiplied by |J]. 


THINK How does multiplying by a constant affect the variance? For 
ABOUT IT Pete’s Jeep Tours, the variance of the number of passengers on a randomly selected 
trip is 0% = 1.1875. The variance of the total amount of money that Pete collects 
from such a trip is 0% = 26,718.75. That’s (22,500)(1.1875). So o% = 22,5000%. 
Where did 22,500 come from? It’s just (150)?. In other words, o@ = (150)?o%. 

Multiplying a random variable by a constant b multiplies the variance by b’. 


os a 


Effect of adding or subtracting a constant What happens to the 
probability distribution of a random variable if we add or subtract a constant? Let’s 
return to Pete’s Jeep Tours to find out. 


Pete’s Jeep Tours 
Effect of adding or subtracting a constant 


It costs Pete $100 to buy permits, gas, and a ferry pass for each 
half-day trip. The amount of profit V that Pete makes from 
the trip is the total amount of money C that he collects from 
passengers minus $100. That is, V = C — 100. If Pete has only 
two passengers on the trip (X = 2), then C = 2(150) = 300 and 
V = 200. From the probability distribution of C, the chance 
that this happens is 0.15. So the smallest possible value of V 


Probability 
Oo 
N 


0.0 is $200; its corresponding probability is 0.15. If X = 3, then 
. a ey a et ee C = 450 and V = 350, and the corresponding probability is 
hadi 0.25. The probability distribution of V is 
FIGURE 6.7 The probability 
distribution of the random Profit vj: 200 350 500 650 800 
variable V = the profit that Probability p;: 0.15 0.25 0.35 0.20 0.05 


Pete makes from his trip on a 
randomly chosen day. Figure 6.7 shows a histogram of this probability distribution. 
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The mean of V is py = vip; = (200)(0.15) + (350)(0.25) + --- + (800)(0.05) = 
462.50. On average, Pete will make a profit of $462.50 from the trip. The variance 
of V is 


oy = Dv; — py)’; 
= (200 — 462.50)(0.15) + (350 — 462.50)7(0.25) 
+... + (800 — 462.50)7(0.05) = 26,718.75 
So the standard deviation of V is 
oy = V26,718.75 = $163.46 


It’s fairly clear from the previous example that subtracting 100 from the values 
of the random variable C just shifts the probability distribution to the left by 100. 
This transformation decreases the mean by 100 (from $562.50 to $462.50) but 
doesn’t change the standard deviation ($163.46) or the shape. These results can 
be generalized for any random variable. 


EFFECT ON A RANDOM VARIABLE OF 
ADDING (OR SUBTRACTING) A CONSTANT 


ap/ cHeck YOUR UNDERSTANDING 


A large auto dealership keeps track of sales made during each hour of the day. Let X = the 
number of cars sold during the first hour of business on a randomly selected Friday. Based 
on previous records, the probability distribution of X is as follows: 


Cars sold: 0 1 2 3 
Probability: 0.3 O04 O02 O01 


The random variable X has mean py = 1.1 and standard deviation oy = 0.943. 


1. Suppose the dealership’s manager receives a $500 bonus from the company for each 
car sold. Let Y = the bonus received from car sales during the first hour on a randomly 
selected Friday. Find the mean and standard deviation of Y. 

2. ‘To encourage customers to buy cars on Friday mornings, the manager spends $75 to 
provide coffee and doughnuts. The manager’s net profit T on a randomly selected Friday 
is the bonus earned minus this $75. Find the mean and standard deviation of T. 


Putting it all together: Adding/subtracting and multiplying/ 
dividing What happens if we transform a random variable by both adding or 
subtracting a constant and multiplying or dividing by a constant? Let’s consider 
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For the linear transformation 

V= —100 + 150X, it would not be 
correct to apply the transformations 
in the reverse order: subtract 100 and 
then multiply by 150. Doing so would 
yield the same standard deviation but 
a different (wrong) mean. Just follow 
the order of operations from algebra. 


Can you see why this is called a “linear” 
transformation? The equation describing 
the sequence of transformations has 
the form Y= a+ bX, which you should 
recognize as a linear equation. 


RANDOM VARIABLES 


Pete’s Jeep Tours again. We could have gone directly from the number of pas- 
sengers X on a randomly selected jeep tour to Pete’s profit V with the equation 
V = 150X — 100 or, equivalently, V = —100 + 150X. This linear transformation 
of the random variable X includes both of the transformations that we performed 
earlier: (1) multiplying by 150 and (2) subtracting 100. (In general, a linear trans- 
formation can be written in the form Y = a + bX, where a and b are constants.) 
The net effect of this sequence of transformations is as follows: 


Shape: Neither transformation changes the shape of the probability distribution. 
Center: The mean of X is multiplied by 150 and then decreased by 100; that is, 
py = 150ux — 100 = —100 + 150py. 

Spread: The standard deviation of X is multiplied by 150 and is unchanged by the 
subtraction: oy = 150ox. 


This logic generalizes to any linear transformation. 


EFFECTS OF A LINEAR TRANSFORMATION ON A RANDOM VARIABLE 


If Y =a +t bX isa linear transformation of the random variable X, then 


e the probability distribution of Y has the same shape as the probability 
distribution of X if b > 0. 

° py =a + dix. 

© oy = |b\ox (because b could be a negative number). 


The bottom two rules in the summary box don’t just apply to means and stan- 
dard deviations. Linear transformations have similar effects on other measures 
of center or location (median, quartiles, percentiles) and spread (range, OR). 
Whether we’re dealing with data or random variables, the effects of a linear transfor- 
mation are the same. Note that these results apply to both discrete and continuous 
random variables. 


The Baby and the Bathwater 


Linear transformations 


PROBLEM: One brand of bathtub comes with a dial to set the water temperature. When the 
“babysafe” setting is selected and the tub is filled, the temperature X of the water follows a Normal 
distribution with a mean of 34°C and a standard deviation of 2°C. 


(a) Define the random variable Yto be the water temperature in degrees Fahrenheit 


(recall that F = —C + 32) when the dial is set on “babysafe.” Find the mean and standard 
deviation of Y. 


(b) According to Babies RUs, the temperature of a baby's bathwater should be between 90°F and 
100°F. Find the probability that the water temperature on a randomly selected day when the “ba- 
bysafe” setting is used meets the Babies RUs recommendation. Show your work. 
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SOLUTION: 
g) 
(a) According to the formula for converting Celsius to Fahrenheit, Y = 5 X + 32. We could also 


9 
write this inthe form Y= 32 + ale mean of Y is 


9 9 
fly = 32 + 5 hx = 32+ ale = 93.2°F 
The standard deviation of Y is 
9 g 

Oy a Oy 5 (2) 3.6°F 
(b) Step 1: State the distribution and the values of 
interest. The linear transformation doesn’t change the shape of the prob- 
ability distribution, so the random variable Yis Normally distributed with a 
mean of 93.2 and a standard deviation of 3.6. We want to find 380P(90 = 
Y= 100). The shaded area in Figure 6.8 shows the desired probability. 


Step 2: Perform calculations—show your work! To find this 
area, we can standardize the boundary values and use Table A: 


350 I3.2 100 _ 
Temperature (°F) i 90 — 93.2 a eer ae 100 — 93.2 
3.6 3.6 


Then P(—0.89 = Z= 1.89) = 0.9706 — 0.1867 = 0.7839. 


= 1.89 


FIGURE 6.8 The Normal prob- 
ability distribution of the random 


variable Y = the temperature (in Using technology. The command normalcdf (lower:90, upper:100, [:93.2,0:3.6) 
°F) of the bathwater when the dial gives an area of 0.7835. 


ie eet Oy) sbabysate:” Ihe stiaded Step 3: Answer the question. There's about a 78% chance that the water temperature meets 
area is the probability that the 
water temperature is between 


90°F and 100°F For Practice Try Exercise 


the recommendation on a randomly selected day. 


Combining Random Variables 


So far, we have looked at settings that involved a single random variable. Many in- 
teresting statistics problems require us to combine two or more random variables. 


Pete’s Jeeps and Erin’s Adventures 
When one random variable isn’t enough 


Earlier, we examined the probability distribution for the random variable X = the 
number of passengers on a randomly selected half-day trip with Pete’s Jeep Tours. 
Here’s a brief recap: 


No. of passengers x;: 2 x 5 5 6 
Probability p;: 0.15 O23 0.35 0.20 0.05 


Mean: px = 3.75 — Standard deviation: ox = 1.0897 
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Probability 
S 
8 
! 


0 1 2 3 4 5 6 
Number of passengers (X) 


Pete’s sister Erin, who lives near a tourist area in another part of the country, is 
impressed by the success of Pete’s business. She decides to join the business, run- 
ning tours on the same days as Pete in her slightly smaller vehicle, under the name 
Erin’s Adventures. After a year of steady bookings, Erin discovers that the number 
of passengers Y on her half-day tours has the following probability distribution. 
Figure 6.9 displays this distribution as a histogram. 


geal No. of passengers y;: Z 3 4 5 
Probability p;: 0.3 OF 02 0.1 
o4 4 
= Mean: py = 3.10 Standard deviation: oy = 0.943 
= 03- 
2 
E 024 How many total passengers T will Pete and Erin have on 
their tours on a randomly selected day? To answer this 
ald question, we need to know about the distribution of the 
0 random variable T= X + Y. 
0 1 2 3 4 5 
Number of passengers (Y) How many more or fewer passengers D will Pete have 


than Erin on a randomly selected day? ‘To answer this 
FIGURE 6.9 The probability distribution of the random variable question, we need to know about the distribution of the 
Y = the number of passengers on Erin’s trip on a randomly nda 

chosen day. 


As the example suggests, we want to investigate what happens when we add or 
subtract random variables. 


Sums of random variables How many total passengers T can Pete and 
Erin expect to have on their tours on a randomly selected day? Because Pete aver- 
ages [lx = 3.75 passengers per trip and Erin averages joy = 3.10 passengers per 
trip, they will average a total of pup = 3.75 + 3.10 = 6.85 passengers per day. We 
can generalize this result for any two random variables as follows: if T = X + Y, 
then wp = Lyx + py. In other words, the expected value (mean) of the sum of two 
random variables is equal to the sum of their expected values (means). 
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MEAN OF THE SUM OF RANDOM VARIABLES 


How much variability is there in the total number of passengers who go on 
Pete’s and Erin’s tours on a randomly chosen day? Let’s think about the possible 
values of T = X + Y. The number of passengers X on Pete’s tour is between 
2 and 6, and the number of passengers Y on Erin’s tour is between 2 and 5. So 
the total number of passengers T is between 4 and 11. Thus, the range of T is 
1] — 4 = 7. How is this value related to the ranges of X and Y? The range of X 
is + and the range of Y is 3, so 


range of T = range of X + range of Y 


That is, there’s more variability in the values of T than in the values of X or Y 
alone. ‘This makes sense, because the variation in X and the variation in Y both 
contribute to the variation in T. 

What about the standard deviation o7? If we had the probability distribu- 
tion of the random variable T, then we could calculate or. Let’s try to con- 
struct this probability distribution starting with the smallest possible value, 
T = 4. The only way to get a total of 4 passengers is if Pete has X = 2 passen- 
gers and Erin has Y = 2 passengers. We know that P(X = 2) = 0.15 and that 
P(Y = 2) = 0.3. If the two events X = 2 and Y = 2 are independent, then we 
can multiply these two probabilities. Otherwise, we’re stuck. In fact, we can’t 
calculate the probability for any value of T unless X and Y are independent 
random variables. 


Probability models often assume independence when the random variables 
describe outcomes that appear unrelated to each other. You should always ask 
whether the assumption of independence seems reasonable. For instance, it’s rea- 
sonable to treat the random variables X = number of passengers on Pete’s trip and 
Y = number of passengers on Erin’s trip on a randomly chosen day as indepen- 
dent, because the siblings operate their trips in different parts of the country. Now 
we can calculate the probability distribution of the total number of passengers 
that day. 
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FIGURE 6.10 The probability 
distribution of the random vari- 
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Pete’s Jeep Tours and Erin’s Adventures 


Sum of two random variables 


Let T = X + Y, as before. Because X and Y are independent random 
variables, P(T = 4) = P(X = 2 and Y = 2) = P(X = 2) X P(Y = 2) 
= (0.15)(0.3) = 0.045. There are two ways to get a total of T = 5 pas- 
sengers on a randomly selected day: X = 3, Y = 2 or X = 2, Y = 3. So 
P(T = 5) = P(X = 2 and Y = 3) + P(X = 3 and Y = 2) = (0.15)(04) 
+ (0.25)(0.3) = 0.06 + 0.075 = 0.135. 


We can construct the probability distribution by listing all combina- 
tions of X and Y that yield each possible value of T and adding the 
corresponding probabilities. Here is the result. 


Value t;: 4 5 6 7 8 9 10 ll 
Probability p;: 0.045 0.135 0.235 0.265 0.190 0.095 0.030 0.005 


You can check that the probabilities add to 1. A histogram of the probability distri- 
bution is shown in Figure 6.10. 


Probability 


able 7 = the total number of 3 4 5 6 7 8 9 10 It 


passengers on Pete’s and Erin’s 
trips on a randomly chosen day. 


Total number of passengers (7) 


The mean of T is pop = Sip; = (4)(0.045) + (5)(0.135) +... + (11)(0.005) = 6.85. 
Recall that x = 3.75 and py = 3.10. Our calculation confirms that 


Lop = px + ply = 3.75 + 3.10 = 6.85 
What about the variance of T? It’s 
of = X(t, — pr)’ pi 
= (4 — 6.85)7(0.045) + (5 — 6.85)?(0.135) 
+... + (11 — 6.85)7(0.005) = 2.0775 
Recalling that ox = 1.1875 and oF = 0.89, we see that 1.1875 + 0.89 = 2.0775. Thatis, 
of = 0% + oF 

To find the standard deviation of T, take the square root of the variance 


or = V2.0775 = 1441 


Section 6.2 Transforming and Combining Random Variables 373 


As the preceding example illustrates, when we add two independent random 
variables, their variances add. Standard deviations do not add. For Pete’s and 
Erin’s passenger totals, 


ox + oy = 1.0897 + 0.943 = 2.0327 
This is very different from op = 1.441. 


VARIANCE OF THE SUM OF INDEPENDENT RANDOM VARIABLES 


For any two independent random variables X and Y, if T = X + Y, then the 
variance of T is 
oF = Ge oh oF 


In general, the variance of the sum of several independent random variables is 
the sum of their variances. 


You might be wondering whether there’s a formula for computing the variance 
of the sum of two random variables that are not independent. There is, 
but it’s beyond the scope of this course. Just remember that you can add rT) 
variances only if the two random variables are independent and that you 
can never add standard deviations. 


SAT Scores 


The role of independence 


A college uses SAT scores as one criterion for admission. Experience has shown 
that the distribution of SAT scores among its entire population of applicants is 
such that 


SAT Math score X: ix = 519 ox = 115 
SAT Critical Reading score Y: pty = 507 exe = IN 


PROBLEM: What are the mean and standard deviation of the total score X + Y fora randomly 
selected applicant to this college? 


SOLUTION: The mean total score is 
| Uy ee [ly + [Uke — yt) ae 507 >= 1026 


The variance and standard deviation of the total cannot be computed from the information given. 
SAT Math and Critical Reading scores are not independent, because students who score high on one 
exam tend to score high on the other also. 


For Practice Try Exercise 


The next example involves two independent random variables and some 
transformations. 
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Pete’s and Erin’s Tours 
Rules for adding random variables 


Earlier, we defined X = the number of passengers that Pete has and 
Y = the number of passengers that Erin has on a randomly selected 
day. Recall that 


jx = 3.75, ox = 1.0897 jy = 3.10, oy = 0.943 


Pete charges $150 per passenger and Erin charges $175 per passenger. 


PROBLEM: Calculate the mean and the standard deviation of the total amount 
that Pete and Erin collect on a randomly chosen day. 


SOLUTION: Let W= the total amount collected. Then W= 150X + 175Y. If 
we let C= 150Xand G = 175Y, then we can write Was the sum of two random vari- 
ables: W = C + G.We can use what we learned earlier about the effect of multiplying 
by a constant to find the mean and standard deviation of Cand G. For C= 150X, 


Lc = 150p1y = 150(3.75) = $562.50 and o¢ = 150(1.0897) = $163.46 
Fone — 479, 
Lg = 175 pty = 175(3.10) = $542.50 and og = 175(0.943) = $165.03 
We know that the mean of the sum of two random variables equals the sum of their means: 
Lw= bet pg = 562.50 + 542.50 = 1105 
On average, Pete and Erin expect to collect a total of $ 1105 per day. 


Because the number of passengers Xand Yare independent random variables, so are the amounts of 
money collected C and G. Therefore, the variance of Wis the sum of the variances of Cand G. 


of, = 03 + 03 = (163.46)" + (165.03)" = 53,954.07 
To get the standard deviation, we take the square root of the variance: 
Ow = V 53,954.07 = 232.28 


The standard deviation of the total amount they collect is $232.28. 


For Practice Try Exercise 


We can extend our rules for adding random variables to situations 
involving repeated observations from the same chance process. For 
instance, suppose a gambler plays two games of roulette, each time 
placing a $1 bet on either red or black. What can we say about his 
total gain (or loss) from playing two games? Earlier, we showed that 
if X = the amount gained on a single $1 bet on red or black, then 
Lx = —$0.05 and ox = $1.00. Because we’re interested in the play- 
er’s total gain over two games, we'll define X; as the amount he gains 
from the first game and X; as the amount he gains from the second 
game. Then, his total gain T = X, + X. Both X; and X; have the 


THINK 
ABOUT IT 
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same probability distribution as X and, therefore, the same mean (—$0.05) and 
standard deviation ($1.00). The player’s expected gain in two games is 


br = bx, + bx, = (—$0.05) + (—$0.05) = —$0.10 


Because knowing the result of one game tells the player nothing about the result 
of the other game, X; and X2 are independent random variables. As a result, 


of = 0%, + o%, = (1.00)? + (1.00)? = 2.00 
and the standard deviation of the player’s total gain is 


or = V2.00 = $1.41 


X, + Xz is not the same as 2X At the beginning of the section, we asked 
whether a roulette player would be better off placing two separate $1 bets on red 
or a single $2 bet on red. The player’s total gain from two $1 bets on red is T = 
X, + X). This sum of random variables has mean pup = —$0.10 and standard devia- 
tion or = $1.41. Now think about what happens if the gambler places a $2 bet on 
red ina single game of roulette. Because the random variable X represents a player’s 
gain froma $1 bet, the random variable Y = 2X represents his gain from a $2 bet. 
What’s the player’s expected gain from a single $2 bet on red? It’s 


jy = 2px = 2(—$0.05) = —$0.10 


That’s the same as his expected gain from playing two games of roulette with a 
$1 bet each time. But the standard deviation of the player’s gain from a single $2 
bet is 


oy = lox = 2($1.00) = $2.00 


Compare this result to op = $1.41. There’s more variability in the gain from a 
single $2 bet than in the total gain from two $1 bets. 

Let’s take this one step further. Would it be better for the player to place a single 
$100 bet on red or to play 100 games and bet $1 each time on red? For the single 
$100 bet, the mean and standard deviation of the amount gained would be 


mean = 100px = 100(—$0.05) = —$5.00 
standard deviation = 100cx = 100($1.00) = $100.00 


For 100 games with a $1 bet, the mean and standard deviation of the amount 
gained would be 


mean = Lx, Lx, i LX 
= (—$0.05) + (—$0.05) + ---+ (—$0.05) = —$5.00 
variance = O*X, + O*X, i OX, = (1)? + (1)? +---+ (1)? = 100 


standard deviation = V100 = $10.00 


The player has a much better chance of winning (or losing) big with a single $100 
bet than with 100 separate $1 bets. Of course, the casino accepts thousands of bets 
each day, so it can count on being fairly close to its expected return of 5 cents per 


dollar bet. 
2 
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o/ CHECK YOUR UNDERSTANDING 


A large auto dealership keeps track of sales and lease agreements made during each hour 
of the day. Let X = the number of cars sold and Y = the number of cars leased during the 
first hour of business on a randomly selected Friday. Based on previous records, the prob- 
ability distributions of X and Y are as follows: 


Cars sold x;: 0 1 2 3 
Probability p;: 0.3 0.4 0.2 0.1 


Mean: pty = 1.1 Standard deviation: ox = 0.943 


Cars leased y;: 0 l 2 
Probability p;: 0.4 0.5 0.1 


Mean: py = 0.7 Standard deviation: oy = 0.64 


Define T = X + Y. Assume that X and Y are independent. 

1. Find and interpret pur. 

2. Compute a7. Show your work. 

3. The dealership’s manager receives a $500 bonus for each car sold and a $300 bonus 


for each car leased. Find the mean and standard deviation of the manager’s total bonus 
B. Show your work. 


Differences of random variables Now that we’ve examined sums of 
random variables, it’s time to investigate the difference of two random variables. 
Let’s start by looking at the difference in the number of passengers that Pete and 
Erin have on their tours on a randomly selected day, D = X — Y. Because Pete 
averages jix = 3.75 passengers per trip and Erin averages pry = 3.10 passengers per 
trip, the average difference is wp = 3.75 — 3.10 = 0.65 passengers. That is, Pete 
averages ().65 more passengers per day than Erin does. We can generalize this re- 
sult for any two random variables as follows: if D = X — Y, then up = px — py. In 
other words, the mean (expected value) of the difference of two random variables 
is equal to the difference of their means (expected values). 


MEAN OF THE DIFFERENCE OF RANDOM VARIABLES 


For any two random variables X and Y, if D = X — Y, then the mean of D is 
Ho EDS ine =x 


Lp = by — bx = 3.10 — 3.75 = —0.65. In other words, Erin averages 0.65 
fewer passengers than Pete does on a randomly chosen day. 

Earlier, we saw that the variance of the sum of two independent random 
variables is the sum of their variances. Can you guess what the variance of the dif- 
ference of two independent random variables will be? (If you were thinking some- 
thing like “the difference of their variances,” think again!) Let’s return to the jeep 
tours scenario. On a randomly selected day, the number of passengers X on Pete’s 
tour is between 2 and 6, and the number of passengers Y on Erin’s tour is between 


The order of subtraction is important. If we had defined D = Y — X, then rT) 
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2 and 5. So the difference in the number of passengers D = X — Y is between 
—3 and 4. Thus, the range of D is 4 — (—3) = 7. How is this value related to the 
ranges of X and Y? The range of X is 4 and the range of Y is 3, so 


range of D = range of X + range of Y 


As with sums of random variables, there’s more variability in the values of the dif- 
ference D than in the values of X or Y alone. This should make sense, because the 
variation in X and the variation in Y both contribute to the variation in D. 

If you follow the process we used earlier with the random variable T = X + Y, 
you can build the probability distribution of D = X— Y. Here it is. 


Value d;: 3 2 l 0 l 2 3 4 
Probability p; 0.015 0.055 0.145 0.235 0.260 0.195 0.080 0.015 


You can use the probability distribution to confirm that: 
l. Up = px — py = 3.75 — 3.10 = 0.65 

2. of = of + oF = 1.1875 + 0.89 = 2.0775 

3. op = V2.0775 = 1.441 


Result 2 shows that, just like with addition, when we subtract two independent 
random variables, variances add. 


VARIANCE OF THE DIFFERENCE OF RANDOM VARIABLES 


For any two independent random variables X and Y, if D = X — Y, then the 
variance of D is 


Op = ox + oF 


Let’s put our new rules for subtracting random variables to use in a familiar 
setting. 


Pete’s Jeep Tours and Erin’s 
Adventures 


Difference of random variables 


We have defined several random variables related to Pete’s and Erin’s tour busi- 
nesses. For a randomly selected day, 


C = amount of money that Pete collects G = amount of money that Erin collects 


Here are the means and standard deviations of these random variables: 


lic = 562.50 lig = 542.50 
oc = 163.46 oc = 165.03 
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PROBLEM: Calculate the mean and the standard deviation of the difference D = C — G inthe 
amounts that Pete and Erin collect on a randomly chosen day. Interpret each value in context. 


SOLUTION: Weknowthat the mean of the difference of two random variables is the difference of 
their means. That is, 


Llp = [le — [ig = 562.50 — 542.50 = 20.00 


On average, Pete collects $ 20 more per day than Erin does. Some days the difference will be more 
than $ 20, other days it will be less, but the average difference after lots of days will be about $ 20. 


Because the number of passengers Xand Yare independent random variables, so are the amounts of 
money collected Cand G. Therefore, the variance of Dis the sum of the variances of Cand G: 


o% = 0% + 03 = (163.46)? + (165.03)? = 53,954.07 


The value op = $282.28 in the example op = 1 /53,954.07 = 232.28 
should look familiar. It’s the same value 


we got earlier when we calculated the 
phic deviation of the total amount "he standard deviation of the difference in the amounts collected by Pete and Erin is $232.28. Even 


that Pete and Erin collect onarandomly — though the average difference in the amounts collected is $20, the difference on individual days will 
chosen day, o7 = $232.28. typically vary from the mean by about $232. 


For Practice Try Exercise 


CHECK YOUR UNDERSTANDING 


A large auto dealership keeps track of sales and lease agreements made during each hour 
of the day. Let X = the number of cars sold and Y = the number of cars leased during the 
first hour of business on a randomly selected Friday. Based on previous records, the prob- 
ability distributions of X and Y are as follows: 


Cars sold x;: 0 l 2 3 
Probability p;: 0.3 0.4 0.2 0.1 


Mean: py = 1.1 Standard deviation: oy = 0.943 


Cars leased y;: 0 l 2 
Probability p;: 0.4 0.5 0.1 
Mean: fly = 0.7 Standard deviation: oy = 0.64 
Define D = X — Y. Assume that X and Y are independent. 
1. Find and interpret pup. 
2. Compute op. Show your work. 


3. The dealership’s manager receives a $500 bonus for each car sold and a $300 bonus 
for each car leased. Find the mean and standard deviation of the difference in the 
manager’s bonus for cars sold and leased. Show your work. 


Combining Normal Random Variables 


So far, we have concentrated on finding rules for means and variances of ran- 
dom variables. If a random variable is Normally distributed, we can use its mean 
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and standard deviation to compute probabilities. The earlier example on young 
women’s heights (page 357) shows the method. What if we combine two Normal 
random variables? 


A Computer Simulation 


Sums and differences of Normal random variables 


We used Fathom software to simulate taking independent SRSs of 1000 ob- 
servations from each of two Normally distributed random variables, X and Y. 
Figure 6.11(a) shows the results. The random variable X is N(3, 0.9) and the ran- 
dom variable Y is N(1, 1.2). What do we know about the sum and difference of 
these two random variables? The histograms in Figure 6.11(b) came from adding 
and subtracting the values of X and Y for the 1000 randomly generated observa- 
tions from each distribution. 


4202 4 6 8 10 (b) 4202 4 6 8 10 


FIGURE 6.11 (a) Histograms showing the results of randomly selecting 1000 values from two 
different Normal random variables X and Y. (b) Histograms of the sum and difference of the 1000 
randomly selected values of X and Y. 


Let’s summarize what we see: 


Sum X + Y Difference X — Y 
Shape: Looks approximately Normal Looks approximately Normal 


Center: About 4, which makes sense About 2, which makes sense 
because because 


[ie = rey ea ee Sy ie ye 


Spread: The spreads of the two distributions are about the same. That makes 
sense because 


ee eae, 2 
Ox+Y = OX-y = OX T OY 


As the previous example illustrates, any sum or difference of independent Normal 
random variables is also Normally distributed. The mean and standard deviation 
of the resulting Normal distribution can be found using the appropriate rules for 
means and variances. 
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Give Me Some Sugar! 
Sums of Normal random variables 


Mr. Starnes likes sugar in his hot tea. From experience, he needs between 8.5 and 
9 grams of sugar in a cup of tea for the drink to taste right. While making his tea 
one morning, Mr. Starnes adds four randomly selected packets of sugar. Suppose 
the amount of sugar in these packets follows a Normal distribution with mean 
2.17 grams and standard deviation 0.08 grams. 


PROBLEM: What's the probability that Mr. Starnes’s tea tastes right? 
SOLUTION: 


Step 1: State the distribution and the values of interest. Let X = the amount of 
sugar in a randomly selected packet. Then X, = amount of sugar in Packet 1, X. = amount of sugar in 
— Packet 2, Xz = amount of sugar in Packet 3, and X, = amount of sugar in Packet 4. Each of these ran- 
dom variables has a Normal distribution with mean 2.17 grams and standard deviation 0.08 grams. 
= We're interested in the total amount of sugar that Mr. Starnes puts in his tea, which is given by 
T= X + Xp + X5 + Xy. 
The random variable Tis a sum of four independent Normal random variables. So T follows a Normal 
distribution with mean 


fir = ply, + bey, + py, 7 My, = 2.17 + 2.17 + 2.17 + 2.17 = 8.68 grams 
and variance 
of = of, + of, + of + 0%, = (0.08)? + (0.08)? + (0.08)? + (0.08)? = 0.0256 
The standard deviation of Tis 
or = V0.0256 = 0.16 grams 


We want to find the probability that the total amount of sugar in Mr. 
Starnes’s tea is between 8.5 and 9 grams. Figure 6.12 shows this 
probability as the area under a Normal curve. 


Step 2: Perform calculations—show your work! To find 
this area, we can standardize the boundary values and use Table A: 


N(8.68,0.16) 


DIED Ba, 
————————— n ee 
0.16 ae © ONG 


Then A—1.13 <Z< 2.00) = 0.9772 — 0.1292 = 0.8480. 


Using Technology: The command normalcdf (lower:8.5, 
upper:9, :8.68, 0:0.16) gives anarea of 0.6470. 


z 


8.50 8.68 9.00 
Total amount of sugar (grams) 


FIGURE 6.12 Normal distribution of the total amount of sugar Step 3: Answer the question. There’s about an 85% chance 
in Mr. Starnes’s tea. that Mr. Starnes’s tea will taste right. 


For Practice Try Exercise 


Here’s an example that involves subtracting two Normal random variables. 
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Put a Lid on It! 


Differences of Normal random variables 


The diameter C of a randomly selected large drink cup at a fast-food restaurant 
follows a Normal distribution with a mean of 3.96 inches and a standard deviation 
of 0.01 inches. The diameter L of a randomly selected large lid at this restaurant 
follows a Normal distribution with mean 3.98 inches and standard deviation 0.02 
inches. For a lid to fit on a cup, the value of L has to be bigger than the value of C, 
but not by more than 0.06 inches. 


PROBLEM: What's the probability that a randomly selected large lid will fit on a randomly chosen 
large drink cup? 


SOLUTION: 


Step 1: State the distribution and the values of interest. We'll define the random 
variable D = L — C to represent the difference between the lid’s diameter and the cup’s diameter. 


The random variable Dis the difference of two independent Normal random variables. So Dfollows a 
Normal distribution with mean 


Lp = Li — Le = 3.98 — 3.96 = 0.02 
and variance 
a} = of + o¢ = (0.02)? + (0.01)? = 0.0005 
The standard deviation of Dis 
op = V0.0005 = 0.0224 


We want to find the probability that the difference Dis between O and 0.06 inches. Figure 6.13 
shows this probability as the area under a Normal curve. 

Step 2: Perform calculations—show your work! To find 
this area, we can standardize the boundary values and use Table A: 


—_ TEU Ose 
=——""“=-9. nd z= ————— = 1. 
70.0224 : 0.0224 


Then P(—0.89 = Z< 1.79) = 0.9633 — 0.1867 = 0.7766. 


Using Technology: The command normalcdf (lower:0, up- 
per:0.06, w:0.02, 0:0.0224) gives anarea of 0.7770. 


Step 3: Answer the question. There's about a 78% chance 
that a randomly selected large lid will fit on a randomly chosen large 
drink cup at this fast-food restaurant. Roughly 22% of the time, the 
FIGURE 6.13 Normal distribution of the difference in lid lid won't fit. This seems like an unreasonably high chance of getting a lid 
diameter and cup diameter at a fast-food restaurant. that doesn’t fit. Maybe the restaurant should find a new supplier! 


0 0.02 0.06 
Difference in diameter (lid - cup), in inches 


For Practice Try Exercise 
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Summary 


e Adding a positive constant a to (subtracting a from) a random variable in- 
creases (decreases) the mean of the random variable by a but does not affect 
its standard deviation or the shape of its probability distribution. 

e = =Multiplying (dividing) a random variable by a positive constant b multiplies 
(divides) the mean of the random variable by b and the standard deviation by 
b but does not change the shape of its probability distribution. 

e A linear transformation of a random variable involves adding or subtracting 
a constant a, multiplying or dividing by a constant b, or both. We can write a 
linear transformation of the random variable X in the form Y = a + bX. The 
shape, center, and spread of the probability distribution of Y are as follows: 
Shape: Same as the probability distribution of X if b > 0. 

Center: py = a + bx 
Spread: oy = |b|ox 

e = IfX and Y are any two random variables, 

Lixsy = bx + pty: The mean of the sum of two random variables is the sum 
of their means. 

Lix-y = [x — py: The mean of the difference of two random variables is the 
difference of their means. 

e If X and Y are independent random variables, then knowing the value of one 
variable tells you nothing about the value of the other. In that case, variances add: 
oxy = ox + oy: The variance of the sum of two independent random vari- 
ables is the sum of their variances. 


ox_y = ox + of: The variance of the difference of two independent random 
variables is the sum of their variances. 


e The sum or difference of independent Normal random variables follows a 
Normal distribution. 


Exercises 


35. Crickets The length in inches of a cricket chosen 
at random from a field is a random variable X with 
mean 1.2 inches and standard deviation 0.25 inches. 
Find the mean and standard deviation of the length 


randomly selected 20-year-old man in inches. There 
are 12 inches in a foot. 


37. Get on the boat! A small ferry runs every half hour 


36. 


Y of a randomly chosen cricket from the field in 
centimeters. There are 2.54 centimeters in an inch. 


Men’s heights A report of the National Center for 

Health Statistics says that the height of a 20-year-old 
man chosen at random is a random variable H with 
mean 5.8 feet and standard deviation 0.24 feet. Find 
the mean and standard deviation of the height J of a 


from one side of a large river to the other. The num- 
ber of cars X on a randomly chosen ferry trip has the 
probability distribution shown below. You can check 
that py = 3.87 and oy = 1.29. 


Cars: 0 1 2 3 4 5 
Probability: 0.02 0.05 


0.08 0.16 0.27 0.42 
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40. 


(a) ‘The cost for the ferry trip is $5. Make a graph of the 
probability distribution for the random variable M = 
money collected on a randomly selected ferry trip. 
Describe its shape. 


(b) Find and interpret yy, 
(c) Find and interpret oy. 


38. Skee Ball Ana is a dedicated Skee Ball player (see 
photo) who always rolls for the 50-point slot. The 
probability distribution of Ana’s score X on a single 
roll of the ball is shown below. You can check that 
pix = 23.8 and oy = 12.63. 


Score: 10 20 30 40 50 
Probability: O32 027 O19 O15 07 


(a) A player receives one ticket from the game for every 
10 points scored. Make a graph of the probability 
distribution for the random variable T = number 
of tickets Ana gets on a randomly selected throw. 
Describe its shape. 


(b) Find and interpret sur. 
(c) Find and interpret or. 


Exercises 39 and 40 refer to the following setting. Ms. Hall 
gave her class a 10-question multiple-choice quiz. Let 

X = the number of questions that a randomly selected stu- 
dent in the class answered correctly. The computer output 
below gives information about the probability distribution 
of X. To determine each student’s grade on the quiz (out 
of 100), Ms. Hall will multiply his or her number of cor- 
rect answers by 5 and then add 50. Let G = the grade of a 
randomly chosen student in the class. 


N Mean Median StDev Min Max Q, Q3 
30 6 8.5 VS 4 IG 8 g 


39. Easy quiz 
(a) Find the mean of G. Show your method. 


(b) Find the standard deviation of G. Show your 
method. 


(c) How do the variance of X and the variance of G 
compare? Justify your answer. 


(a) 
(b) 
(c) 


and 


aoe 


Easy quiz 

Find the median of G. Show your method. 

Find the IOR of G. Show your method. 

What shape would the probability distribution of G 


have? Justify your answer. 


Get on the boat! Refer to Exercise 37. The ferry 
company’s expenses are $20 per trip. Define the ran- 
dom variable Y to be the amount of profit (money col- 
lected minus expenses) made by the ferry company 
ona randomly selected trip. That is, Y = M — 20. 


Find and interpret the mean of Y. 


Find and interpret the standard deviation of Y. 


. The Tri-State Pick 3 Most states and Canadian 


provinces have government-sponsored lotteries. Here 
is a simple lottery wager, from the ‘Tri-State Pick 3 
game that New Hampshire shares with Maine and 
Vermont. You choose a number with 3 digits from 0 
to 9; the state chooses a three-digit winning number 
at random and pays you $500 if your number is 
chosen. Because there are 1000 numbers with three 
digits, you have probability 1/1000 of winning. 
‘Taking X to be the amount your ticket pays you, the 
probability distribution of X is 


Payoff: $0 $500 
Probability: 0.999 0.001 


Show that the mean and standard deviation of X are 


px = $0.50 and ox = $15.80. 


If you buy a Pick 3 ticket, your winnings are W = X — 1, 
because it costs $1 to play. Find the mean and standard 
deviation of W. Interpret each of these values in context. 


Get on the boat! Based on the analysis in Exercise 41, 
the ferry company decides to increase the cost of a trip 
to $6. We can calculate the company’s profit Y on a ran- 
domly selected trip from the number of cars X. Find the 
mean and standard deviation of Y. Show your work. 


. Making a profit Rotter Partners is planning a major 


investment. From experience, the amount of profit X 
(in millions of dollars) on a randomly selected invest- 
ment of this type is uncertain, but an estimate gives 
the following probability distribution: 


Profit: l 55 ey 4 10 
Probability: 0.1 0.2 0.4 0.2 0.1 


Based on this estimate, py = 3 and oy = 2.52. Rotter 
Partners owes its lender a fee of $200,000 plus 10% 
of the profits X. So the firm actually retains Y = 

0.9X — 0.2 from the investment. Find the mean and 
standard deviation of Y. Show your work. 
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CHAPTER 6 


. Too cool at the cabin? During the winter months, 


the temperatures at the Starneses’ Colorado cabin 
can stay well below freezing (32°F or 0°C) for 
weeks at a time. To prevent the pipes from freezing, 
Mrs. Starnes sets the thermostat at 50°F. She also 
buys a digital thermometer that records the indoor 
temperature each night at midnight. Unfortunately, 
the thermometer is programmed to measure the 
temperature in degrees Celsius. Based on several 
years’ worth of data, the temperature T in the cabin 
at midnight on a randomly selected night follows a 
Normal distribution with mean 8.5°C and standard 
deviation 2.25°C. 


Let Y = the temperature in the cabin at midnight 
on a randomly selected night in degrees Fahrenheit 
(recall that F = (9/5)C + 32). Find the mean and 
standard deviation of Y. 


Find the probability that the midnight temperature 
in the cabin is below 40°F. Show your work. 


Cereal A company’s single-serving cereal boxes 
advertise 9.63 ounces of cereal. In fact, the amount 
of cereal X in a randomly selected box follows a 
Normal distribution with a mean of 9.70 ounces and 
a standard deviation of 0.03 ounces. 


Let Y = the excess amount of cereal beyond what's 
advertised in a randomly selected box, measured in 
grams (1 ounce = 28.35 grams). Find the mean and 
standard deviation of Y. 


Find the probability of getting at least 3 grams more 
cereal than advertised. Show your work. 


His and her earnings Researchers randomly select a 
married couple in which both spouses are employed. 
Let X be the income of the husband and Y be the 
income of the wife. Suppose that you know the 
means jx and jy and the variances o% and o¥ of 
both variables. 


Is it reasonable to take the mean of the total income 
X + Y to be py + py? Explain your answer. 


Is it reasonable to take the variance of the total in- 
come to be ox + of? Explain your answer. 


Rainy days Imagine that we randomly select a day 
from the past 10 years. Let X be the recorded rainfall 
on this date at the airport in Orlando, Florida, and 

Y be the recorded rainfall on this date at Disney 
World just outside Orlando. Suppose that you know 
the means juy and jy and the variances o% and o¥ of 
both variables. 


Is it reasonable to take the mean of the total rainfall 
X + Y to be py + py? Explain your answer. 


Is it reasonable to take the variance of the total rain- 
fall to be o% + of? Explain your answer. 
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Get on the boat! Refer to Exercise 41. Find the 
expected value and standard deviation of the total 
amount of profit made on two randomly selected 
days. Show your work. 


The Tri-State Pick 3 Refer to Exercise 42. Suppose 

you buy one Pick 3 ticket on each of two consecutive 
days. Find the expected value and standard deviation 
of your total winnings. Show your work. 


Essay errors ‘Typographical and spelling errors can be 
either “nonword errors” or “word errors.” A nonword 
error is not a real word, as when “the” is typed as “teh.” 
A word error is a real word, but not the right word, as 
when “lose” is typed as “loose.” When students are 
asked to write a 250-word essay (without spell-checking), 
the number of nonword errors X in a randomly selected 
essay has the following probability distribution: 


Value: 0 ] 2 3 4 
Probability: 0.1 


by = 2.1 ox = 1.136 


The number of word errors Y has this probability 
distribution: 


Value: 0 Il 2 3 
Probability: 0.4 


Oe 1.0 


Assume that X and Y are independent. 

An English professor deducts 3 points from a 
student’s essay score for each nonword error and 
2 points for each word error. Find the mean and 
standard deviation of the total score deductions for a 
randomly selected essay. Show your work. 


The Tri-State Pick 3 Refer to Exercise 42. You and 
a friend decide to play Pick 3, but with two different 
strategies. Your friend buys a $1 Pick 3 ticket on 
each of five consecutive days. You bet $5 ona single 
number on your Pick 3 ticket. Find the mean and 
standard deviation of the total winnings for you and 
your friend. Show your work. 


pry = 1.0 


Essay errors Refer to Exercise 51. 


Find the mean and standard deviation of the difference 
Y — X in the number of errors made by a randomly 
selected student. Interpret each value in context. 


From the information given, can you find the prob- 
ability that a randomly selected student makes more 
word errors than nonword errors? If so, find this prob- 
ability. If not, explain why not. 


Study habits ‘The Survey of Study Habits and At- 
titudes (SSHA) is a psychological test that measures 
academic motivation and study habits. The distribu- 
tion of SSHA scores among the women at a college 
has mean 120 and standard deviation 28, and the 
distribution of scores among male students has mean 
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105 and standard deviation 35. You select a single 
male student and a single female student at random 
and give them the SSHA test. 


(a) Find the mean and standard deviation of the dif 
ference (female minus male) between their scores. 
Interpret each value in context. 


(b) From the information given, can you find the prob- 
ability that the woman chosen scores higher than the 
man? If so, find this probability. If not, explain why 
you cannot. 


Essay scores Refer to Exercise 51. Find the mean 

and standard deviation of the difference in score de- 

© ductions (nonword — word) for a randomly selected 
essay. Show your work. 


56. The Tri-State Pick 3 Refer to Exercise 52. Find the 
mean and standard deviation of the difference (you — 
your friend) in winnings. Show your work. 


Exercises 57 and 58 refer to the following setting. In Exer- 
cises 14 and 18 of Section 6.1, we examined the probabil- 
ity distribution of the random variable X = the amount 

a life insurance company earns on a randomly chosen 

5-year term life policy. Calculations reveal that 

ix = $303.35 and ox = $9707.57. 

57. Life insurance The risk of insuring one person’s 
life is reduced if we insure many people. Suppose 
that we insure two 21-year-old males, and that their 
ages at death are independent. If X; and X) are the 
insurer’s income from the two insurance policies, the 
insurer’s average income W on the two policies is 


XGA) 

2 
Find the mean and standard deviation of W. (You 
see that the mean income is the same as for a single 
policy but the standard deviation is less.) 


We 


58. Life insurance If four 21-year-old men are insured, 

the insurer’s average income is 
ee 
Ve 
4 

where X; is the income from insuring one man. 
Assuming that the amount of income earned on 
individual policies is independent, find the mean 
and standard deviation of V. (If you compare with the 
results of Exercise 57, you should see that averaging 
over more insured individuals reduces risk.) 


59. ‘Time and motion A time-and-motion study mea- 
sures the time required for an assembly-line worker to 
perform a repetitive task. The data show that the time 
required to bring a part from a bin to its position on 
an automobile chassis varies from car to car according 
to a Normal distribution with mean 11 seconds and 
standard deviation 2 seconds. The time required to 
attach the part to the chassis follows a Normal distri- 


60. 


62. 


bution with mean 20 seconds and standard deviation 
4 seconds. The study finds that the times required for 
the two steps are independent. A part that takes a long 
time to position, for example, does not take more or 
less time to attach than other parts. 


What is the distribution of the time required for the 
entire operation of positioning and attaching a ran- 
domly selected part? 


Management’s goal is for the entire process to take 
less than 30 seconds. Find the probability that this 
goal will be met for a randomly selected part. 


Electronic circuit ‘The design of an electronic circuit 
for a toaster calls for a 100-ohm resistor and a 250-ohm 
resistor connected in series so that their resistances 
add. The components used are not perfectly uniform, 
so that the actual resistances vary independently 
according to Normal distributions. ‘The resistance of 
100-ohm resistors has mean 100 ohms and standard 
deviation 2.5 ohms, while that of 250-ohm resistors 
has mean 250 ohms and standard deviation 2.8 ohms. 


What is the distribution of the total resistance of the 
two components in series for a randomly selected 
toaster? 


Find the probability that the total resistance for a 
randomly selected toaster lies between 345 and 355 
ohms. 


Swim team Hanover High School has the best 
women’s swimming team in the region. The 400-meter 
freestyle relay team is undefeated this year. In the 
400-meter freestyle relay, each swimmer swims 100 
meters. ‘l’he times, in seconds, for the four swimmers 
this season are approximately Normally distributed 
with means and standard deviations as shown. Assume 
that the swimmer’s individual times are independent. 
Find the probability that the total team time in the 
400-meter freestyle relay for a randomly selected race is 
less than 220 seconds. 


Swimmer Mean Std. dev. 
Wendy or 2.8 
Jill 58.0 3.0 
Carmen 56.3 2.6 
Latrice 54.7 All 


Toothpaste Ken is traveling for his business. He has 
a new 0).85-ounce tube of toothpaste that’s supposed 
to last him the whole trip. The amount of toothpaste 
Ken squeezes out of the tube each time he brushes 
varies according to a Normal distribution with mean 
0.13 ounces and standard deviation 0.02 ounces. 

If Ken brushes his teeth six times on a randomly 
selected trip, what’s the probability that he’ll use all 
the toothpaste in the tube? 


386 


CHAPTER 6 


RANDOM VARIABLES 


63. Auto emissions The amount of nitrogen oxides 65. The mean of T is 
my (NOX) present in the exhaust of a particular type 
rol of car varies from car to car according to a Normal eI ELIS eS) Oe Se 
we distribution with mean 1.4 grams per mile (g/mi) and 66. ‘The standard deviation of T is 
standard deviation 0.3 g/mi. Two randomly selected ome hie Tenet cy lunar: 


64. 


cars of this type are tested. One has 1.1 g/mi of NOX; 
the other has 1.9 g/mi. The test station attendant 
finds this difference in emissions between two similar 
cars surprising. If the NOX levels for two randomly 
chosen cars of this type are independent, find the 
probability that the difference is at least as large as 
the value the attendant observed. 


Loser buys the pizza Leona and Fred are friendly 
competitors in high school. Both are about to take the 
ACT college entrance examination. They agree that 
if one of them scores 5 or more points better than the 
other, the loser will buy the winner a pizza. Suppose 
that in fact Fred and Leona have equal ability, so 
that each score varies Normally with mean 24 and 
standard deviation 2. (‘The variation is due to luck in 
guessing and the accident of the specific questions 
being familiar to the student.) The two scores are 
independent. What is the probability that the scores 
differ by 5 or more points in either direction? 


Multiple choice: Select the best answer for Exercises 65 
and 66, which refer to the following setting. ‘The number 
of calories in a l-ounce serving of a certain breakfast cereal 
is arandom variable with mean 110 and standard deviation 
10. The number of calories in a cup of whole milk is a 
random variable with mean 140 and standard deviation 12. 
For breakfast, you eat | ounce of the cereal with 1/2 cup of 
whole milk. Let T’ be the random variable that represents 
the total number of calories in this breakfast. 


Statistics for investing (3.1) Joe’s retirement plan 
invests in stocks through an “index fund” that follows 
the behavior of the stock market as a whole, as mea- 
sured by the Standard & Poor’s (S&P) 500 stock index. 
Joe wants to buy a mutual fund that does not track 

the index closely. He reads that monthly returns from 
Fidelity Technology Fund have correlation r = 0.77 
with the S&P 500 index and that Fidelity Real Estate 
Fund has correlation r = 0.37 with the index. 


Which of these funds has the closer relationship to 
returns from the stock market as a whole? How do 
you know? 


Does the information given tell Joe anything about 


which fund has had higher returns? 
Buying stock (5.3, 6.1) You purchase a hot stock 


_ for $1000. The stock either gains 30% or loses 25% 


each day, each with probability 0.5. Its returns on 
consecutive days are independent of each other. You 
plan to sell the stock after two days. 


What are the possible values of the stock after two 
days, and what is the probability for each value? 
What is the probability that the stock is worth more 
after two days than the $1000 you paid for it? 


What is the mean value of the stock after two days? 
(Comment: You see that these two criteria give differ- 
ent answers to the question “Should I invest?”) 


Binomial and Geometric 
Random Variables 


By the end of the section, you should be able to: 
Find probabilities involving geometric random 
variables. 


When appropriate, use the Normal approximation to the 
binomial distribution to calculate probabilities.* 


WHAT YOU WILL LEARN 


e Determine whether the conditions for using a binomial e 
random variable are met. 


Compute and interpret probabilities involving binomial 
distributions. 

Calculate the mean and standard deviation of a bino- 
mial random variable. Interpret these values in context. 


*This topic is not required for the AP® Statistics exam. Some teachers prefer to discuss this topic when 
presenting the sampling distribution of p (Chapter 7). 
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When the same chance process is repeated several times, we are often interested 
in whether a particular outcome does or doesn’t happen on each repetition. Here 
are some examples: 


e ‘To test whether someone has extrasensory perception (ESP), choose one of 
four cards at random —a star, wave, cross, or circle. Ask the person to iden- 
tify the card without seeing it. Do this a total of 50 times and see how many 
cards the person identifies correctly. Chance process: choose a card at random. 
Outcome of interest: person identifies card correctly. Random variable: X = 
number of correct identifications. 


e A shipping company claims that 90% of its shipments arrive on time. To test 
this claim, take a random sample of 100 shipments made by the company last 
month and see how many arrived on time. Chance process: Randomly select 
a shipment and check when it arrived. Outcome of interest: arrived on time. 
Random variable: Y = number of on-time shipments. 


e In the game of Pass the Pigs, a player rolls a pair of pig-shaped dice. On each 
roll, the player earns points according to how the pigs land. If the player gets 
a “pig out,” in which the two pigs land on opposite sides, she loses all points 
earned in that round and must pass the pigs to the next player. A player can 
choose to stop rolling at any point during her turn and to keep the points 
that she has earned before passing the pigs. Chance process: roll the pig dice. 
Outcome of interest: pig out. Random variable: T = number of rolls until the 
player pigs out. 


Some random variables, like X and Y in the first two examples above, count the 
number of times the outcome of interest occurs in a fixed number of repetitions. 
They are called binomial random variables. Other random variables, like T in 
the Pass the Pigs setting, count the number of repetitions of the chance process it 
takes for the outcome of interest to occur. They are known as geometric random 
variables. These two special types of discrete random variables are the focus of this 
section. 


Binomial Settings and Binomial 
Random Variables 


What do the following scenarios have in common? 


e ‘Toss a coin 5 times. Count the number of heads. 


e Spin a roulette wheel 8 times. Record how many times the ball lands in a 
red slot. 


e ‘Take a random sample of 100 babies born in U.S. hospitals today. Count the 
number of females. 


In each case, we’re performing repeated trials of the same chance process. 
The number of trials is fixed in advance. Also, knowing the outcome of one 
trial tells us nothing about the outcome of any other trial. That is, the tri- 
als are independent. We’re interested in the number of times that a specific 
event (we'll call ita “success”) occurs. Our chances of getting a “success” are 
the same on each trial. When these conditions are met, we have a binomial 
setting. 
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EE 


DEFINITION: Binomial setting 


A binomial setting arises when we perform several independent trials of the same 
chance process and record the number of times that a particular outcome occurs. 
The four conditions for a binomial setting are 


e Binary? The possible outcomes of each trial can be classified as “success” or 


“failure.” 

e Independent? Trials must be independent; that is, knowing the result of one trial 
The boldface letters in the box give must not tell us anything about the result of any other trial. 
ova De Di ey Oo TeneTanEN Oe e Number? The number of trials of the chance process must be fixed in advance. 
conditions for a binomial setting: just k fe é 
check the BINS! e Success? There is the same probability p of success on each trial. 


When checking the Binary condition, note that there can be more than two 
possible outcomes per trial—a roulette wheel has numbered slots of three colors: 
red, black, and green. If we define “success” as having the ball land in a red slot, 
then “failure” occurs when the ball lands in a black or a green slot. 

Think of tossing a coin n times as an example of the binomial setting. Each toss 
gives either heads or tails. Knowing the outcome of one toss doesn’t change the 
probability of a head on any other toss, so the tosses are independent. If we call 
heads a success, then p is the probability of a head and remains the same as long 
as we toss the same coin. For tossing a fair coin, p is 0.5. The number of heads we 
count is a binomial random variable X. The probability distribution of X is called 
a binomial distribution. 


(Mi 


DEFINITION: Binomial random variable and binomial distribution 


The count X of successes in a binomial setting is a binomial random variable. The 
probability distribution of X is a binomial distribution with parameters n and p, 
where nis the number of trials of the chance process and pis the probability of a 
success on any one trial. The possible values of X are the whole numbers from 0 to n. 


Later in the section, we’ll learn how to assign probabilities to outcomes and 
how to find the mean and standard deviation of a binomial random variable. For 
now, it’s important to be able to distinguish situations in which a binomial distri- 
bution does and doesn’t apply. 


From Blood Types to Aces 


Binomial settings and random variables 


PROBLEM: Here are three scenarios involving chance behavior. In each case, determine whether 
or not the given random variable has a binomial distribution. Justify your answer. 


(a) Genetics says that children receive genes from each of their parents independently. Each child of 


a particular set of parents has probability 0.25 of having type 0 blood. Suppose these parents have 
5 children. Let X = the number of children with type 0 blood. 


The Independent condition involves 
conditional probabilities. P(2nd card 
ace | 1st card ace) = 3/51 # P(2nd 
card ace) = 4/52, so the trials are not 
independent. The Success condition is 
about unconditional probabilities. 
P(kth card in a shuffled deck is an 
ace) = 4/52, so this condition is met. 
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(b) Shuffle a deck of cards. Turn over the first 10 cards, one at a time. Let Y = the number of aces 
you observe. 


(c) Shuffle a deck of cards. Turn over the top card. Put the card back in the deck, and shuffle again. 
Repeat this process until you get an ace. Let W = the number of cards required. 


SOLUTION: 
(a) To seeifthis is a binomial setting, we'll check the BINS: 
* Binary? “Success” = has type 0 blood. “Failure” = doesn’t have type 0 blood. 


* — Independent? Knowing one child’s blood type tells you nothing about another's because 
children inherit genes determining blood type independently from each of their parents. 


* Number? There are n = 5 trials of this chance process. 
* Success? The probability ofa success is p= 0.25 on each trial. 


This is a binomial setting. Because X counts the number of successes, it is a binomial random variable 
with parameters n = 5 and p = 0.25. 


(b) Let’s check the BINS: 
¢ Binary? “Success” = get an ace. “Failure” = don’t get an ace. 


* Independent? No. If the first card you turn over is an ace, then the next card is less likely to 
be an ace because you're not replacing the top card in the deck. Similarly, if the first card isn’t an 
ace, the second card is more likely to be an ace. 


* Number? There are n = 10 trials of this chance process. 

* Success? The probability that any particular card ina shuffled deck is an aceis p = 4/52. 
Because the trials are not independent, this is not a binomial setting. 
(c) Let’s check the BINS: 

° Binary? “Success” = get an ace. “Failure” = don’t get an ace. 


* — Independent? Because you are replacing the card in the deck and shuffling each time, the 
result of one trial does not tell you anything about the outcome of any other trial. 


¢ Number? The number of trials is not set in advance. You could get an ace on the first card you 
turn over, or it may take many cards to get an ace. 


* Success? The probability of getting an ace is p = 4/52 on each trial. 


Because there is no fixed number of trials, this is not a binomial setting. 


For Practice Try Exercises 169] and 


Part (c) of the example raises an important point about binomial random vari- 
ables. Besides checking the BINS, make sure that you’re being asked to count 
the number of successes in a certain number of trials. In part (c), you’re asked to 
count the number of trials until you get a success. That can’t be a binomial ran- 
dom variable. (As you'll see later, W is a geometric random variable.) 


CHECK YOUR UNDERSTANDING 


For each of the following situations, determine whether the given random variable has a 
binomial distribution or not. Justify your answer. 

1. Shuffle a deck of cards. ‘Turn over the top card. Put the card back in the deck, and 
shuffle again. Repeat this process 10 times. Let X = the number of aces you observe. 
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2. Choose students at random from your class. Let Y = the number who are over 


6 feet tall. 


3. Rolla fair die 100 times. Sometime during the 100 rolls, one corner of the die chips 
off. Let W = the number of 5s you roll. 


Binomial Probabilities 


In a binomial setting, we can define a random variable (say, X) as the number of 
successes in n independent trials. What’s the probability distribution of X? Let’s 
see if an example can help shed some light on this question. 


Inheriting Blood Type 


Calculating binomial probabilities 


Each child of a particular set of parents has probability 0.25 of having type O 
blood. Genetics says that children receive genes from each of their parents inde- 
pendently. If these parents have 5 children, the count X of children with type O 
blood is a binomial random variable with n = 5 trials and probability p = 0.25 of 
success on each trial. In this setting, a child with type O blood is a “success” 
(S) and a child with another blood type is a “failure” (F). 


What’s P(X = 0)? That is, what’s the probability that none of the 5 chil- 

dren has type O blood? It’s the chance that all 5 children don’t have type 
O blood. The probability that any one of this couple’s children doesn’t 
have type O blood is 1 — 0.25 = 0.75 (complement rule). By the multi- 
plication rule for independent events (Chapter 5), 


P(X = 0) = P(FFFFF) = (0.75)(0.75)(0.75)(0.75)(0.75) = (0.75)> = 0.2373 


How about P(X = 1)? There are several different ways in which exactly 1 of the 5 
children could have type O blood. For instance, the first child born might have 
type O blood, while the remaining 4 children don’t have type O blood. The prob- 
ability that this happens is 


P(SFFFF) = (0.25)(0.75)(0.75)(0.75)(0.75) = (0.25)(0.75)4 


Alternatively, Child 2 could be the one that has type O blood. The corresponding 
probability is 


P(FSFFF) = (0.75)(0.25)(0.75)(0.75)(0.75) = (0.25)(0.75)* 
There are three more possibilities to consider: 


P(FFSFF) = (0.75)(0.75)(0.25)(0.75)(0.75) = (0.25)(0.75)* 
P(FFFSF) = (0.75)(0.75)(0.75)(0.25)(0.75) = (0.25)(0.75)* 
P(FFFFS) = (0.75)(0.75)(0.75)(0.75)(0.25) = (0.25)(0.75)* 
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In all, there are five different ways in which exactly 1 child would have type 
O blood, each with the same probability of occurring. As a result, 


P(X = 1) = P(exactly | child with type O blood) 
= P(SFFFF) + P(FSFFF) + P(FFSFF) + P(FFFSF) + P(FFFFS) 
= 51025) 75\) — 0.39551 


There’s about a 40% chance that exactly 1 of the couple’s 5 children will have type 
O blood. 


Let’s continue with the scenario from the previous example. What if we wanted 
to find P(X = 2), the probability that exactly 2 of the couple’s children have type 
O blood? Because the method doesn’t depend on the specific setting, we use “S” 
for success and “F” for failure for short. 

Do the work in two stages, as shown in the example. 


e Find the probability that a specific 2 of the 5 tries—say, the first and the 
third —give successes. This is the outcome SFSFF. Because tries are indepen- 
dent, the multiplication rule for independent events applies. The probability 
we want is 

P(SFSFF) = P(S)P(F)P(S)P(F)P(F) 

= (0.25)(0.75)(0.25)(0.75)(0.75) 

= (0.25)*(0.75)? 


¢ Observe that any one arrangement of 2 S’s and 3 F’s has this same probability. 
This is true because we multiply together 0.25 twice and 0.75 three times 
whenever we have 2 S’s and 3 F’s. The probability that X = 2 is the probabil- 
ity of getting 2 S’s and 3 F’s in any arrangement whatsoever. Here are all the 
possible arrangements: 
SSFFF SFSFF SFFSF SFFFS FSSFF 
FSFSF FSFFS FFSSF FFSFS FFFSS 


There are 10 of them, all with the same probability. The overall probability of 
2 successes is therefore 


P(X = 2) = 10(0.25)*(0.75)? = 0.26367 
The pattern of this calculation works for any binomial probability. That is, 


P(X = k) = P(exactly k successes in 7 trials) 
= number of arrangements - p4(1 — p)"* 
To use this formula, we must count the number of arrangements of k successes in 


n observations. This number is called the binomial coefficient. We use the fol- 
lowing fact to do the counting without actually listing all the arrangements. 
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The formula for binomial coefficients uses the factorial notation. For any posi- 
tive whole number n, its factorial n! is 


n! =n(n — 1)(n — 2)-...- (3)(2)(1) 


We also define 0! = 1. 

The larger of the two factorials in the denominator of a binomial coefficient 
will cancel much of the n! in the numerator. For example, the binomial coef 
ficient we need to find the probability that exactly 2 of the couple’s 5 children 
inherit type O blood is 


@ _ 51 _ CLAD _ 6A _ 
2/2131 (2)(1)(B)(2)()—(2)(1) 

Some people prefer the notation The binomial coefficient is () not related to the fraction 3. A helpful 
5p instead of (3) for the binomial way to remember its meaning is to read it as “5 choose 2.” Binomial coef- 
coefficient. ficients have many uses, but we are interested in them only as an aid to 


finding binomial probabilities. If you need to compute a binomial coefficient, use 
your calculator. 


TECHNOLOGY > WQMIAL COEFFICIENTS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


To calculate a binomial coefficient like () on the T1-83/84 or TI-89, proceed as follows: 
TI-83/84 TI-89 


e ‘Type 5, press |MATH |, arrow over to PRB, choose nCr, ¢ From the home screen, press[2nd][5](MATH), choose 
and press |ENTER|. ‘Then type 2 and press [ENTER Probability, nCr (,and press | ENTER |. Complete 


again to execute the command 5 nCr 2. the command nCr (5,2) and press | ENTER |. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 
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The binomial coefficient (',) counts the number of different ways in which k 
successes can be arranged among n7 trials. The binomial probability P(X = k) is 
this count multiplied by the probability of any one specific arrangement of the k 
successes. Here is the result we seek. 


BINOMIAL PROBABILITY FORMULA 


If X has the binomial distribution with n trials and probability p of success on 
each trial, the possible values of X are 0,1, 2,...,n. Ifk is any one of these 
values, 


A= (pa =o 


With our formula in hand, we can now calculate any binomial probability. 


Inheriting Blood Type 


Using the binomial probability formula 


PROBLEM: Each child ofa particular set of parents has probability 0.25 of having type 0 blood. 
Suppose the parents have 5 children. 


(a) Find the probability that exactly 3 of the children have type O blood. 


(b) Should the parents be surprised if more than 3 of their children have type 0 blood? Justify your 
answer. 


SOLUTION: Let X= the number of children with type 0 blood. We know that Xhas a binomial 
distribution with n = 5 and p = 0.25. 


(a) We want to find P(X = 3). 
5 
AS) -)t0.25)%(0.75) = 10(0.25)°(0.75)" = 0.08789 
There is about a 9% chance that exactly 3 of the 5 children have type 0 blood. 
(b) To answer this question, we need to find P(X > 3). 


Ad 3) — 0 — 4) — 9) — (F)\0.25)*(0.75)" + (2)t0.25)%(0.75) 
= 5(0.25)*(0.75)' + 1(0.25)°(0.75)° = 0.01465 + 0.00098 = 0.01563 


Because there's only about a 1.5% chance of having more than 3 children with type 0 blood, the 
parents should definitely be surprised if this happens. 


For Practice Try Exercises and 


We could also use the calculator’s binompdf and binomcdf£ commands 
to perform the calculations in the previous example. The following Technology 
Corner shows how to do it. 
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TECHNOLOGY 


CORNER BINOMIAL PROBABILITY ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


There are two handy commands on the T1-83/84 and TI-89 for finding binomial probabilities: binompdf and 
binomcedf. The inputs for both commands are the number of trials m, the success probability p, and the values of inter- 
est for the binomial random variable X. 


binompdf (n,p,k) computes P(X = k) 
binomcdf (n,p,k) computes P(X = k) 


Let’s use these commands to confirm our answers in the previous example. 


(a) Find the probability that exactly 3 of the children have type O blood. 


TI-83/84 TI-89 
© Press (DISTR) andchoose binompdf(. ¢ In the Stats/List Editor, Press (Distr) and 
e¢ OS 2.55 or later: In the dialog box, enter these val- choose Binomial Pdf. 
ues: trials:5, p:0.25, x value:3, choose e In the dialog box, enter these values: Num Trials, 
Paste, and then press [ENTER|, Older OS: Complete n:5, Prob Success,p:0.25, Xvalue:3, and 
the command binompdf (5,0.25,3) and press then choose | ENTER |. 


ENTER }. 
NORMAL FLOAT AUTO REAL RADIAN CL i 


binompdf(S,.25,3) 
- 987890625 


MAIN RAD AUTO FUNC 378 


These results agree with our previous answer using the binomial probability formula: 0.08789. 


(b) Should the parents be surprised if more than 3 of their children have type O blood? 
To find P(X > 3), use the complement rule: 


P(X > 3) = 1-P(X <3) = 1-binomed£ (5,0.25,3) 


e Press/2nd]/VARS| (DISTR) and choose e In the Stats/List Editor, Press [F5] (Distr) and 
binomcdf (. choose Binomial Cdf.... 

¢ OS 2.55 or later: In the dialog box, enter these values: ¢ In the dialog box, enter these values: Num Trials, 
trials:5, p:0.25, x value:3, choose Paste, n:5, Prob Success,p:0.25, lower val- 
and then press [ENTER] Subtract this result from 1 to ue:0, upper value: 3 and then choose |ENTER|. 
get the answer. Older OS: Complete the command Subtract this result from | to get the answer. 


binomedf (5,0.25,3)and press [ENTER], Sub- 


tract this result from | to get the answer. 


We could also have done the 
calculation for part (b) as 

AX> 3) = AX= 4) + P\X=5) 
= binompdf (5,0.25,4) + 
binompdf (5,0.25,5) 

= 0.01465 + 0.00098 = 0.01563. 
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NORMAL FLOAT AUTO REAL RADIAN CL fl 


binomcdf(S,.25,3) 
+ 984375 


Poeeee re errr eee retire eee tree ee ters iets iets 


Now we subtract from | to get the desired answer: 1 — 0.984375 = 0.015625. This result agrees with our previous answer 
using the binomial probability formula: 0.01563. 


AP® EXAM TIP Don’t rely on “calculator speak” when showing your work on free-response 
questions. Writing binompdf (5,0.25,3) =0.08789 will not earn you full credit for 

a binomial probability calculation. At the very least, you must indicate what each of those 
calculator inputs represents. For example, “I used binompdf(trials:5,p:0.25,x value:3).” 


Note the use of the complement rule to find P(X > 3) in the Technology 
Corner: P(X > 3) = 1 — P(X S 3). This is necessary because the calculator’s 
binomcdf (n,p,k) command only computes the probability of getting k or 
fewer successes in n trials. Students often have trouble identifying the correct 
third input for the binomcdf£ command when a question asks them to find the 
probability of getting less than, more than, or at least so many successes. 

Here’s a helpful tip to avoid making such a mistake: write out the possible val- 
ues of the variable, circle the ones you want to find the probability of, and cross 
out the rest. In the previous example, X can take values from 0 to 5 and we want 


to find P(X > 3): 
eY27 


Crossing out the values from 0 to 3 shows why the correct calculation is 1 — P(X = 3). 

Take another look at the solution in the blood-type example. The structure is 
much like the one we used when doing Normal calculations. Here is a revised 
summary box that describes the process. 


HOW TO FIND BINOMIAL PROBABILITIES 


Here’s an example that shows the method at work. 
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Free Lunch? 
Binomial calculations 


A local fast-food restaurant is running a “Draw a three, get it free” lunch promo- 
tion. After each customer orders, a touch-screen display shows the message “Press 
here to win a free lunch.” A computer program then simulates one card being 
drawn from a standard deck. If the chosen card is a 3, the customer's order is free. 
Otherwise, the customer must pay the bill. 


PROBLEM: 

(a) All 12 players ona school’s basketball team place individual orders at the restaurant. What is 
the probability that exactly 2 of them win a free lunch? 

(b) If 250 customers place lunch orders on the first day of the promotion, what's the probability 
that fewer than 10 wina free lunch? 


SOLUTION: 

(a) Step 1: State the distribution and the values of interest. Let X = the number 
of players who win a free lunch. There are 12 independent trials of the chance process, each with 
success probability 4/52 (because there are 4 threes in a standard deck of 52 cards). So Xhasa 
binomial distribution with n = 12 and p = 4/52. We want to find P(X = 2). 


Step 2: Perform calculations—show your work! The binomial probability formula gives 


ra=2)=(12)(2)(22)" <orse 


Using technology: The command binompdf (trials:12,p:4/52,x value: 2) also gives 
0.1754. 

Step 3: Answer the question. There is about a 17.5% probability that exactly 2 players will 
win a free lunch. 

(b) Step 1: State the distribution and the values of interest. Let Y= the number of 
customers who win a free lunch. There are 250 independent trials of the chance process, each with 
success probability 4/52. So Y has a binomial distribution with n = 250 and p = 4/52. We want to 
find PY < 10). 


Step 2: Perform calculations—show your work! The values of Y that interest us are 


Oil A28 8 @ YY @ Dio WW Wa wo. Bee 


To use the binomial formula, we would have to add up the probabilities for Y= 0, Y=1,...,Y=9. 
That’s too much work! The better option is to use technology: 


AY< 10) = A Y= 9) = binomedf£ (trials:250,p:4/52,xvalue:9) 
= 0.00613 


Step 3: Answer the question. There is almost no chance that fewer than 10 customers will 
win a free lunch. If this actually happened, the customers should be suspicious about the restau- 
rant’s claim. 


For Practice Try Exercise 


Section 6.3 Binomial and Geometric Random Variables 4,397 


ar/ cuteck YOUR UNDERSTANDING 
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0 1 2 3 
Number of children with type O blood (X) 


FIGURE 6.14 Histogram showing 
the probability distribution of the 
binomial random variable X = 
number of children with type 0 
blood in a family with 5 children. 


To introduce her class to binomial distributions, Mrs. Desai gives a 10-item, multiple- 
choice quiz. The catch is, students must simply guess an answer (A through E) for each 
question. Mrs. Desai uses her computer’s random number generator to produce the an- 
swer key, so that each possible answer has an equal chance to be chosen. Patti is one of the 
students in this class. Let X = the number of Patti’s correct guesses. 

1. Show that X is a binomial random variable. 

2. Find P(X = 3). Explain what this result means. 

3. ‘To get a passing score on the quiz, a student must guess correctly at least 6 times. 
Would you be surprised if Patti earned a passing score? Compute an appropriate 
probability to support your answer. 


Mean and Standard Deviation 
of a Binomial Distribution 


What does the probability distribution of a binomial random variable look like? The 
table below shows the possible values and corresponding probabilities for X = 
the number of children with type O blood. This is a binomial random variable with 
n = 5 and p = 0.25. Figure 6.14 shows a histogram of the probability distribution. 


Value x;: 0 | 2 3 4 5 
Probability p;; 0.23730 0.39551 0.26367 0.08789 0.01465 0.00098 


Let’s describe what we see. 

Shape: The probability distribution of X is skewed to the 
right. Because the chance that any one of the couple’s chil- 
dren inherits type O blood is 0.25, it’s quite likely that 0, 1, 
or 2 of the children will have type O blood. Larger values of 
X are much less likely. 

Center: The median number of children with type O blood 
is 1 because that’s where the 50th percentile of the distribu- 
tion falls. How about the mean? It’s 


px = Dxqp; = (0)(0.23730) + (1)(0.39551) +--+ + 
aaah (5)(0.00098) = 1.25 


So the expected number of children with type O blood in 
families like this one with 5 children is 1.25. 


Spread: The variance of X is 
0% = XK — px) pi 
= (0 — 1.25)(0.23730) + (1 — 1.25)7(0.39551) +--+ + 
(5 — 1.25)?(0.00098) = 0.9375 


So the standard deviation of X is 
ox = V 0.9375 = 0.968 


The number of children with type O blood will typically differ from the mean by 
about 0.968 in families like this one with 5 children. 
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THINK 
ABOUT IT 


RANDOM VARIABLES 


Did you think about why the mean is py = 1.25? Because each child has a 
0.25 chance of inheriting type O blood, we’d expect one-fourth of the 5 children 
to have this blood type. In other words, uy = 5(0.25) = 1.25. This method can be 
used to find the mean of any binomial random variable with parameters n and p: 

Hix = np 
There are fairly simple formulas for the variance and standard deviation, too, but 
they aren’t as easy to explain: 


ox=np(l—p) and oy=Vnp(1—p) 


For our family with 5 children, 
ox = 5(0.25)(0.75) = 0.9375 and ox = V0.9375 = 0.968 


MEAN AND STANDARD DEVIATION OF A BINOMIAL RANDOM VARIABLE 


Remember that these formulas work only for binomial distributions. 
They can’t be used for other distributions. 


Where do the binomial mean and variance formulas come 
from? We can derive the formulas for the mean and variance of a binomi- 
al distribution using what we learned about combining random variables in 
Section 6.2. Let’s start with the random variable B that’s described by the fol- 
lowing probability distribution: 


Value b;: 0 l 
Probability p;; 1l—-—p  p 


You can think of B as representing the result of a single trial of some chance pro- 
cess. Ifa success occurs (probability p), then B = 1. Ifa failure occurs, then B = 0. 
Notice that the mean of B is 


ip = Ubipi = (0) — p) + (1)(p) = p 


and that the variance of B is 
2 


o5 = &(bi — ps) pi = (0 — py*(1 — p) + (1 — p)’p 
= pl — p)ip + (1 — p)] 
= pl ~ p) 


Now consider the random variable X = B, + B, + --- + B,. We can think 
of X as counting the number of successes in n independent trials of this chance 
process, with each trial having success probability p. In other words, X is a bino- 
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mial random variable with parameters n and p. By the rules from Section 6.2, the 
mean of X is 

Lx = fp, + pp, tot pa = pt pte +p=np 
and the variance of X is 


ot = of, + of, +--+ of, 


= pl —p) + pl —p) +-+++ pl — p) 
= np(1 — p) 


‘The standard deviation of X is therefore 


ox = Vnp(1 — p) 


Ol 


Bottled Water versus Tap Water 


Binomial distribution in action 


Mr. Bullard’s AP® Statistics class did the Activity on page 346. There were 21 
students in the class. If we assume that the students in his class could not tell tap 
water from bottled water, then each one was basically guessing, with a 1/3 chance 
of being correct. Let X = the number of students who correctly identified the cup 
containing the different type of water. 


PROBLEM: 
(a) Explain why Xis a binomial random variable. 
(b) Find the mean and standard deviation of X. Interpret each value in context. 


(c) Ofthe 21 students in the class, 13 made correct identifications. Are you convinced that Mr. 
Bullard’s students could tell bottled water from tap water? Justify your answer. 


SOLUTION: 


(a) Assuming that students were just guessing, the Activity consisted of 21 repetitions of this 
chance process. Let’s check the BINS: 


° — Binary? Oneach trial, “success” = correct identification; “failure” = incorrect identification. 

* — Independent? One student's result should tell us nothing about any other student's result. 

e Number? There were n = 21 trials. 

* Success? Each student had a p = 1/3 chance of guessing correctly. 

Because Xis counting the number of successes in this binomial setting, it is a binomial random variable. 
(b) The mean of Xis 


fly = np = 21(1/3) = 7 
If the Activity were repeated many times with groups of 21 students who were just guessing, the 
average number of students who guess correctly would be about 7. The standard deviation of Xis 


Oy = Vip(1 — p) = V21(1/3)(2/3) = 2.16 
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If the Activity were repeated many times with groups of 21 students who were just guessing, the 
number of correct identifications would typically differ from 7 by about 2.16. 


(c) The class's result corresponds to X= 13, a value that’s nearly 3 standard deviations above the 
mean. How likely is it that 13 or more of Mr. Bullard’s students would guess correctly? It’s 


P(X= 13) = 1 —P(X=12) 
Using the calculator’s binomedf (trials:21,p:1/3,xvalue:12) command: 
P(X= 13) = 1 — 0.9932 = 0.0068 


The students had less than a 1% chance of getting so many right if they were all just guessing. This is 
strong evidence that some of the students in the class could tell bottled water from tap water. 


For Practice Try Exercise 


Although the histogram is slightly right- 
skewed (there’s a long tail that extends 
out to X = 21), it looks like a Normal 
density curve might fit the bulk of the 
distribution fairly well. 


Figure 6.15 shows the probability distribution for the number of correct guesses 
in Mr. Bullard’s class if no one can tell bottled water from tap water. As you can 
see from the graph, the chance of 13 or more guessing correctly is quite small. 
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FIGURE 6.15 Histogram of the probability distribution for the binomial random variable X = number 
of correct guesses in Mr. Bullard’s class. 


CHECK YOUR UNDERSTANDING 

Refer to the previous Check Your Understanding (page 397) about Mrs. Desai’s special 
multiple-choice quiz on binomial distributions. We defined X = the number of Patti’s 
correct guesses. 

1. Find jy. Interpret this value in context. 

2. Find ox. Interpret this value in context. 

3. What's the probability that the number of Patti’s correct guesses is more than 2 standard 
deviations above the mean? Show your method. 
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Binomial Distributions 
in Statistical Sampling 


The binomial distributions are important in statistics when we wish to make in- 
ferences about the proportion p of successes in a population. Here is an example 
involving a familiar product. 


Bad Flash Drives 


Binomial distributions and sampling 


A supplier inspects an SRS of 10 flash drives from a shipment of 10,000 flash 
drives. Suppose that (unknown to the supplier) 2% of the flash drives in the ship- 
ment are defective. Count the number X of bad flash drives in the sample. 


This is not quite a binomial setting. Removing | flash drive changes the proportion 
of bad flash drives remaining in the shipment. The conditional probability that 
the second flash drive chosen is bad changes when we know whether the first is 
good or bad. But removing | flash drive from a shipment of 10,000 changes the 
makeup of the remaining 9999 flash drives very little. The distribution of X is very 
close to the binomial distribution with n = 10 and p = 0.02. To illustrate this, let’s 
compute the probability that none of the 10 flash drives is defective. Using the 
binomial distribution, it’s 


P(X =0)= (7) )10.02)%0.98)" = 0.8171 


The actual probability of getting no defective flash drives is 
OS00 ee 70 108 979] 


i = x x eNO ee —a(()F 
P(no defectives) 10,000 *~ 9999 * 9998 9991 0.8170 


Those two probabilities are quite close! 


Almost all real-world sampling, such as taking an SRS from a population of in- 
terest, is done without replacement. As the previous example illustrates, sampling 
without replacement leads to a violation of the independence condition. 

The flash drives example shows how we can use binomial distributions in the 
statistical setting of selecting an SRS. When the population is much larger than 
the sample, a count of successes in an SRS of size n has approximately the bino- 
mial distribution with n equal to the sample size and p equal to the proportion 
of successes in the population. What counts as “much larger”? In practice, the 
binomial distribution gives a good approximation as long as we don’t sample more 
than 10% of the population. We refer to this as the 10% condition. 


10% CONDITION 
When taking an SRS of size n from a population of size N, we can use a 
binomial distribution to model the count of successes in the sample as long 


1 
BS 7) = == IN. 


10 


402 CHAPTER 6 RANDOM VARIABLES 


Here’s an example that shows why it’s important to check the 10% condition 
before calculating a binomial probability. You might recognize the setting from 
the first activity in the book (page 5). 


Hiring Discrimination—It Just 
Won't Fly! 


Sampling without replacement 


An airline has just finished training 25 first officers—15 male and 10 female— 

to become captains. Unfortunately, only eight captain positions are available 

right now. Airline managers decide to use a lottery to determine which pilots 

will fill the available positions. Of the 8 captains chosen, 5 are female and 3 are male. 


PROBLEM: Explain why the probability that 5 female pilots are chosen in a fair lottery is not 
2) 
AX= 5) = (J oaor%o.6o} = 0.124 


(The correct probability is 0.106.) 

SOLUTION: The managers are sampling without replacement when they do the lottery. There's a 
0.40 chance that the first pilot selected for a captain position is female. Once that person is cho- 
sen, the probability that the next pilot selected will be female is no longer 0.40. The binomial formula 
assumes that the conditional probability of success stays constant at 0.40 throughout the eight 
trials of this chance process. This calculation will be approximately correct if the success probability 
doesn’t change too much—as long as we don't sample more than 10% of the population. In this case, 
managers are sampling 8 out of 25 pilots—almost 1/3 of the population. That explains why the 
binomial probability is off by about 17% (0.018/0.106) from the correct answer. 


For Practice Try Exercise 


The Normal approximation to binomial distributions® As n gets 
larger, something interesting happens to the shape of the binomial distribution. 
Figure 6.16 shows histograms of binomial distributions for different values of 
and p. As the number of observations n becomes larger, the binomial distribution 
gets close to a Normal distribution. You can investigate the relationship between 
nand p yourself using the Normal Approximation to Binomial Distributions applet 
at the book’s Web site. 
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FIGURE 6.16 Histograms of binomial distributions with (a) n = 10 and p = 0.8, (b) n = 20 and p = 0.8, and (c) n= 50 and p= 0.8. 
As nincreases, the shape of the probability distribution gets more and more Normal. 


*This topic is not required for the AP® Statistics exam. Some teachers prefer to discuss this topic when 
presenting the sampling distribution of p (Chapter 7). 
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When n is large, we can use Normal probability calculations to approximate 
binomial probabilities. Here are the facts. 


NORMAL APPROXIMATION FOR BINOMIAL DISTRIBUTIONS: 


THE LARGE COUNTS CONDITION 


Suppose that a count X of successes has the binomial distribution with n trials 
and success probability p. When n is large, the distribution of X is approximately 
Normal with 


mean: lx = np and _ standard deviation: ox = Vnp(1 — p) 


As a tule of thumb, we will use the Normal approximation when n is so large 
that 


np = 10 aud 9s “(ll p) 2.110 


That is, the expected number of successes and failures are both at least 10. 
We refer to this as the Large Counts condition. 


The Normal approximation is easy to remember because it says to act as if X is 
Normal with exactly the same mean and standard deviation as the binomial. The 
accuracy of the Normal approximation improves as the sample size n increases. It 
is most accurate for any fixed n when f is close to 1/2 and least accurate when p 
is near 0 or 1. This is why the rule of thumb in the box depends on p as well as n. 


Attitudes toward Shopping 


Normal approximation to a binomial 


) Sample surveys show that fewer people enjoy shopping than in the past. A 
a survey asked a nationwide random sample of 2500 adults if they agreed or 
disagreed that “I like buying new clothes, but shopping is often frustrating 
and time-consuming,”! The population that the poll wants to draw conclu- 
sions about is all U.S. residents aged 18 and over. 


PROBLEM: Suppose that exactly 60% of all adult U.S. residents would say “Agree” if 
asked the same question. Let X = the number in the sample who agree. 


(a) Show that Xis approximately a binomial random variable. 


(b) Check the conditions for using a Normal approximation in this setting. 
(c) UseaNormal distribution to estimate the probability that 1520 or more of the sample agree. 


SOLUTION: 
(a) Let’s check the BINS. 
* Binary? Success = agree that shopping is frustrating, failure = don’t agree. 


* — Independent? The trials are not independent: the conditional probability of a success 
changes due to the sampling without replacement. But the 10% conditionis met because 2500 
people is much less than 10% of all U.S. adult residents. 


* Number? There are n = 2500 trials of this chance process. 
* Success? There is the same probability of selecting an adult who agrees on each trial: p = 0.6. 
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So the number in our sample who agree that shopping is frustrating is a random variable Xhaving 
roughly the binomial distribution with n = 2500 and p = 0.6. 


(b) Weneed to check the Large Counts condition. Because np = 2500(0.6) = 1500 and n(1 — p) = 
2500(0.4) = 1000 are both at least 10, we should be safe using the Normal approximation. 


(c) Step 1: State the distribution and the values of interest. Act. as though the count X 
has the Normal distribution with the same mean and standard deviation as the binomial distribution: 


[= np = 2500(0.6) = 1500 and o = Vap(1 — p) = V2500(0.6)(0.4) = 24.49 
We want to find P(X = 1520). Figure 6.17 shows this probability as the area under a Normal curve. 


Step 2: Perform calculations—show your work! Standardizing the 
boundary value gives 


W(1500,24.49) 


_ 1520 — 1500 
24.49 


= 0.82 


Using Table A, the probability we want is 
PA Z= 0.62) = 1 — 0.7939 = 0.2061 


4909 490 Using technology: The command normalcdf (lower:1520, upper: 
10000, :1500, 0:24.49) givesanarea of 0.2071. 


FIGURE 6.17 Normal distribution Step 3: Answer the question. There is about a 21% chance of getting a sample in which 1520 


to approximate the binomial or more agree with the statement. 
probability of getting 1520 or 


more successes when n = 2500 
and p = 0.6. For Practice Try Exercise 


We can also find the probability that 1520 or more of the sample agree that 
shopping is often frustrating and time-consuming using the command 1- 
binomedf (2500,0.6,1519), which yields 0.2131. The Normal approxima- 
tion, 0.2061, misses the more accurate binomial probability by about 0.007. 


Geometric Random Variables 


In a binomial setting, the number of trials 7 is fixed in advance, and the binomial ran- 
dom variable X counts the number of successes. The possible values of X are 0, 1, 2, 
...,n. In other situations, the goal is to repeat a chance process until a success occurs: 


e Roll a pair of dice until you get doubles. 
e In basketball, attempt a three-point shot until you make one. 
¢ Keep placing a $1 bet on the number 15 in roulette until you win. 


These are all examples of a geometric setting. Although the number of trials isn’t 
fixed in advance, the trials are independent and the probability of success remains 
constant. 


DT 


DEFINITION: Geometric setting 


A geometric setting arises when we perform independent trials of the same chance 
process and record the number of trials it takes to get one success. On each trial, the 
probability p of success must be the same. 


ACTIVITY 


MATERIALS: 


Calculator or computer ran- 
dom number generator to 
select student names and 
days of the week 


o. , 
4 e 
This is your lucky day! 


, Aarro a 


i 
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Here’s an Activity your class can try that involves a geometric setting. 


Is This Your Lucky Day? 


Your teacher is planning to give you 10 problems for homework. As an alternative, 
you can agree to play the Lucky Day Game. Here’s how it works. A student will 
be selected at random from your class and asked to pick a day of the week (for 
instance, Thursday). Then your teacher will use technology to randomly choose 
a day of the week as the “lucky day.” If the student picks the correct day, the class 
will have only one homework problem. If the student picks the wrong day, your 
teacher will select another student from the class at random. The chosen student 
will pick a day of the week and your teacher will use technology to choose a “lucky 
day.” If this student gets it right, the class will have two homework problems. The 
game continues until a student correctly guesses the lucky day. Your teacher will 
assign a number of homework problems that is equal to the total number of guesses 
made by members of your class. Are you ready to play the Lucky Day Game? 


1. Decide as a class about whether to “gamble” on the number of homework 
problems you will receive. You have 30 seconds. 


2. Play the Lucky Day Game and see what happens! 


In a geometric setting, if we define the random variable Y to be the number of 
trials needed to get the first success, then Y is called a geometric random variable. 
The probability distribution of Y is a geometric distribution. 


EE 


DEFINITION: Geometric random variable and geometric distribution 


The number of trials Y that it takes to get a success in a geometric setting is a 
geometric random variable. The probability distribution of Yis a geometric 
distribution with parameter p, the probability of a success on any trial. The possible 
values of Yare 1, 2,3,.... 


As with binomial random variables, it’s important to be able to distinguish situ- 
ations in which a geometric distribution does and doesn’t apply. 


The Lucky Day Game 


Geometric settings and random variables 


The random variable of interest in this game is Y = the number of picks it takes to 
correctly match the lucky day. Each pick is one trial of the chance process. Know- 
ing the result of one student’s pick tells us nothing about the result of any other 
pick. On each trial, the probability of a correct pick is 1/7. 


This is a geometric setting. Because Y counts the number of trials to get the first 
success, it is a geometric random variable with parameter p = 1/7. 
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What is the probability that the first student picks correctly and wins the Lucky 
Day Game? It’s P(Y = 1) = 1/7. That’s also the class’s chance of getting only one 
homework problem. For the class to have two homework problems, the first student 
selected must pick an incorrect day of the week and the second student must pick 
the lucky day correctly. The probability that this happens is P(Y = 2) = (6/7)(1/7) = 
0.1224. Likewise, P(Y = 3) = (6/7)(6/7)(1/7) = 0.1050. In general, the probability 
that the first correct pick comes on the kth trial is P(Y = k) = (6/7)*~ 1(1/7). Let's 
summarize what we’ve learned about calculating a geometric probability. 


GEOMETRIC PROBABILITY FORMULA | 


If Y has the geometric distribution with probability p of success on each trial, 
the possible values of Y are 1, 2, 3,.... If k is any one of these values, 


Pia bp) sap 


With our formula in hand, we can now compute any geometric probability. 


The Lucky Day Game 


Calculating geometric probabilities 

PROBLEM: Let the random variable Y be defined as in the previous example. 

(a) Find the probability that the class receives exactly 10 homework problems as a result of playing 
the Lucky Day Game. 

(b) Find P(Y< 10) and interpret this value in context. 

SOLUTION: Y =the number of attempts it takes to get a correct pick = the number of homework 
problems. 

(a) HY= 10) = (617) (1/7) = 0.0557. 

(b) AY<10) =AY=1)+ AY=2) + AY=3)+...+ AY=9) = 1/7 + (6/7)(1/7) + 
(6/7)2(117) +... + (6/7)°(1/7) = 0.7503. There’s about a 75% chance that the class will get 


less homework by playing the Lucky Day Game. 
: For Practice Try Exercise 


As you probably guessed, we used the calculator’s geomet pdf and geomet cdf 
commands for the computations in the previous example. The following Technology 
Corner shows you how we did it. 


TECHNOLOGY GEOMETRIC PROBABILITY ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


There are two handy commands on the TI-83/84 and TI-89 for finding geometric probabilities: geometpdf and 
geometcdf. The inputs for both commands are the success probability p and the value(s) of interest for the geometric 
random variable Y. 


geometpdf (p,k) computes P(Y = k) 
geometcdf£ (p,k) computes P(Y = k) 
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Let’s use these commands to confirm our answers in the previous example. 


(a) Find the probability that the class receives exactly 10 homework problems as a result of playing the Lucky Da 
Pp y y Pp playing y y 
Game. 


TI-83/84 TI-89 


Press| 2nd|| VARS (DISTR) and choose geomet pdf (. In the Stats/List Editor, Press (Distr) and choose 


OS 2.55 or later: In the dialog box, enter these values: Geometric Pdf.... 
p:1/7, xvalue:10, choose Paste, and then press ¢ Inthe dialog box, enterthese values: Prob Success, 


[ENTER] Older OS: Complete the command p:1/7, Xvalue:10, and then choose [ENTER |. 


geomet pdf (1/7, 10) and press [ENTER | 


Seometpdf (177,10) 
- 9356763859 


= Trae *~<“‘i‘i SPC 


=40 
=4e? 


1ist3=6666 
MAIN RAD AUTO 


These results agree with our previous answer using the geometric probability formula: 0.0357. 
(b) Find P(Y < 10) and interpret this value in context. ‘To find P(Y < 10), use the geomet cd£ command: 
P(Y < 10) = P(Y = 9) = geomet cdf (1/7, 9) 


Press|2nd|| VARS (DISTR) and choose geomet cdf£(. ¢ In the Stats/List Editor, Press |F5] (Distr) and choose 


OS 2.55 or later: In the dialog box, enter these values: Geometric Cdf.... 
p:1/7, x value:9,choose Paste,andthen press ¢ In the dialog box, enter these values: Prob 
ENTER|. Older OS: Complete the command geomet Success, p:1/7, Lower value:0, Upper 


cdf (1/7, 9) and press [ENTER |. value: 9, and then choose | ENTER |. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


geometcdf (1/7,9) 
- 7502652985 


1ist3=666 
MAIN Fab AUTO 


These results agree with our previous answer using the geometric probability formula: 0.7503. 


The table below shows part of the probability distribution of Y. We can’t show 
the entire distribution, because the number of trials it takes to get the first success 
could be a very large number. 


Value y;: 1 Z a +t 5 6 7 8 9 
Probability p;: 0.143 0.122 0.105 0.090 0.077 0.066 0.057 0.049 0. 042 


Figure 6.18 is a histogram of the probability distribution for values of Y from | 
to 26. Let’s describe what we see. 
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Shape: The heavily right-skewed shape is characteristic of any 
geometric distribution. That’s because the most likely value 
of a geometric random variable is 1. The probability of each 
successive value decreases by a factor of (1 — p). 


Center: The mean of Y is xy = 7. (Due to the infinite number 
of possible values of Y, the calculation of the mean is beyond 
the scope of this text.) If the class played the Lucky Day Game 
many times, they would receive an average of 7 homework 
problems. It’s no coincidence that p = 1/7 and py = 7. With 
probability of success 1/7 on each trial, we'd expect it to take an 
average of 7 trials to get the first success. 


Probability 


0 5 10 15 20 25 
Number of picks required (Y) 

FIGURE 6.18 Histogram showing 

the probability distribution of the Spread: The standard deviation of Y is cy = 6.48. (Due to the infinite number of 

geometric random variable Y = possible values of Y, the calculation of the standard deviation is beyond the scope 

number of trials needed for stu- of this text.) If the class played the Lucky Day game many times, the number of 


dents to pick correctly in the Lucky == homework problems they receive would typically differ from 7 by about 6.5 prob- 
Day Game. lems. That could mean a lot of homework! 


We can generalize the result for the mean of a geometric random variable. 


MEAN (EXPECTED VALUE) OF A GEOMETRIC RANDOM VARIABLE 


IfY is a geometric random variable with probability of success p on each 
l 
trial, then its mean (expected value) is uy = E(Y) = p That is, the expected 


number of trials required to get the first success is 1/p. 


CHECK YOUR UNDERSTANDING 


Suppose you roll a pair of fair, six-sided dice until you get doubles. Let T = the number 
of rolls it takes. 


1. Show that T is a geometric random variable. 
2. Find P(T = 3). Interpret this result in context. 


3. In the game of Monopoly, a player can get out of jail free by rolling doubles within 3 
turns. Find the probability that this happens. 


A Jury of Your Peers? 


In the chapter-opening Case Study on page 345, a defense attorney challenged 
the jury-pool selection process in his accused client’s trial. Here are the facts: 


e About 7.28% of the citizens in the court’s jurisdiction were black. 
e The jury pool had between 60 and 100 members, 3 of whom were black. 


Use what you have learned in this chapter to help answer the following questions. 
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For now, assume that the court carried out a proper random-selection pro- 
cess to obtain a jury pool with 100 members. 


Let X = the number of black citizens in the jury pool. What distri- 
bution does the random variable X have? Justify your answer. 
Find the mean and standard deviation of X. Interpret these values 
in context. 

If a jury pool has 3 or fewer blacks, should we be suspicious that 
the court did not carry out the random selection process cor- 
rectly? Compute P(X = 3) and use this result to support your 
answer. 


What if the jury pool had 60 members? Assume once again that the court 
carried out a proper random-selection process. Let Y = the number of black 
citizens in the jury pool. 


4. Without doing any calculations, decide if P(Y = 3) is greater than, 
equal to, or less than P(X < 3). Justify your answer. 

5. Using the logic of Question 4, explain why you do not have to 
consider jury pools with 61, 62, ..., 99 members to render a ver- 
dict about whether or not the jury-selection process was carried 
out properly. What is your verdict? 


Summary 


e A binomial setting consists of n independent trials of the same chance pro- 
cess, each resulting in a success or a failure, with probability of success p on 
each trial. Remember to check the BINS! The count X of successes is a bino- 
mial random variable. Its probability distribution is a binomial distribution. 


e ‘The binomial coefficient 


n n! 
(7) ~ k(n — B)! 


counts the number of ways k successes can be arranged among n trials. The 
factorial of n is 


ol = le = ING = Aye one 2 (QA) 


for positive whole numbers n, and 0! = 1. 

e If X has the binomial distribution with parameters n and p, the possible val- 
ues of X are the whole numbers 0, 1, 2,... , n. The binomial probability of 
observing k successes in 17 trials is 


n 


P(X = k) = (j))ota =e 


e Binomial probabilities are best found using technology. 


410 CHAPTER 6 RANDOM VARIABLES 


The mean and standard deviation of a binomial random variable X are 
bx=np ox = Vnp(1 — p) 

The binomial distribution with n trials and probability p of success gives a good 

approximation to the count of successes in an SRS of size n from a large popu- 


lation containing proportion p of successes. This is true as long as the sample 
size n is no more than 10% of the population size N (the 10% condition). 


The Normal approximation to the binomial distribution* says that if X is a 
count of successes having the binomial distribution with parameters n and 
p, then when n is large, X is approximately Normally distributed with mean 
np and standard deviation Vnp(1 — p). We will use this approximation when 
np = 10 and n(1 — p) = 10 (the Large Counts condition). 


A geometric setting consists of repeated trials of the same chance process in 
which the probability p of success is the same on each trial, and the goal is to 
count the number of trials it takes to get one success. If Y = the number of 
trials required to obtain the first success, then Y is a geometric random vari- 
able. Its probability distribution is called a geometric distribution. 


If Y has the geometric distribution with probability of success p, the possible 
values of Y are the positive integers 1, 2, 3,... . The geometric probability 
that Y takes any value is 


PGS bap) = ip 


The mean (expected value) of a geometric random variable Y is wy = 1/p. 


*This topic is not required for the AP® Statistics exam. Some teachers prefer to 
discuss this topic when presenting the sampling distribution of 6 (Chapter 7). 


6.3) TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


12. Binomial coefficients on the calculator 


13. Binomial probability on the calculator 


14. Geometric probability on the calculator 


Exercises 


In Exercises 69 to 72, determine whether the given random flower seeds from Seed Depot and plants them in her 
variable has a binomial distribution. Justify your answer. garden. Let X = the number of seeds that germinate. 
69. Sowing seeds Seed Depot advertises that its new 70. Long or short? Put the names of all the students 

i) 388 flower seeds have an 85% chance of germinating in your class in a hat. Mix them up, and draw four 


© 


(growing). Suppose that the company’s claim is true. names without looking. Let Y = the number whose 
Judy gets a packet with 20 randomly selected new last names have more than six letters. 


val 
388 


© 


Wits 


7B: 


The 


7. 
771393 


© 


76. 
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Lefties Exactly 10% of the students in a school are 
left-handed. Select students at random from the 
school, one at a time, until you find one who is 
left-handed. Let V = the number of students chosen. 


Taking the train According to New Jersey ‘Transit, 
the 8:00 A.M. weekday train from Princeton to New 
York City has a 90% chance of arriving on time on 

a randomly selected day. Suppose this claim is true. 
Choose 6 days at random. Let W = the number of 

days on which the train arrives late. 


Binomial setting? A binomial distribution will be 
approximately correct as a model for one of these two 
settings and not for the other. Explain why by briefly 
discussing both settings. 


When an opinion poll calls residential telephone 
numbers at random, only 20% of the calls reach a 
person. You watch the random digit dialing machine 
make 15 calls. X is the number that reach a person. 


When an opinion poll calls residential telephone 
numbers at random, only 20% of the calls reach 

a live person. You watch the random digit dialing 
machine make calls. Y is the number of calls needed 
to reach a live person. 


Binomial setting? A binomial distribution will be 
approximately correct as a model for one of these two 
sports settings and not for the other. Explain why by 
briefly discussing both settings. 


A National Football League kicker has made 80% 

of his field goal attempts in the past. This season he 
attempts 20 field goals. The attempts differ widely in 
distance, angle, wind, and so on. 


A National Basketball Association player has made 
80% of his free-throw attempts in the past. This sea- 
son he takes 150 free throws. Basketball free throws 
are always attempted from 15 feet away with no 
interference from other players. 


Elk Biologists estimate that a baby elk has a 44% 
chance of surviving to adulthood. Assume this esti- 
mate is correct. Suppose researchers choose 7 baby 
elk at random to monitor. Let X = the number who 
survive to adulthood. Use the binomial probability for- 
mula to find P(X = 4). Interpret this result in context. 


Rhubarb Suppose you purchase a bundle of 10 bare- 
root rhubarb plants. The sales clerk tells you that 5% 
of these plants will die before producing any rhubarb. 
Assume that the bundle is a random sample of plants 
and that the sales clerk’s statement is accurate. Let Y = 
the number of plants that die before producing any 
thubarb. Use the binomial probability formula to find 
P(Y = 1). Interpret this result in context. 


Tiles 
71393 


© 


78. 


79. 


ro 
cf) 


(b) 
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Elk Refer to Exercise 75. How surprising would it 
be for more than 4 elk in the sample to survive to 
adulthood? Calculate an appropriate probability to 
support your answer. 


Rhubarb Refer to Exercise 76. Would you be 
surprised if 3 or more of the plants in the bundle die 
before producing any rhubarb? Calculate an appro- 
priate probability to support your answer. 


Sowing seeds Refer to Exercise 69. 


Find the probability that exactly 17 seeds germinate. 
Show your work. 


If only 12 seeds actually germinate, should Judy 
be suspicious that the company’s claim is not true? 
Compute P(X = 12) and use this result to support 
your answer. 


. Taking the train Refer to Exercise 72. 


Find the probability that the train arrives late on 
exactly 2 days. Show your work. 


Would you be surprised if the train arrived late on 
2 or more days? Compute P(W = 2) and use this 
result to support your answer. 


Random digit dialing When an opinion poll calls 

a residential telephone number at random, there 

is only a 20% chance that the call reaches a live 
person. You watch the random digit dialing machine 
make 15 calls. Let X = the number of calls that 
reach a live person. 


Find and interpret jy. 


Find and interpret ox. 


Lie detectors A federal report finds that lie detector 
tests given to truthful persons have probability about 
0.2 of suggesting that the person is deceptive.!? A 
company asks 12 job applicants about thefts from 
previous employers, using a lie detector to assess 
their truthfulness. Suppose that all 12 answer truth- 
fully. Let X = the number of people who the lie 
detector says are being deceptive. 


Find and interpret jy. 


Find and interpret ox. 


Random digit dialing Refer to Exercise 81. Let 
Y = the number of calls that don’t reach a live 
person. 


Find the mean of Y. How is it related to the mean 
of X? Explain why this makes sense. 


Find the standard deviation of Y. How is it related to the 
standard deviation of X? Explain why this makes sense. 
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Lie detectors Refer to Exercise 82. Let Y = the 
number of people who the lie detector says are tell- 
ing the truth. 


Find P(Y = 10). How is this related to P(X = 2)? Explain. 


Calculate uy and oy. How do they compare with jux 
and ox? Explain why this makes sense. 


1 in 6 wins As a special promotion for its 20-ounce 
bottles of soda, a soft drink company printed a mes- 
sage on the inside of each cap. Some of the caps 
said, “Please try again,” while others said, “You’re 

a winner!” ‘The company advertised the promotion 
with the slogan “1 in 6 wins a prize.” Suppose the 
company is telling the truth and that every 20-ounce 
bottle of soda it fills has a 1-in-6 chance of being a 
winner. Seven friends each buy one 20-ounce bottle 
of the soda at a local convenience store. Let X = the 
number who win a prize. 


Explain why X is a binomial random variable. 


Find the mean and standard deviation of X. Interpret 
each value in context. 


The store clerk is surprised when three of the friends 
win a prize. Is this group of friends just lucky, or is the 
company’s l-in-6 claim inaccurate? Compute P(X = 3) 
and use the result to justify your answer. 


Aircraft engines Engineers define reliability as the 
probability that an item will perform its function un- 
der specific conditions for a specific period of time. 
A certain model of aircraft engine is designed so that 
each engine has probability 0.999 of performing 
properly for an hour of flight. Company engineers 
test an SRS of 350 engines of this model. Let X = 
the number that operate for an hour without failure. 


Explain why X is a binomial random variable. 


Find the mean and standard deviation of X. Interpret 
each value in context. 


‘Two engines failed the test. Are you convinced that 
this model of engine is less reliable than it’s supposed 
to be? Compute P(X = 348) and use the result to 
justify your answer. 


Airport security ‘The ‘Transportation Security Admini- 
stration (‘T’SA) is responsible for airport safety. On some 
flights, TSA officers randomly select passengers for an 
extra security check before boarding. One such flight 
had 76 passengers— 12 in first class and 64 in coach 
class. Some passengers were surprised when none of 
the 10 passengers chosen for screening were seated in 
first class. Can we use a binomial distribution to ap- 
proximate this probability? Justify your answer. 
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*This topic is not required for the AP® Statistics exam. Some teachers prefer to 


discuss this topic when presenting the sampling distribution of p (Chapter 7). 


Scrabble In the game of Scrabble, each player 
begins by drawing 7 tiles from a bag containing 100 
tiles. There are 42 vowels, 56 consonants, and 2 
blank tiles in the bag. Cait chooses her 7 tiles and is 
surprised to discover that all of them are vowels. Can 
we use a binomial distribution to approximate this 
probability? Justify your answer. 


10% condition ‘To use a binomial distribution to 
approximate the count of successes in an SRS, why 
do we require that the sample size n be no more than 
10% of the population size N? 


“Large Counts condition To use a Normal distribu- 
tion to approximate binomial probabilities, why do 
we require that both np and n(1 — p) be at least 10? 


*On the Web What kinds of Web sites do males 
aged 18 to 34 visit most often? Half of male Internet 
users in this age group visit an auction site such as 
eBay at least once a month.|’ A study of Internet use 
interviews a random sample of 500 men aged 18 to 
34. Let X = the number in the sample who visit an 
auction site at least once a month. 


Show that X is approximately a binomial random 
variable. 


Check the conditions for using a Normal approxima- 
tion in this setting. 


Use a Normal distribution to estimate the probability 
that at least 235 of the men in the sample visit an 
online auction site at least once a month. 


*Checking for survey errors One way of checking 
the effect of undercoverage, nonresponse, and other 
sources of error in a sample survey is to compare the 
sample with known facts about the population. About 
12% of American adults identify themselves as black. 
Suppose we take an SRS of 1500 American adults 
and let X be the number of blacks in the sample. 


Show that X is approximately a binomial random 
variable. 


Check the conditions for using a Normal approxima- 
tion in this setting. 


Use a Normal distribution to estimate the probability 
that the sample will contain between 165 and 195 
blacks. 


Using Benford’s law According to Benford’s law 
(Exercise 5, page 359), the probability that the first 
digit of the amount on a randomly chosen invoice 
isa | ora 2 is 0.477. Suppose you examine an SRS 
of 90 invoices from a vendor and find 29 that have 
first digits 1 or 2. Do you suspect that the invoice 


Ot 


Ob: 


96. 


oF. 

pg 
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(b) 


98. 
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amounts are not genuine? Compute an appropriate 
probability to support your answer. 


A .300 hitter In baseball, a 0.300 hitter gets a hit in 
30% of times at bat. When a baseball player hits 0.300, 
fans tend to be impressed. Typical Major Leaguers bat 
about 500 times a season and hit about 0.260. A hit- 
ter’s successive tries seem to be independent. Could 

a typical Major Leaguer hit 0.300 just by chance? 
Compute an appropriate probability to support your 
answer. 


Geometric or not? Determine whether each of the 
following scenarios describes a geometric setting. If 
so, define an appropriate geometric random variable. 


A popular brand of cereal puts a card with | of 5 
famous NASCAR drivers in each box. There is a 1/5 
chance that any particular driver’s card ends up in 
any box of cereal. Buy boxes of the cereal until you 
have all 5 drivers’ cards. 


In a game of 4-Spot Keno, Lola picks 4 numbers 
from | to 80. The casino randomly selects 20 win- 
ning numbers from | to 80. Lola wins money if 

she picks 2 or more of the winning numbers. ‘The 
probability that this happens is 0.259. Lola decides 
to keep playing games of 4-Spot Keno until she wins 
some money. 


Geometric or not? Determine whether each of the 
following scenarios describes a geometric setting. If 
so, define an appropriate geometric random variable. 


Shuffle a standard deck of playing cards well. Then 
turn over one card at a time from the top of the deck 
until you get an ace. 


Lawrence likes to shoot a bow and arrow in his free 
time. On any shot, he has about a 10% chance of hit- 
ting the bull’s-eye. As a challenge one day, Lawrence 
decides to keep shooting until he gets a bull’s-eye. 


1-in-6 wins Alan decides to use a different strategy 
for the l-in-6 wins game of Exercise 85. He keeps 
buying one 20-ounce bottle of the soda at a time 
until he gets a winner. 


Find the probability that he buys exactly 5 bottles. 
Show your work. 


Find the probability that he buys no more than 
8 bottles. Show your work. 


Cranky mower To start her old lawn mower, Rita has 
to pull a cord and hope for some luck. On any par- 
ticular pull, the mower has a 20% chance of starting. 


Find the probability that it takes her exactly 3 pulls 
to start the mower. Show your work. 


Find the probability that it takes her more than 10 
pulls to start the mower. Show your work. 


99! 
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Using Benford’s law According to Benford’s law 
(Exercise 5, page 359), the probability that the first 
digit of the amount of a randomly chosen invoice is 
an 8 ora 9 is 0.097. Suppose you examine randomly 
selected invoices from a vendor until you find one 
whose amount begins with an 8 ora 9. 


How many invoices do you expect to examine until 
you get one that begins with an 8 or 9? Justify your 
answer. 


In fact, you don’t get an amount starting with an 

8 or 9 until the 40th invoice. Do you suspect that 
the invoice amounts are not genuine? Compute an 
appropriate probability to support your answer. 


Roulette Marti decides to keep placing a $1 bet on 
number 15 in consecutive spins of a roulette wheel 
until she wins. On any spin, there’s a 1-in-38 chance 
that the ball will land in the 15 slot. 


How many spins do you expect it to take until Marti 
wins? Justify your answer. 


Would you be surprised if Marti won in 3 or fewer 
spins? Compute an appropriate probability to sup- 
port your answer. 


Multiple choice: Select the best answer for Exercises 
101 to 105. 


101. 


Joe reads that | out of 4 eggs contains salmonella bac- 
teria. So he never uses more than 3 eggs in cooking. 
If eggs do or don’t contain salmonella independently 
of each other, the number of contaminated eggs 
when Joe uses 3 chosen at random has the following 
distribution: 


binomial; n = 4 and p = 14 
binomial; n = 3 and p = 1A 
binomial; n = 3 and p = 1/3 
geometric; p = 1/4 
geometric; p = 1/3 


Exercises 102 and 103 refer to the following setting. A fast- 
food restaurant runs a promotion in which certain food 
items come with game pieces. According to the restaurant, 
1 in 4 game pieces is a winner. 


102. 


(a) 
(b) 
(c) 


If Jeff gets + game pieces, what is the probability that 
he wins exactly | prize? 


0.25 (d) (F025 710.75) 
1.00 (e) (0.75)3(0.25)! 
(70251075) 
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103. If Jeff keeps playing until he wins a prize, what is the 
probability that he has to play the game exactly 5 times? 


(a) (0.25) (d) (0.75)4(0.25) 
(b) (0.75)4 (e) (7)0.75)%0.25) 
(c) (0.75) 


104. Each entry in a table of random digits like Table D 
has probability 0.1 of being a 0, and the digits are 
independent of one another. If many lines of 40 
random digits are selected, the mean and standard 
deviation of the number of 0s will be approximately 


a) mean = 0.1, standard deviation = 0.05. 
b) mean = 0.1, standard deviation = 0.1. 


( 
( 
(c) mean = 4, standard deviation = 0.05. 
(d) mean = 4, standard deviation = 1.90. 
( 


e) mean = 4, standard deviation = 3.60. 


105. *In which of the following situations would it be ap- 
propriate to use a Normal distribution to approximate 
probabilities for a binomial distribution with the 
given values of n and p? 


(a) n=10,p=0.5 

(b) n= 40, p = 0.88 

(c) n=100,p=0.2 
(d) n= 100,p = 0.99 
(ec) n= 1000, p = 0.003 


*This topic is not required for the AP® Statistics exam. Some teachers prefer to 
discuss this topic when presenting the sampling distribution of 6 (Chapter 7). 
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106. Spoofing (4.2) To collect information such as pass- 
_ words, online criminals use “spoofing” to direct In- 

ternet users to fraudulent Web sites. In one study of 
Internet fraud, students were warned about spoofing 
and then asked to log in to their university account 
starting from the university’s home page. In some 
cases, the login link led to the genuine dialog box. 
In others, the box looked genuine but in fact was 
linked to a different site that recorded the ID and 
password the student entered. The box that appeared 
for each student was determined at random. An alert 
student could detect the fraud by looking at the true 
Internet address displayed in the browser status bar, 
but most just entered their ID and password. Is this 
study an experiment? Why? What are the explana- 
tory and response variables? 


107. Smoking and social class (5.3) As the dangers of 


=> smoking have become more widely known, clear 
© class differences in smoking have emerged. British 


government statistics classify adult men by oc- 
cupation as “managerial and professional” (43% of 
the population), “intermediate” (34%), or “routine 
and manual” (23%). A survey finds that 20% of 
men in managerial and professional occupations 
smoke, 29% of the intermediate group smoke, and 
38% in routine and manual occupations smoke. !* 


(a) Use a tree diagram to find the percent of all adult 
British men who smoke. 


(b) Find the percent of male smokers who have routine 
and manual occupations. 


Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


Buckley Farms produces homemade potato chips that it 
sells in bags labeled “16 ounces.” The total weight of each bag 
follows an approximately Normal distribution with a mean of 
16.15 ounces and a standard deviation of 0.12 ounces. 
(a) Ifyou randomly selected | bag of these chips, what 
is the probability that the total weight is less than 
16 ounces? 

(b) If you randomly selected 10 bags of these chips, 
what is the probability that exactly 2 of the bags will 
have a total weight less than 16 ounces? 


(c) Buckley Farms ships its chips in boxes that contain 
6 bags. ‘The empty boxes have a mean weight of 10 
ounces and a standard deviation of 0.05 ounces. 
Calculate the mean and standard deviation of the 
total weight of a box containing 6 bags of chips. 
Buckley Farms decides to increase the mean weight 
of each bag of chips so that only 5% of the bags have 
weights that are less than 16 ounces. Assuming that 
the standard deviation remains 0.12 ounces, what 
mean weight should Buckley Farms use? 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you think 
each solution is “complete,” “substantial,” “developing,” or “mini- 
mal.” If the solution is not complete, what improvements would you 
suggest to the student who wrote it? Finally, your teacher will provide 
you with a scoring rubric. Score your response and note what, if any- 
thing, you would do differently to improve your own score. 


Chapter Review 


Section 6.1: Discrete and Continuous Random Variables 


A random variable assigns numerical values to the outcomes 
of a chance process. ‘The probability distribution of a ran- 
dom variable describes its possible values and their probabil- 
ities. There are two types of random variables: discrete and 
continuous. Discrete random variables take on a fixed set of 
values with gaps in between. Continuous random variables 
take on all values in an interval of numbers. 

As in Chapter 1, we are often interested in the shape, 
center, and spread of a probability distribution. The shape 
of a discrete probability distribution can be identified by 
graphing a probability histogram, with the height of each 
bar representing the probability of a single value. The cen- 
ter is usually identified by the mean (expected value) of the 
random variable. ‘The expected value is the average value of 
the random variable if the chance process is repeated many 
times. The spread of a probability distribution is usually 
identified by the standard deviation, which describes how 
much the values of a random variable typically differ from 
the mean value, in many repetitions of the chance process. 

Continuous probability distributions, such as the Nor- 
mal distribution, describe the distribution of continuous 
random variables. A density curve is used to display a con- 
tinuous probability distribution. Probabilities for continu- 
ous random variables are determined by finding the area 
under the density curve and above the values of interest. 


Section 6.2: Transforming and Combining Random Variables 


In this section, you learned how linear transformations of a 
random variable affect the shape, center, and spread of its 
probability distribution. As you learned in Chapter 2, a lin- 
ear transformation does not change the shape (unless you 
multiply by a negative number) but can change the center 
and spread depending on the type of transformation. Multi- 
plying (or dividing) each value of the random variable by a 
positive constant b multiplies (divides) the mean and stan- 
dard deviation by b. Adding a constant a to (subtracting a 
from) each value of the random variable adds a to (subtracts 
a from) the mean but doesn’t change the standard deviation. 

You also learned how to calculate the mean and stan- 
dard deviation for a combination of two or more random 
variables. If you are adding two random variables, X and Y, 
the mean and standard deviation of X + Y are 


Lix+y = pix + py and oxsy = Vox + oF 


Likewise, if you are subtracting two random variables, X and Y, 
the mean and standard deviation of X — Y are 


bx-y = fix — fly and ox-y = Vox + oF 


The formulas for the standard deviation of X + Y and 
X-—Y are only correct if X and Y are independent, that is, if 
knowing the value of one variable doesn’t provide any ad- 
ditional information about the other variable. Also, if X and 


Y are both Normally distributed, then X + Y and X- Y are 
both Normally distributed as well. 

To determine which formulas to use for a particular prob- 
lem, it is important to be able to distinguish linear trans- 
formations and combinations of random variables. Linear 
transformations take the values of one random variable and 
add, subtract, multiply, or divide them by a constant. Com- 
binations of random variables take two or more random vari- 
ables and add or subtract them. When a problem involves 
both linear transformations and a combination of random 
variables, remember to do the linear transformations first. 


Section 6.3: Binomial and Geometric Random Variables 


In this section, you learned about two common types of 
discrete random variables, binomial random variables and 
geometric random variables. Binomial random variables 
count the number of successes in a fixed number of trials (7), 
whereas geometric random variables count the number of 
trials needed to get one success. Otherwise, the binomial 
and geometric settings have the same conditions: there must 
be two possible outcomes for each trial (success or failure), 
the trials must be independent, and the probability of suc- 
cess p must stay the same throughout all trials. 

To calculate probabilities for a binomial distribution 
with 7 trials and probability of success p on each trial, use 
technology or the binomial probability formula 


P(X =k) = @it = 


The mean and standard deviation of a binomial random 
variable X are 
bx =np and ox = Vnp(1 — p) 

The shape of a binomial distribution depends on both 
the number of trials n and the probability of success p. 
When the number of trials is large enough that both np 
and n(1 — fp) are at least 10, the distribution of the binomial 
random variable X has an approximately Normal distribu- 
tion. Be sure to check the large counts condition before 
using a Normal approximation to a binomial distribution. 

A common application of the binomial distribution is 
when we count the number of times a particular outcome 
occurs in a random sample from some population. Because 
sampling is almost always done without replacement, the 
independence condition is violated. However, if the sam- 
ple size is a small fraction of the population size (less than 
10%), the lack of independence isn’t a concern. Be sure to 
check the 10% condition when sampling is done without 
replacement before using a binomial distribution. 

Finally, to calculate probabilities for a geometric distri- 
bution with probability of success p on each trial, use tech- 
nology or the geometric probability formula 


PY =k) =(1—p)t'p 
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Learning Objective Section Related Example Relevant Chapter 
on Page(s) Review Exercise(s) 


Compute probabilities using the probability distribution of a discrete 
random variable. : 349 R6.1 


Calculate and interpret the mean (expected value) of a discrete 
random variable. ; 350, 352 R6.1, R6.3 


Calculate and interpret the standard deviation of a discrete random 
variable. ; oes} R6.1, R6.3 


Compute probabilities using the probability distribution 
of certain continuous random variables. : Ge), Clo R6.4 


Describe the effects of transforming a random variable by adding or 
subtracting a constant and multiplying or dividing by a constant. ‘ 365, 366, 368 R6.2, R6.3 


Find the mean and standard deviation of the sum or difference 
of independent random variables. : SWZ, BUS, SAL, SIT R6.3, R6.4 


Find probabilities involving the sum or difference of independent 
Normal random variables. : 380, 381 


Determine whether the conditions for using a binomial random 
variable are met. : 388 


Compute and interpret probabilities involving binomial distributions. : 390, 393, 396 


Calculate the mean and standard deviation of a binomial random 
variable. Interpret these values in context. : 399 


Find probabilities involving geometric random variables. : 406 


*When appropriate, use the Normal approximation to the binomial 
distribution to calculate probabilities. i 403 


*This topic is not required for the AP® Statistics exam. 


Chapter 6 Chapter Review Exercises 
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These exercises are designed to help you review the impor- (a) Find P(X = 5). 
tant ideas and methods of the chapter. (b) Is pain score a discrete or continuous random vari- 
R6.1 Knees Patients receiving artificial knees often able? Explain. 
experience pain after surgery. The pain is measured (c) Find P(X S 2). Is this the same as P(X < 2)? Explain. 
on a subjective scale with possible values of 1 (low) (d) Compute the expected pain score and the standard 
to 5 (high). Let X be the pain score for a randomly deviation of the pain scores. 
selected patient. The following table gives part of the R6.2 A glass act Ina process for manufacturing glassware, 
probability distribution for X. glass stems are sealed by heating them in a flame. 


Let X be the temperature (in degrees Celsius) for 
a randomly chosen glass. The mean and standard 
deviation of X are pry = 550°C and ox = 5.7°C. 


Value: 1 2 3 4 5 
Probability: 0.1 0.2 0.3 0.3 2? 


(a) Is temperature a discrete or continuous random 
variable? Explain. 

(b) How is P(X < 540) related to P(X = 540)? Explain. 

(c) The target temperature is 550°C. What are the 
mean and standard deviation of the number of 
degrees off target, D = X — 550? 

(d) A manager asks for results in degrees Fahrenheit. The 
conversion of X into degrees Fahrenheit is given by 


9 
y= 5 X + 32. What are the mean py and the standard 


deviation oy of the temperature of the flame in the 
Fahrenheit scale? 


R6.3 Keno Ina game of 4-Spot Keno, the player picks 4 
numbers from | to 80. The casino randomly selects 
20 winning numbers from | to 80. The table below 
shows the possible outcomes of the game and their 
probabilities, along with the amount of money 
(Payout) that the player wins for a $1 bet. If X = the 
payout for a single $1 bet, you can check that py = 
$0.70 and ox = $6.58. 


Matches: 0 1 2 3 4 
Payout x;: $0 $0 $1 $3 $120 
Probability p;: 0.308 0.433 0.213 0.043 0.003 


(a) Interpret the values of jz and oy in context. 

(b) Jerry places a single $5 bet on 4-Spot Keno. Find 
the expected value and the standard deviation of his 
winnings. 

(c) Marla plays five games of 4-Spot Keno, betting $1 
each time. Find the expected value and the standard 
deviation of her total winnings. 

(d) Based on your answers to (b) and (c), which player 
would the casino prefer? Justify your answer. 


R6.4 Applying torque A machine fastens plastic 
screw-on caps onto containers of motor oil. If the 
machine applies more torque than the cap can 
withstand, the cap will break. Both the torque 
applied and the strength of the caps vary. The 
capping-machine torque T follows a Normal dis- 
tribution with mean 7 inch-pounds and standard 
deviation 0.9 inch-pounds. The cap strength C 
(the torque that would break the cap) follows a 
Normal distribution with mean 10 inch-pounds 
and standard deviation 1.2 inch-pounds. 


(a) What is the probability that a randomly selected cap 
has a strength greater than 11] inch-pounds? 

(b) Explain why it is reasonable to assume that the cap 
strength and the torque applied by the machine are 
independent. 


*This topic is not required for the AP® Statistics exam. Some teachers prefer to 
discuss this topic when presenting the sampling distribution of 6 (Chapter 7). 
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(c) Let the random variable D = C — T. Find its mean 
and standard deviation. 

(d) What is the probability that a randomly selected cap 
will break while being fastened by the machine? 
Show your work. 


Exercises R6.5 and R6.6 refer to the following setting. 
According to Mars, Incorporated, 20% of its plan M&M’S 
candies are orange. Assume that the company’s claim 

is true. Suppose that you reach into a large bag of plain 
M&MV’S (without looking) and pull out 8 candies. Let 

X = the number of orange candies you get. 


R6.5 Orange M&M’S 


(a) Explain why it is reasonable to use the binomial 
distribution for probability calculations involving X. 

(b) Find and interpret the expected value of X. 

(c) Find and interpret the standard deviation of X. 


R6.6 Orange M&M’S 


(a) Would you be surprised if none of the candies were 
orange? Compute an appropriate probability to sup- 
port your answer. 

(b) How surprising would it be to get 5 or more orange 
candies? Compute an appropriate probability to 
support your answer. 


R6.7 Sushi Roulette In the Japanese game show Sushi 
Roulette, the contestant spins a large wheel that’s di- 
vided into 12 equal sections. Nine of the sections have 
a sushi roll, and three have a “wasabi bomb.” When 
the wheel stops, the contestant must eat whatever food 
is on that section. ‘To win the game, the contestant 
must eat one wasabi bomb. Find the probability that it 
takes 3 or fewer spins for the contestant to get a wasabi 
bomb. Show your method clearly. 


R6.8* Is this coin balanced? While he was a prisoner 
of war during World War IL, John Kerrich tossed a 
coin 10,000 times. He got 5067 heads. If the coin is 
perfectly balanced, the probability of a head is 0.5. 


(a) Find the mean and the standard deviation of the 
number of heads in 10,000 tosses, assuming the 
coin is perfectly balanced. 

(b) Explain why the Normal approximation is appro- 
priate for calculating probabilities involving the 
number of heads in 10,000 tosses. 

(c) Is there reason to think that Kerrich’s coin was not 
balanced? 'To answer this question, use a Normal 
distribution to estimate the probability that tossing 
a balanced coin 10,000 times would give a count 
of heads at least this far from 5000 (that is, at least 
5067 heads or at most 4933 heads). 
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Chapter 6 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


Questions T6.1 to T6.3 refer to the following setting. A 
psychologist studied the number of puzzles that subjects 
were able to solve in a five-minute period while listening to 
soothing music. Let X be the number of puzzles completed 
successfully by a randomly chosen subject. The psychologist 
found that X had the following probability distribution: 


Value: 1 2 3 4 
Probability: 0.2 0.4 0.3 0.1 


T6.1 What is the probability that a randomly chosen 
subject completes more than the expected number of 
puzzles in the five-minute period while listening to 
soothing music? 


(a 


0.1 
(b) 0.4 
0.8 
l 


(eu, 


( 
( 
(e) Cannot be determined 


T6.2 The standard deviation of X is 0.9. Which of the fol- 
lowing is the best interpretation of this value? 
(a) About 90% of subjects solved 3 or fewer puzzles. 


(b) About 68% of subjects solved between 0.9 puzzles 
less and 0.9 puzzles more than the mean. 


(c) The typical subject solved an average of 0.9 puzzles. 


(d) The number of puzzles solved by subjects typically 
differed from the mean by about 0.9 puzzles. 


(e) The number of puzzles solved by subjects typically 
differed from one another by about 0.9 puzzles. 


T6.3 Let D be the difference in the number of puzzles 
solved by two randomly selected subjects in a five- 
minute period. What is the standard deviation of D? 


(a)0 (b)081 (09 (127 (18 


T6.4 Suppose a student is randomly selected from your 
school. Which of the following pairs of random vari- 
ables are most likely independent? 

(a) X = student’s height; Y = student’s weight 

(b) X = student’s IO; Y = student’s GPA 

(c) X = student’s PSAT’ Math score; Y = student’s PSAT’ 
Verbal score 


(d) X = average amount of homework the student does 
8g 
per night; Y = student’s GPA 


(e) X = average amount of homework the student does 
g 
per night; Y = student’s height 


T6.5 Acertain vending machine offers 20-ounce bottles of 
soda for $1.50. The number of bottles X bought from 
the machine on any day is a random variable with 
mean 50 and standard deviation 15. Let the random 
variable Y equal the total revenue from this machine on 
a given day. Assume that the machine works properly 
and that no sodas are stolen from the machine. What 
are the mean and standard deviation of Y? 


(a) py = $1.50, oy = $22.50 
(Oe see 
) 


(c) py = $75, oy = $18.37 
(d) [ike = $75, Oye = $22.50 
(e) py = $75, oy = $33.75 


T6.6 The weight of tomatoes chosen at random from a 
bin at the farmer’s market follows a Normal distribu- 
tion with mean yz = 10 ounces and standard devia- 
tion og = | ounce. Suppose we pick four tomatoes at 
random from the bin and find their total weight T. 
The random variable T is 


(a) Normal, with mean 10 ounces and standard devia- 
tion | ounce. 

(b) Normal, with mean 40 ounces and standard devia- 
tion 2 ounces. 

(c) Normal, with mean 40 ounces and standard devia- 
tion 4 ounces. 

(d) binomial, with mean 40 ounces and standard devia- 
tion 2 ounces. 

(e) binomial, with mean 40 ounces and standard devia- 
tion 4 ounces. 


T6.7 Which of the following random variables is geometric? 


(a) The number of times I have to roll a die to get two 6s. 

(b) ‘The number of cards I deal from a well-shuffled deck 
of 52 cards until I get a heart. 

(c) The number of digits I read in a randomly selected 
row of the random digits table until I find a 7. 

(d) The number of 7s in a row of 40 random digits. 

(e) The number of 6s I get if ] roll a die 10 times. 


T6.8 Seventeen people have been exposed to a particular 
disease. Each one independently has a 40% chance 
of contracting the disease. A hospital has the capacity 
to handle 10 cases of the disease. What is the prob- 
ability that the hospital’s capacity will be exceeded? 


(a) 0.011 (d) 0.965 
(b) 0.035 (e) 0.989 
(c) 0.092 


T6.9 The figure shows the probability distribution of a 
discrete random variable X. Note that the cursor is on 
the histogram bar representing a value of 6. Which of 
the following best describes this random variable? 


NORMAL FLOAT AUTO REAL RADIAN CL f 


Ploti:LasL2 


min=6 
max<? n=.01000188 


(a) Binomial with n = 8, p = 0.1 
(b) Binomial with n = 8, p = 0.3 
(c) Binomial with n = 8, p = 0.8 
(d) Geometric with p = 0.1 
(e) Geometric with p = 0.2 


AP® Statistics Practice Test Cin 9 


T6.10 A test for extrasensory perception (ESP) involves 

asking a person to tell which of 5 shapes—a circle, 
star, triangle, diamond, or heart— appears on a 
hidden computer screen. On each trial, the com- 
puter is equally likely to select any of the 5 shapes. 
Suppose researchers are testing a person who does 
not have ESP and so is just guessing on each trial. 
What is the probability that the person guesses the 
first + shapes incorrectly but gets the fifth correct? 


(a) 1/5 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T6.11 Let Y denote the number of broken eggs in a randomly 
selected carton of one dozen “store brand” eggs at a local 
supermarket. Suppose that the probability distribution of 
Y is as follows. 


Value y;: 0 1 7 3 4 
Probability p;: 0.78 0.11 0.07 0.03 0.01 


(a) What is the probability that at least 10 eggs in a 
randomly selected carton are unbroken? 

(b) Calculate and interpret jy. 

(c) Calculate and interpret oy. Show your work. 

(d) A quality control inspector at the store keeps 
looking at randomly selected cartons of eggs until 
he finds one with at least 2 broken eggs. Find the 
probability that this happens in one of the first three 
cartons he inspects. 


T6.12 Ladies Home Journal magazine reported that 66% 
of all dog owners greet their dog before greeting 
their spouse or children when they return home 
at the end of the workday. Assume that this claim 
is true. Suppose 12 dog owners are selected at 
random. Let X = the number of owners who greet 
their dogs first. 


—a 
~ 
ma 


Explain why it is reasonable to use the binomial 
distribution for probability calculations involving X. 


& 


Only 4 of the owners in the random sample greeted 
their dogs first. Does this give convincing evidence 
against the Ladies Home Journal claim? Calculate 
an appropriate probability to support your answer. 


T6.13 Ed and Adelaide attend the same high school, 
but are in different math classes. ‘The time E that 
it takes Ed to do his math homework follows a 
Normal distribution with mean 25 minutes and 
standard deviation 5 minutes. Adelaide’s math 
homework time A follows a Normal distribution 
with mean 50 minutes and standard deviation 
10 minutes. Assume that EF; and A are independent 
random variables. 


—a 
o 
a 


Randomly select one math assignment of Ed’s and 
one math assignment of Adelaide’s. Let the ran- 
dom variable D be the difference in the amount 
of time each student spent on their assignments: 
D =A — E. Find the mean and the standard 
deviation of D. Show your work. 

Find the probability that Ed spent longer on his 
assignment than Adelaide did on hers. Show your 
work. 

T6.14 According to the Census Bureau, 13% of American 
adults (aged 18 and over) are Hispanic. An opinion 
poll plans to contact an SRS of 1200 adults. 


(b 


ed 


(a) What is the mean number of Hispanics in such 
samples? What is the standard deviation? 


(b) Should we be suspicious if the sample selected for 
the opinion poll contains 15% Hispanic people? 
Compute an appropriate probability to support your 
answer. 
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Building Better Batteries 


Everyone wants to have the latest technological gadget. That’s why iPods, digital cameras, smartphones, 
Game Boys, and the Wii have sold millions of units. These devices require lots of power and can drain 
batteries quickly. Battery manufacturers are constantly searching for ways to build longer-lasting batteries. 

A particular manufacturer produces AA batteries that are designed to last an average of 17 hours with a 
standard deviation of 0.8 hours. Quality control inspectors select a random sample of 50 batteries during 
each hour of production, and they then drain them under conditions that mimic normal use. Here are 
the lifetimes (in hours) of the batteries from one such sample: 


lo.73: 15.60 N63) A/S? 164 728 - toe? Mi28- 127 1750 146 > 1650- Tealo 
15.59 A754 loAo, 15563 16.82 1716 1662 167), 16.69 -17.98 16:36 17,80" 16:61 
199 dao 720" Wye loos Ness 1748 1538 deol 15.98. tose: 169s 16.01 
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ACTIVITY 


Introduction 


The battery manufacturer in the Case Study could find the true mean lifetime 
of all the batteries produced in an hour. Quality control inspectors would simply 
measure the lifetime of each battery (by draining it) and then calculate the aver- 
age. With this method, the company would know the truth about the population 
mean jt, but it would have no batteries left to sell! 

Instead of taking a census, the manufacturer collects data from a random sam- 
ple of 50 batteries produced that hour. The company’s goal is to use the sample 
mean lifetime x to estimate the unknown population mean p. This is an example 
of statistical inference: we use information from a sample to draw conclusions 
about a larger population. 

To make such an inference, we need to know how close the sample mean xX 
is likely to be to the population mean ju. After all, different random samples of 
50 batteries from the same hour of production would yield different values of x. 
How can we describe this sampling distribution of possible x-values? We can think 
of x as a random variable because it takes numerical values that describe the out- 
comes of the random sampling process. As a result, we can examine its probability 
distribution using what we learned in Chapter 6. 

This same reasoning applies to other types of inference settings. Here are a few 
examples. 


e Each month, the Current Population Survey (CPS) interviews a random sample 
of individuals in about 60,000 U.S. households. The CPS uses the proportion of 
unemployed people in the sample / to estimate the national unemployment rate p. 


e¢ Tom is cooking a large turkey breast for a holiday meal. He wants to be sure 
that the turkey is safe to eat, which requires a minimum internal temperature 
of 165°F. Tom uses a thermometer to measure the temperature of the turkey 
meat at four randomly chosen points. If the minimum reading in the sample 
is 170°F, can Tom safely serve the turkey? 


¢ How much do gasoline prices vary in a large city? To find out, a reporter re- 
cords the price per gallon of regular unleaded gasoline at a random sample of 
10 gas stations in the city on the same day. The range (maximum — minimum) 
of the prices in the sample is 25 cents. What can the reporter say about the 
range of gas prices at all the city’s stations? 


The following Activity gives you a chance to estimate an unknown population 
value based on data from a random sample. 


The German Tank Problem 


MATERIALS: 


Tags or pieces of cardstock 
numbered 1 to N; small brown 
paper bag; index card and 
prelabeled graph grid for each 
team; prizes for the winners 


During World War II, the Allies captured several German tanks. Each tank had 
a serial number on it. Allied commanders wanted to know how many tanks the 
Germans had so that they could allocate their forces appropriately. They sent the 
serial numbers of the captured tanks to a group of mathematicians in Washington, 
D.C., and asked for an estimate of the total number of German tanks N. In this 
Activity, you and your teammates will play the role of the mathematicians. 


More recently, people used the 
mathematicians’ method from the 
German tank problem to estimate 
the number of iPhones produced. 
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1. Your teacher will create tags numbered 1, 2, 3, 
..., N to represent the German tanks and place 
them in a bag. The class will be divided into teams 
of three or four students. 


2. The teacher will mix the tags well and ask four 
students to draw one tag each from the bag. Each 
selected tag represents the serial number of a 
captured German tank. All four numbers should be 
written on the board for everyone to see. Return the 
tags to the bag. 


3. Each team will have 15 minutes to come up 
with a statistical formula for estimating the total 
number of tanks N in the bag. You should have 
time to try several ideas. When you are satisfied with 
your method, calculate your estimate of N. Write your team members’ names, 
your formula, and your estimate on the index card provided. 


4. When time is called, each team must give its index card to your teacher. 
The teacher will make a chart on the board showing the formulas and esti- 
mates. Each team will have one minute to explain why it chose the formula 
it did. 

5. The teacher will reveal the actual number of tanks. Which team came closest 
to the correct answer? 


6. What if the Allies had captured four other German tanks? Which team’s 
formula would consistently produce the best estimate? Students should 
help choose nine more simple random samples of four tanks from the bag. 
After each sample is taken, the four serial numbers chosen should be writ- 
ten on the board, and the tags should be returned to the bag and mixed 
thoroughly. 

7. Each team should use its formula to estimate the total number of tanks N for 
each of the nine new samples. The team should then make a dotplot of its 10 
estimates. 


8. Compare the teams’ dotplots. As a class, decide which team used the best 
method for estimating the number of tanks. 


Sampling distributions are the key to inference when data are produced by 
random sampling. Because the results of random samples include an element 
of chance, we can’t guarantee that our inferences are correct. What we can 
guarantee is that our methods usually give correct answers. The reasoning of sta- 
tistical inference rests on asking, “How often would this method give a correct 
answer if I used it very many times?” If our data come from random sampling, 
the laws of probability answer the question “What would happen if we did this 
many times?” 

Section 7.1 presents the basic ideas of sampling distributions. The most 
common applications of statistical inference involve proportions and means. 
Section 7.2 focuses on sampling distributions of sample proportions. Section 7.3 
investigates sampling distributions of sample means. 
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WHAT YOU WILL LEARN 


What Is a Sampling 
Distribution? 


By the end of the section, you should be able to: 


Distinguish between a parameter and a statistic. e Determine whether or not a statistic is an unbiased 
Use the sampling distribution of a statistic to evaluate a estimator of a population parameter. 


claim about a parameter. 
Distinguish among the distribution of a population, the variability of a statistic. 
distribution of a sample, and the sampling distribution of 

a Statistic. 


It is common practice to use Greek 
letters for parameters and Roman letters 
for statistics. In that case, the population 
proportion would be zr (pi, the Greek 
letter for “p”) and the sample proportion 
would be p. We'll stick with the notation 
that’s used on the AP® exam, however. 


Describe the relationship between sample size and the 


What is the average income of American households? Each March, the govern- 
ment’s Current Population Survey (CPS) asks detailed questions about income. 
The random sample of about 60,000 households contacted in March 2012 had 
a mean “total money income” of $69,677 in 2011.! (The median income was 
lower, of course, at $50,054.) That $69,677 describes the sample, but we use it to 
estimate the mean income of all households. 


Parameters and Statistics 


As we begin to use sample data to draw conclusions about a wider population, we 
must be clear about whether a number describes a sample or a population. For the 
sample of households contacted by the CPS, the mean income was x = $69,677. 
The number $69,677 is a statistic because it describes this one CPS sample. The 
population that the poll wants to draw conclusions about is all 121 million U.S. 
households. In this case, the parameter of interest is the mean income yp of all 
these households. We don’t know the value of this parameter. 


(MU 


DEFINITION: Parameter, statistic 
A parameter is a number that describes some characteristic of the population. 
A statistic is a number that describes some characteristic of a sample. 


The value of a parameter is usually not known because we cannot examine 
the entire population. The value of a statistic can be computed directly from the 
sample data. We often use a statistic to estimate an unknown parameter. 

Remember s and p: statistics come from samples, and parameters come from 
populations. As long as we were doing data analysis, the distinction between popu- 
lation and sample rarely came up. Now, however, it is essential. The notation we 
use should reflect this distinction. For instance, we write ju (the Greek letter mu) 
for the population mean and x for the sample mean. We use fp to represent a 
population proportion. The sample proportion f is used to estimate the unknown 
parameter p. 
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From Ghosts to Cold Cabins 


Parameters and statistics 


PROBLEM: Identify the population, the parameter, the sample, and the statistic in each of the 
following settings. 


(a) The Gallup Poll asked a random sample of 515 U.S. adults whether or not they believe in ghosts. 
Of the respondents, 160 said “Yes.”* 


(b) During the winter months, the temperatures outside the Starneses’ cabin in Colorado can 
stay well below freezing (32°F, or 0°C) for weeks at a time. To prevent the pipes from freezing, 
Mrs. Starnes sets the thermostat at 50°F. She wants to know how low the temperature actually 
gets in the cabin. A digital thermometer records the indoor temperature at 20 randomly chosen 
times during a given day. The minimum reading is 38°F. 


SOLUTION: 


(a) The population is all U.S. adults, and the parameter of interest is p, the proportion of all U.S. 
adults who believe in ghosts. The sample is the 515 people who were interviewed in this Gallup Poll. 


160 
The statistic is p = 515 0.31, the proportion of the sample who say they believe in ghosts. 


(b) The population is all times during the day in question; the parameter of interest is the true 
minimum temperature in the cabin that day. The sample consists of the 20 temperature readings at 
randomly selected times. The statistic is the sample minimum, 38°F. 


For Practice Try Exercise 


CHECK YOUR UNDERSTANDING 

Each boldface number in Questions | and 2 is the value of either a parameter or a statistic. 
In each case, state which it is and use appropriate notation to describe the number. 

1. On Tuesday, the bottles of Arizona Iced Tea filled in a plant were supposed to contain 
an average of 20 ounces of iced tea. Quality control inspectors sampled 50 bottles at random 
from the day’s production. These bottles contained an average of 19.6 ounces of iced tea. 
2. Ona New York-to—Denver flight, 8% of the 125 passengers were selected for 
random security screening before boarding. According to the ‘Transportation Security 
Administration, 10% of passengers at this airport are chosen for random screening. 


Sampling Variability 


How can x, based ona sample of only a few thousand of the 121 million American 
households, be an accurate estimate of ju? After all, a second random sample 
taken at the same time would choose different households and likely produce a 
different value of x. This basic fact is called sampling variability: the value of a 
statistic varies in repeated random sampling. 
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ACTIVITY | 


MATERIALS: 

200 colored chips, including 
100 of the same color; large 
bag or other container 


To make sense of sampling variability, we ask, “What would happen if we took 
many samples?” Here’s how to answer that question: 
e ‘Take a large number of samples from the same population. 
¢ Calculate the statistic (like the sample mean x or sample proportion f) for 
each sample. 
e¢ Make a graph of the values of the statistic. 
e Examine the distribution displayed in the graph for shape, center, and spread, 
as well as outliers or other unusual features. 


The following Activity gives you a chance to see sampling variability in action. 


Reaching for Chips 


Before class, your teacher will prepare a population of 200 colored chips, with 100 
having the same color (say, red). The parameter is the actual proportion p of red 
chips in the population: p = 0.50. In this Activity, you will investigate sampling 
variability by taking repeated random samples of size 20 from the population. 

1. After your teacher has mixed the chips thoroughly, each student in the class 
should take a sample of 20 chips and note the sample proportion f of red chips. 
When finished, the student should return all the chips to the bag, stir them up, 
and pass the bag to the next student. 

Note: If your class has fewer than 25 students, have some students take two 
samples. 

2. Each student should record the p-value in a chart on the board and plot this 
value on a class dotplot. Label the graph scale from 0.10 to 0.90 with tick marks 
spaced 0.05 units apart. 

3. Describe what you see: shape, center, spread, and any outliers or other un- 
usual features. 


When Mr. Caldwell’s class did the “Reaching for Chips” Activity, his 35 stu- 
dents produced the graph shown in Figure 7.1. Here’s what the class said about its 
distribution of p-values. 


Shape: The graph is roughly symmetric with a single peak 
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FIGURE 7.1 Dotplot of sample proportions obtained by the 


35 students in Mr. Caldwell’s class. 


: at 0.5. 
T 


0.7 08 0.9 Center: The mean of our sample proportions is 0.499. This 
is the balance point of the distribution. 


Spread: The standard deviation of our sample proportions is 
0.112. The values of f are typically about 0.112 away from 


the mean. 
Outliers: There are no obvious outliers or other unusual features. 


Of course, the class only took 35 different simple random samples of 20 chips. 
There are many, many possible SRSs of size 20 from a population of size 200 (about 
1.6- 10°’, actually). If we took every one of those possible samples, calculated the value 
of f for each, and graphed all those p-values, then we'd have a sampling distribution. 
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EE 


DEFINITION: Sampling distribution 


The sampling distribution of a statistic is the distribution of values taken by the 
statistic in all possible samples of the same size from the same population. 


It’s usually too difficult to take all possible samples of size n to obtain the sam- 
pling distribution ofa statistic. Instead, we can use simulation to imitate the process 
of taking many, many samples and create an approximate sampling distribution. 


Reaching for Chips 


Simulating a sampling distribution 


We used Fathom software to simulate choosing 500 SRSs of size 
n = 20 from a population of 200 chips, 100 red and 100 blue. Figure 7.2 
is a dotplot of the values of f, the sample proportion of red chips, from 
these 500 samples. 


PROBLEM: 
(a) There is one dot on the graph at 0.15. Explain what this value represents. 
(b) Describe the distribution. Are there any obvious outliers? 


(c) Would it be surprising to get a sample proportion of 0.85 or higher in an SRS of 
size 20 when p = 0.5? Justify your answer. 


FIGURE 7.2 Dotplot of the sample propor- (d) Suppose your teacher prepares a bag with 200 chips and claims that half of them 
tion p of red chips in 500 simulated SRSs, are red. A classmate takes an SRS of 20 chips; 17 of them are red. What would you 


created by Fathom software. 


conclude about your teacher's claim? Explain. 


SOLUTION: 

(a) Inone SRS of 20 chips, there were 3 red chips. So p = 3/20 = 0.15 for this sample. 

(b) Shape: Symmetric, unimodal, and somewhat bell-shaped. Center: Around 0.5. Spread: The values 
of p fall mostly between 0.25 and 0.75. Outliers: One sample with p = 0.15 stands out. 

(c) tis very unlikely to obtain an SRS of 20 chips in which p = 0.85 from a population in which 

p= 0.5. A value of p this large or larger never occurred in 500 simulated samples. 

(4) This student's result gives strong evidence against the teacher's claim. As noted in part (c), itis 
very unlikely to get a sample proportion of 0.85 or higher when p = 0.5. 


For Practice Try Exercise 9 | 


Strictly speaking, the sampling distribution is the ideal pattern that would 
emerge if we looked at all possible samples of size 20 from our population of chips. 
A distribution obtained from simulating a smaller number of random samples, 
like the 500 values of f in Figure 7.2, is only an approximation to the sampling 
distribution. One of the uses of probability theory in statistics is to obtain sampling 
distributions without simulation. We’ll get to the theory later. The interpretation 
of a sampling distribution is the same, however, whether we obtain it by simula- 
tion or by the mathematics of probability. 
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AP® EXAM TIP Terminology 
matters. Don’t say “sample 
distribution” when you mean 


sampling distribution. You will 
lose credit on free response 
questions for misusing 
statistical terms. 


SAMPLING DISTRIBUTIONS 


Figure 7.3 illustrates the process of choosing many random samples of 20 chips 
and finding the sample proportion of red chips f for each one. Follow the 
flow of the figure from the population at the left, to choosing an SRS and finding 
the f for this sample, to collecting together the f’s from many samples. The first 
sample has f = 0.40. The second sample is a different group of chips, with 6 = 0.55, 
and so on. The dotplot at the right of the figure shows the distribution of the val- 
ues of f from 500 separate SRSs of size 20. This dotplot displays the approximate 
sampling distribution of the statistic p. 


Distributions of 
sample data 
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FIGURE 7.3 The idea of a sampling distribution: take many samples from the same population, 
collect the f’s from all the samples, and display the distribution of the ’s. The dotplot shows the 
results of 500 samples. 


As Figure 7.3 shows, there are three distinct distributions involved when we 
sample repeatedly and measure a variable of interest. The population distribution 
gives the values of the variable for all individuals in the population. In this case, 
the individuals are the 200 chips and the variable we’re recording is color. Our 
parameter of interest is the proportion of red chips in the population, p = 0.50. 
Each random sample that we take consists of 20 chips. 

The distribution of sample data shows the values of the variable “color” for the 
individuals in the sample. For each sample, we record a value for the statistic f, the 
sample proportion of red chips. Finally, we collect the values of f from all possible 
samples of the same size and display them in the sampling distribution. 

Be careful: The population distribution and the distribution of sample 
data describe individuals. A sampling distribution describes how a statistic og 
varies in many samples from the population. 


CHECK YOUR UNDERSTANDING 

Mars, Incorporated, says that the mix of colors in its M&M’S® Milk Chocolate Candies 
is 24% blue, 20% orange, 16% green, 14% yellow, 13% red, and 13% brown. Assume that 
the company’s claim is true. We want to examine the proportion of orange M&M’S in 
repeated random samples of 50 candies. 


1. Graph the population distribution. Identify the individuals, the variable, and the 
parameter of interest. 
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2. Imagine taking an SRS of 50 M&M’S. Make a graph showing a possible distribution of 
the sample data. Give the value of the appropriate statistic for this sample. 


3. Which of the graphs that follow could be the approximate sampling distribution of the 
statistic? Explain your choice. 


Describing Sampling Distributions 


The fact that statistics from random samples have definite sampling distributions 
allows us to answer the question “How trustworthy is a statistic as an estimate of a 
parameter?” To get a complete answer, we consider the shape, center, and spread 
of the sampling distribution. For reasons that will be clear later, we'll save shape 
for last. 


Center: Biased and unbiased estimators Let’s return to the familiar 
chips example. How well does the sample proportion of red chips estimate the 
population proportion of red chips, p = 0.5? The dotplot in the margin shows 
the approximate sampling distribution of f once again. We noted earlier that the 
center of this distribution is very close to 0.5, the parameter value. In fact, if we 
took all possible samples of 20 chips from the population, calculated f for each 
sample, and then found the mean of all those p-values, we'd get exactly 0.5. For 
this reason, we say that f is an unbiased estimator of p. 


(I 


DEFINITION: Unbiased estimator 


A statistic used to estimate a parameter is an unbiased estimator if the mean of its 
sampling distribution is equal to the value of the parameter being estimated. 


If we take many samples, the value of an unbiased estimator will sometimes 
exceed the value of the parameter and sometimes be less. However, because the 
sampling distribution of the statistic is centered at the true value, we will not con- 
sistently overestimate or underestimate the parameter. This is consistent with our 
definition of bias from Chapter 4. 

We will confirm in Section 7.2 that the sample proportion f is an unbiased es- 
timator of the population proportion p. This is a very helpful result if we’re dealing 
with a categorical variable (like color). With quantitative variables, we might be 
interested in estimating the population mean, median, minimum, maximum, Q), 
Q3, variance, standard deviation, JOR, or range. Which (if any) of these are unbi- 
ased estimators? The following Activity should shed some light on this question. 
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ACTIVITY | Sampling heights 


MATERIALS: In this Activity, you will use a population of quantitative data to investigate 
Small piece of cardstock for whether a given statistic is an unbiased estimator of its corresponding population 
each student; bag parameter. 
1. Each student should write his or her height (in inches) neatly on a 
small piece of cardstock and then place it in the bag. 
2. After your teacher has mixed the cards thoroughly, each student in the 
class should take a sample of four cards and record the heights of the four 
chosen students. When finished, the student should return the cards to the 
bag, mix them up, and pass the bag to the next student. 
Note: If your class has fewer than 25 students, have some students take 
two samples. 
3. For your SRS of four students from the class, calculate the sample mean X 
and the sample range (maximum — minimum) of the heights. Then go to the 
board and record the heights of the four students in your sample, the sample 
mean x, and the sample range in a chart like the one below. 


Height (in.) Sample mean (x) Sample range (max — min) 
62, 75, 68, 63 67 75 — 62 = 13 


4. Plot the values of your sample mean and sample range on the two class 
dotplots drawn by your teacher. 

5. Once everyone has finished, find the population mean yu and the population 
range. 

6. Based on your approximate sampling distributions of x and the sample range, 
which statistic appears to be an unbiased estimator? Which appears to be a 
biased estimator? 


When Mrs. Washington’s class did the “Sampling Heights” Activity, they pro- 
duced the graphs shown in Figure 7.4. Her students concluded that the sample 
mean X is probably an unbiased estimator of the population mean ju. Their reason: 
the center of the approximate sampling distribution of xX, 65.67 inches, is close 
to the population mean of 66.07 inches. On the other hand, Mrs. Washington’s 


Approximate sampling distribution Approximate sampling distribution 
of x (n = 4) of sample range (n = 4) 
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FIGURE 7.4 Results from Mrs. Washington’s class. The sample mean appears to be an unbiased estimator. 
The sample range appears to be a biased estimator. 
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students decided that the sample range is a biased estimator of the population 
range. Why? Because the center of the sampling distribution for this statistic was 
10.12 inches, much less than the corresponding parameter value of 21 inches. 

To confirm the class’s conclusions, we used Fathom software to simulate taking 
250 SRSs of n = 4 students. For each sample, we plotted the mean height x and 
the range of the heights. Figure 7.5 shows the approximate sampling distributions 
for these two statistics. It looks like the class was right: X is an unbiased estimator 
of 44, but the sample range is clearly a biased estimator. The range of the sample 
heights tends to be much lower, on average, than the population range. 
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FIGURE 7.5 Results from a Fathom simulation of 250 SRSs of size n = 4 from the students in Mrs. 
Washington's class. The sample mean is an unbiased estimator. The sample range is a biased estimator. 


THINK Why do we divide by n — 1 when calculating the sample 
ABOUT IT 


variance? In Chapter 1, we introduced the sample variance s? as a measure of 
spread for a set of quantitative data. The idea of s? is simple: it’s a number that de- 
scribes the “average” squared deviation of the values in the sample from their mean 
x. It probably surprised you when we computed this average by dividing by n — 1 


instead of n. Now we're ready to tell you why we defined s? = Soyo — xy. 
a 


In an inference setting involving a quantitative variable, we 
might be interested in estimating the variance o” of the population 
distribution. The most logical choice for our estimator is the sam- 
ple variance s?. We used Fathom software to take 500 SRSs of size n = 4 
from the population distribution of heights in Mrs. Washington’s 
class. Note that the population variance is 0? = 22.19. For each 
sample, we recorded the value of two statistics: 


; ] _ ; 
0 alee var = 52 = eG — x) (the sample variance) 
a 


Value of variance from sample data 


! Ny 
FIGURE 7.6 Results from a Fathom simulation of — nei — #) 
500 SRSs of size n = 4 from the population distri- : ; ; ee 
bution of heights in Mrs. Washington’s class. The Figure 7.6 shows the approximate sampling distributions of these 


sample variance s? (labeled “var” in the figure) isan two statistics. We used histograms to show the overall pattern 
unbiased estimator. The “varn” statistic (dividing by more clearly. The vertical lines mark the means of these two 
ninstead of n — 1) is a biased estimator. distributions. 
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FIGURE 7.7 The approximate 
sampling distribution of the 
sample proportion 6 from SRSs 
of size n = 100 and n = 1000 
drawn from a population with 
proportion p = 0.37 who 

have watched Survivor. Both 
dotplots show the results of 
400 SRSs. 


We can see that “varn” is a biased estimator of the population variance. The 
mean of its sampling distribution (marked with a blue line segment) is clearly less 
than the value of the population parameter, 22.19. However, the statistic “var” 
(otherwise known as the sample variance sz) is an unbiased estimator. Its values 
are centered at 22.19. That’s why we divide by n — 1 and not n when calculating 


the sample variance: to get an unbiased estimator of the population variance. 
.——_—_ 


Spread: Low variability is better! To get a trustworthy estimate of an 
unknown population parameter, start by using a statistic that’s an unbiased esti- 
mator. This ensures that you won't consistently overestimate or underestimate the 
parameter. Unfortunately, using an unbiased estimator doesn’t guarantee that the 
value of your statistic will be close to the actual parameter value. The following 
example illustrates what we mean. 


Who Watches Survivor? 
Why sample size matters 


Television executives and companies who advertise on T'V are interested in how 
many viewers watch particular shows. According to Nielsen ratings, Survivor 
was one of the most-watched television shows in the United States during 
every week that it aired. Suppose that the true proportion of U.S. adults who 
have watched Survivor is p = 0.37. 


The top dotplot in Figure 7.7 shows the results of drawing 400 SRSs of size 
n = 100 from a population with p = 0.37. We see that a sample of 100 people 
often gave a p quite far from the population parameter. That is why a Gallup 
Poll asked not 100, but 1000 people whether they had watched Survivor. Let’s 
repeat our simulation, this time taking 400 SRSs of size n = 1000 from a population 
with proportion p = 0.37 who have watched Survivor. The bottom dotplot in Figure 


77 displays the distribution of the 400 values of / from these new samples. Both 
graphs are drawn on the same horizontal scale to make comparison easy. 
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We can see that the spread of the top dotplot in Figure 7.7 is much greater than the 
spread of the bottom dotplot. With samples of size 100, the values of f vary from 
0.25 to 0.54. The standard deviation of these p-values is about 0.05. Using SRSs of 
size 1000, the values of only vary from 0.328 to 0.412. The standard deviation of 
these p-values is about 0.015, so most random samples of 1000 people give a f that 
is within 0.03 of the actual population parameter, p = 0.37. 


The sample proportion f from a random sample of any size is an unbiased 
estimator of the parameter p. As we can see from the previous example, though, 
larger random samples have a clear advantage. They are much more likely to pro- 
duce an estimate close to the true value of the parameter. Said another way, larger 
random samples give us more precise estimates than smaller random samples. 
That’s because a large random sample gives us more information about the un- 
derlying population than a smaller sample does. 4 

Taking a larger sample doesn’t fix bias. Remember that even a very large rT) 
voluntary response sample or convenience sample is worthless because of bias. 

There are general rules for describing how the spread of the sampling distribu- 
tion of a statistic decreases as the sample size increases. In Sections 7.2 and 7.3, 
we'll reveal these rules for the sampling distributions of / and x. One important 
and surprising fact is that the variability of a statistic in repeated sampling does not 
depend very much on the size of the population. 


VARIABILITY OF A STATISTIC 


The variability of a statistic is described by the spread of its sampling distri- 
bution. This spread is determined mainly by the size of the random sample. 
Larger samples give smaller spreads. ‘The spread of the sampling distribution 
does not depend much on the size of the population, as long as the popula- 
tion is at least 10 times larger than the sample. 


Why does the size of the population have little influence on the behavior of 
statistics from random samples? Imagine sampling harvested corn by thrusting 
a scoop into a large sack of corn kernels. The scoop doesn’t know whether it is 
surrounded by a bag of corn or by an entire truckload. As long as the corn is well 
mixed (so that the scoop selects a random sample), the variability of the result 
depends only on the size of the scoop. 

The fact that the variability of a statistic is controlled by the size of the sample 
has important consequences for designing samples. Suppose a researcher wants 
to estimate the proportion of all U.S. adults who use Twitter regularly. A random 
sample of 1000 or 1500 people will give a fairly precise estimate of the parameter 
because the sample size is large. Now consider another researcher who wants 
to estimate the proportion of all Ohio State University students who use Twitter 
regularly. It can take just as large an SRS to estimate the proportion of Ohio State 
University students who use Twitter regularly as to estimate with equal precision 
the proportion of all U.S. adults who use Twitter regularly. We can’t expect to 
need a smaller SRS at Ohio State just because there are about 60,000 Ohio State 
students and about 235 million adults in the United States. 
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CHECK YOUR UNDERSTANDING 
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The histogram above left shows the intervals (in minutes) between eruptions of Old 
Faithful geyser for all 222 recorded eruptions during a particular month. For this popula- 
tion, the median is 75 minutes. We used Fathom software to take 500 SRSs of size 10 from 
the population. The 500 values of the sample median are displayed in the histogram above 
right. ‘The mean of these 500 values is 73.5. 


1. Is the sample median an unbiased estimator of the population median? Justify your answer. 
2. Suppose we had taken samples of size 20 instead of size 10. Would the spread of the 
sampling distribution be larger, smaller, or about the same? Justify your answer. 


3. Describe the shape of the sampling distribution. 


Bias, variability, and shape We can think of the true value of the popu- 
lation parameter as the bull’s-eye on a target and of the sample statistic as an ar- 
row fired at the target. Both bias and variability describe what happens when we 
take many shots at the target. Bias means that our aim is 
off and we consistently miss the bull’s-eye in the same 
direction. Our sample values do not center on the popu- 
lation value. High variability means that repeated shots 
are widely scattered on the target. Repeated samples do 
not give very similar results. Figure 7.8 shows this target 
illustration of the two types of error. 

Notice that low variability (shots are close together) 
can accompany high bias (shots are consistently away 
from the bull’s-eye in one direction). And low or no bias 


High bias, low variability Low bias, high variability (shots center on the bull’s-eye) can accompany high vari- 
ia) (b) ability (shots are widely scattered). Ideally, we'd like our 
estimates to be accurate (unbiased) and precise (have low 


. variability). See Figure 7.8(d). 
The following example attempts to tie these ideas to- 
gether in a familiar setting. 


High bias, high variability The ideal: no bias, low variability FIGURE 7.8 Bias and variability. (a) High bias, low variability. 
(b) Low bias, high variability. (c) High bias, high variability. (d) The 
(c) (d) ideal: no bias, low variability. 
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The German Tank Problem 


Evaluating estimators: Shape, center, spread 


Refer to the Activity on page 422. Mrs. Friedman’s student teams came 
up with four different methods for estimating the number of tanks in 
the bag: (1) “maxmin” = maximum + minimum, (2) “meanpl2sd” = 
x + 2s,, (3) “twicemean” = 2x, and (4) “twomedian” = 2(median). 
She added one more method, called “partition.” Figure 7.9 shows the 

results of taking 250 SRSs of 4 tanks and recording the value of the five 

statistics for each sample. The vertical line marks the actual value of the 
population parameter N: there were 342 tanks in the bag. 


PROBLEM: Use the information in Figure 7.9 to help answer 
these questions. 


(a) Which of the four statistics proposed by the student teams is 
the best estimator? Justify your answer. 


(b) Why was the partition method, which uses the statistic 
(5/4) - maximum, recommended by the mathematicians in Washington, 
D.C.2 


SOLUTION: 


(a) Meanpl26d is a biased estimator: the center of its sampling 
distribution is too high. This statistic produces consistent over- 
estimates of the number of tanks. The other three statistics pro- 
posed by the students appear to be unbiased estimators. All three 
sampling distributions have roughly symmetric shapes, so these 
statistics are about equally likely to underestimate or overestimate 
the number of tanks. Because maxmin has the smallest variability 
among the three, it would generally produce estimates that are closer to the actual number of tanks. 
Among the students’ proposed statistics, maxmin would be the best estimator. 


FIGURE 7.9 Results from a Fathom simulation of 250 SRSs of 
4 tanks. The approximate sampling distributions of five different 
statistics are shown. 


(b) The partition method uses a statistic (5/4 - maximum) that is an unbiased estimator and that 
has much less variability than any of the student teams’ statistics. Its sampling distribution is left- 
skewed, 50 the mean of the distribution is less than its median. Because more than half of the dots 
in the graph are to the right of the mean, the statistic is more likely to overestimate than underesti- 
mate the number of tanks. The mathematicians believed that it would be better to err on the side of 
caution and give the military commanders an estimate that is slightly too high. 


For Practice Try Exercise 


The lesson about center and spread is clear: given a choice of statistics to 
estimate an unknown parameter, choose one with no or low bias and mini- 
mum variability. Shape is a more complicated issue. We have seen sampling 
distributions that are left-skewed, right-skewed, roughly symmetric, and even 
approximately Normal. The same statistic can have sampling distributions 
with different shapes depending on the population distribution and the sam- 
ple size. Our advice: be sure to consider the shape of the sampling distribution 
before doing inference. 


Popa (a) A random sample of 1000 people who signed a card 
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Summary 


e A parameter is a number that describes a population. To estimate an un- 
known parameter, use a statistic calculated from a sample. 


e The population distribution of a variable describes the values of the variable 
for all individuals in a population. The sampling distribution of a statistic 
describes the values of the statistic in all possible samples of the same size 
from the same population. Don’t confuse the sampling distribution with a 
distribution of sample data, which gives the values of the variable for all 
individuals in a particular sample. 

e A statistic can be an unbiased estimator or a biased estimator of a param- 
eter. A statistic is a biased estimator if the center (mean) of its sampling distri- 
bution is not equal to the true value of the parameter. 

e The variability of a statistic is described by the spread of its sampling distribu- 
tion. Larger samples give smaller spread. 

e When trying to estimate a parameter, choose a statistic with low or no bias 
and minimum variability. 


Exercises 


For Exercises | and 2, identify the population, the param- For each boldface number in Exercises 3 to 6, (1) state 
eter, the sample, and the statistic in each setting. whether it is a parameter or a statistic and (2) use appropri- 


tare ate notation to describe each number; for example, p = 0.65. 
. ealthy livin 
‘ : 3. Get your bearings A large container is full of ball 


bearings with mean diameter 2.5003 centimeters (cm). 

This is within the specifications for acceptance of the 

container by the purchaser. By chance, an inspector 

chooses 100 bearings from the container that have 

(b) ‘Tom is cooking a large turkey breast for a holiday mean diameter 2.5009 cm. Because this is outside the 
meal. He wants to be sure that the turkey is safe to eat, specified limits, the container is mistakenly rejected. 
which requires a minimum internal temperature of 
165°F. ‘Tom uses a thermometer to measure the tem- 
perature of the turkey meat at four randomly chosen 
points. ‘The minimum reading in the sample is 170°F. 


saying they intended to quit smoking were contacted 
9 months later. It turned out that 210 (21%) of the sampled 
individuals had not smoked over the past 6 months. 


4. Voters Voter registration records show that 41% of 
voters in a state are registered as Democrats. ‘To test 
a random digit dialing device, you use it to call 250 
randomly chosen residential telephones in the state. 


2. The economy Of the registered voters contacted, 33% are registered 
(a) Each month, the Current Population Survey interviews Denoee: 
a random sample of individuals in about 60,000 5. Unlisted numbers A telemarketing firm in a large city 
US. households. One of their goals is to estimate the uses a device that dials residential telephone numbers 
national unemployment rate. In October 2012, 7.9% of in that city at random. Of the first 100 numbers dialed, 
those interviewed were unemployed. 48% are unlisted. This is not surprising because 52% of 


(yep emahdacmdiie ates way inaleee Che TS all residential phones in the city are unlisted. 


find out, a reporter records the price per gallon of regular 6. How tall? A random sample of female college 
unleaded gasoline at a random sample of 10 gas stations students has a mean height of 64.5 inches, which 
in the city on the same day. The range (maximum — is greater than the 63-inch mean height of all adult 


minimum) of the prices in the sample is 25 cents. American women. 
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Exercises 7 and 8 refer to the small population {2, 6, 8, 
10, 10, 12} with mean js = 8 and range 10. 


10. 


Sampling distribution 
List all 15 possible SRSs of size n = 2 from the popu- 
lation. Find the value of x for each sample. 


Make a graph of the sampling distribution of x. 
Describe what you see. 


Sampling distribution 


List all 15 possible SRSs of size n = 2 from the popula- 
tion. Find the value of the range for each sample. 


Make a graph of the sampling distribution of the 
sample range. Describe what you see. 


Doing homework A school newspaper article claims 
that 60% of the students at a large high school did all 
their assigned homework last week. Some skeptical 
AP® Statistics students want to investigate whether this 
claim is true, so they choose an SRS of 100 students 
from the school to interview. What values of the sample 
proportion p would be consistent with the claim that 
the population proportion of students who completed 
all their homework is p = 0.60? To find out, we used 
Fathom software to simulate choosing 250 SRSs of size 
n = 100 students from a population in which p = 0.60. 
The figure below is a dotplot of the sample proportion 
p of students who did all their homework. 


o 
o 
oo o 
@ oo o 
oooo o 
eoooo 6 
eoooc0o0 @ 
ecooce00 @ 
eoooceoco Go 
eoooo00000 oO 
eeoooco0oeo 0 
@eoo0o0o0o0o00o0 @ 
@oo0o0oco0o0o00 @ o 
@oeoooeoo0ooo00o oo @G 
© Geococo0oceco000 6 
S@oococc0c0coocoooeo 6 Go 
@o0o0o0cco00000000 6 GD 
@eeo0o0o0ocooceoooeocoo00 Go GB 
G@oo00o00e0oceo0e0e0000 6 GG 
Seococeooooocc0ooceco000 8 GO 
Seooccooccoocecceco00 8 G 
OO GOCSCGODGGCOCOOGOGCCOGO000 O00 
045 O50 O55 060 O65 O70 O75 
proportion_yes 


There is one dot on the graph at 0.73. Explain what 
this value represents. 


Describe the distribution. Are there any obvious outliers? 


Would it be surprising to get a sample proportion of 
0.45 or lower in an SRS of size 100 when p = 0.6? 
Justify your answer. 


Suppose that 45 of the 100 students in the actual 
sample say that they did all their homework last 
week. What would you conclude about the newspa- 
per article’s claim? Explain. 


Tall girls According to the National Center for 
Health Statistics, the distribution of heights for 
16-year-old females is modeled well by a Normal 
density curve with mean ys = 64 inches and 
standard deviation o = 2.5 inches. To see if this 


distribution applies at their high school, an AP® 
Statistics class takes an SRS of 20 of the 300 
16-year-old females at the school and measures 
their heights. What values of the sample mean x 
would be consistent with the population distribu- 
tion being N(64, 2.5)? To find out, we used Fath- 
om software to simulate choosing 250 SRSs of size 
n = 20 students from a population that is N(64, 
2.5). The figure below is a dotplot of the sample 
mean height x of the students in each sample. 


SBesora% 


Sog8800 


62.0 62.5 63.0 63.5 64.0 645 65.0 655 66.0 


(a) ‘There is one dot on the graph at 62.4. Explain what 
this value represents. 


(b) Describe the distribution. Are there any obvious outliers? 


(c) Would it be surprising to get a sample mean of 64.7 
or more in an SRS of size 20 when ps = 64? Justify 
your answer. 


(d) Suppose that the average height of the 20 girls in the 
class’s actual sample is X = 64.7. What would you 
conclude about the population mean height y for 
the 16-year-old females at the school? Explain. 


11. Doing homework Refer to Exercise 9. 


(a) Make a bar graph of the population distribution 
given that the newspaper's claim is correct. 


(b) Sketch a possible graph of the distribution of sample 
data for the SRS of size 100 taken by the AP® Statis- 
tics students. 


12. Tall girls Refer to Exercise 10. 
(a) Make a graph of the population distribution. 


(b) Sketch a possible dotplot of the distribution of 
sample data for the SRS of size 20 taken by the AP® 
Statistics class. 


Exercises 13 and 14 refer to the following setting. During 
the winter months, outside temperatures at the Starneses’ 
cabin in Colorado can stay well below freezing (32°F, 

or 0°C) for weeks at a time. To prevent the pipes from 
freezing, Mrs. Starnes sets the thermostat at 50°F. The 
manufacturer claims that the thermostat allows variation 
in home temperature that follows a Normal distribution 
with o = 3°F. To test this claim, Mrs. Starnes programs 
her digital thermometer to take an SRS of n = 10 readings 
during a 24-hour period. Suppose the thermostat is 
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working properly and that the actual temperatures in the 1, 


cabin vary according to a Normal distribution with mean 
p = 50°F and standard deviation a = 3°F- 


13. Cold cabin? The Fathom screen shot below shows 
the results of taking 500 SRSs of 10 temperature read- 
ings from a population distribution that is N(50, 3) 


and recording the sample variance sz each time. 18. 


Sample variance & 
(a) Describe the approximate sampling distribution. 


(b) Suppose that the variance from an actual sample is 
sz = 25. What would you conclude about the ther- 
mostat manufacturer’s claim? Explain. 


14. Cold cabin? The Fathom screen shot below shows 
the results of taking 500 SRSs of 10 temperature read- 
ings from a population distribution that is N(50, 3) 
and recording the sample minimum each time. 


38 40 42 44 46 48 50 52 


Sample minimum 


(a) Describe the approximate sampling distribution. 

(b) Suppose that the minimum of an actual sample is 
40°F. What would you conclude about the thermo- 
stat manufacturer’s claim? Explain. 


15. Asample of teens A study of the health of teenagers 
plans to measure the blood cholesterol levels of an 
SRS of 13- to 16-year-olds. The researchers will report 
the mean x from their sample as an estimate of the 


mean cholesterol level yu in this population. Explain (a) 
to someone who knows little about statistics what it 
means to say that ¥ is an unbiased estimator of ju. (b) 


16. Predict the election A polling organization plans to 


ask a random sample of likely voters who they plan 20. 


to vote for in an upcoming election. The researchers 
will report the sample proportion f that favors the in- 
cumbent as an estimate of the population proportion 
p that favors the incumbent. Explain to someone 
who knows little about statistics what it means to say 
that p is an unbiased estimator of p. 


A sample of teens Refer to Exercise 15. The sample 
mean X is an unbiased estimator of the population 
mean {4 no matter what size SRS the study chooses. 
Explain to someone who knows nothing about 
statistics why a large random sample will give more 
trustworthy results than a small random sample. 


Predict the election Refer to Exercise 16. The 
sample proportion f is an unbiased estimator of the 
population proportion p no matter what size random 
sample the polling organization chooses. Explain 

to someone who knows nothing about statistics why 
a large random sample will give more trustworthy 
results than a small random sample. 


Bias and variability ‘The figure below shows his- 
tograms of four sampling distributions of different 
statistics intended to estimate the same parameter. 


Population parameter 


(i) 


Population parameter 


(ii) 


_ foots d. 


| Population parameter 


(iii) 


| Population parameter 
(iv) 
Which statistics are unbiased estimators? Justify your 
answer. 


Which statistic does the best job of estimating the 
parameter? Explain. 


IRS audits ‘The Internal Revenue Service plans to 
examine an SRS of individual federal income tax 
returns. ‘lhe parameter of interest is the proportion 
of all returns claiming itemized deductions. Which 
would be better for estimating this parameter: an 
SRS of 20,000 returns or an SRS of 2000 returns? 
Justify your answer. 


Section 7.1 


Multiple choice: Select the best answer for Exercises 21 
to 24. 


21. Ata particular college, 78% of all students are 
receiving some kind of financial aid. The school 
newspaper selects a random sample of 100 stu- 
dents and 72% of the respondents say they are 
receiving some sort of financial aid. Which of the 
following is true? 


(a) 78% is a population and 72% is a sample. 

(b) 72% is a population and 78% is a sample. 

(c) 78% isa parameter and 72% is a statistic. 

(d) 72% isa parameter and 78% is a statistic. 

(e) 78% is a parameter and 100 is a statistic. 

22. A statistic is an unbiased estimator of a parameter when 
(a) the statistic is calculated from a random sample. 


(b) ina single sample, the value of the statistic is equal 
to the value of the parameter. 


(c) in many samples, the values of the statistic are very 
close to the value of the parameter. 


(d) in many samples, the values of the statistic are cen- 
tered at the value of the parameter. 


(e) in many samples, the distribution of the statistic has 
a shape that is approximately Normal. 


23. Ina residential neighborhood, the median value 
of a house is $200,000. For which of the following 


sample sizes is the sample median most likely to be 


above $250,000? 
(a) n= 10 
(b) n= 50 
(c) n= 100 
(d) n= 1000 
(e) Impossible to determine without more information. 


24. Increasing the sample size of an opinion poll will 
reduce the 


(a) bias of the estimates made from the data collected in 


the poll. 

(b) variability of the estimates made from the data col- 
lected in the poll. 

(c) effect of nonresponse on the poll. 

(d) variability of opinions in the sample. 


(e) variability of opinions in the population. 


25. Dem bones (2.2) Osteoporosis is a condition 
2 in which the bones become brittle due to loss of 


minerals. To diagnose osteoporosis, an elaborate 
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apparatus measures bone mineral density (BMD). 
BMD is usually reported in standardized form. The 
standardization is based on a population of healthy 
young adults. The World Health Organization 
(WHO) criterion for osteoporosis is a BMD score 
that is 2.5 standard deviations below the mean for 
young adults. BMD measurements in a population 
of people similar in age and gender roughly follow 
a Normal distribution. 


What percent of healthy young adults have osteopo- 
tosis by the WHO criterion? 


Women aged 70 to 79 are, of course, not young 
adults. The mean BMD in this age group is about 
—2 on the standard scale for young adults. Suppose 
that the standard deviation is the same as for young 
adults. What percent of this older population has 
osteoporosis? 


Squirrels and their food supply (3.2) Animal 
species produce more offspring when their supply 
of food goes up. Some animals appear able to 
anticipate unusual food abundance. Red squirrels 
eat seeds from pinecones, a food source that some- 
times has very large crops. Researchers collected 
data on an index of the abundance of pinecones 
and the average number of offspring per female 
over 16 years.» Computer output from a least- 
squares regression on these data and a residual 
plot are shown below. 


Predictor Coef SE Coef ae P 
Constant 1.4146 0.2517 5.62 0.000 
Cone index 0.4399 0.1016 4.33 0.001 
S = 0.600309 R-Sq = 57.2% R-Sq(adj) = 54.2% 
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Cone index 
(a) Give the equation for the least-squares regression 
line. Define any variables you use. 
(b) Isa linear model appropriate for these data? Explain. 
a . 

(c) Interpret the values of r“ and s in context. 
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ee Sample Proportions 


WHAT YOU WILL LEARN By the end of the section, you should be able to: 


e Find the mean and standard deviation of the sampling e If appropriate, use a Normal distribution to calculate 
distribution of a sample proportion p. Check the 10% probabilities involving p. 


condition before calculating >. 


Determine if the sampling distribution of A is approxi- 
mately Normal. 


What proportion of U.S. teens know that 1492 was the year in which Columbus “discov- 
ered” America? A Gallup Poll found that 210 out of a random sample of 501 American 
teens aged 13 to 17 knew this historically important date.* The sample proportion 

a 

p= SO = 0.42 
is the statistic that we use to gain information about the unknown population 
proportion p. Because another random sample of 501 teens would likely result 
in a different estimate, we can only say that “about” 42% of U.S. teenagers know 
that Columbus discovered America in 1492. In this section, we'll use sampling 
distributions to clarify what “about” means. 


AS FAR AS I KNow THE WORLD 

THERE ISN'T A. THERE ISN'T A So WHY IS HAS MANY 

“CLEVELAND DAY”... “CINCINNATI DAY"... THERE A = MYSTERIES. 
THERE ISN'T AN | | “COLUMBUS | 

‘AKRON DAY"... Hl 


Sarai 


The Sampling Distribution of p 


How good is the statistic p as an estimate of the parameter p? To find out, we ask, 
“What would happen if we took many samples?” The sampling distribution of 6 
answers this question. How do we determine the shape, center, and spread of the 
sampling distribution of f? Let’s start with a simulation. 


ACTIVITY | The Candy Machine 


MATERIALS: Imagine a very large candy machine filled with orange, brown, and yellow can- 
Computer with Internet dies. When you insert money, the machine dispenses a sample of candies. In this 
access—one for the class or Activity, you will use an applet to investigate the sample-to-sample variability in 
one per pair of students the proportion of orange candies dispensed by the machine. 
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= ; 1. Launch the Reese’s Pieces® applet at www.rossmanchance. 
Sampling Reese's Pieces| com. Change the population proportion of orange candies to 
p = 0.45 (the applet calls this value 7 instead of p). 


2. Click on the “Draw Samples” button. An animated 
simple random sample of n = 25 candies should be dis- 
pensed. Figure 7.10 shows the results of one such sample. Was 
your sample proportion of orange candies close to the actual 
population proportion, p = 0.45? Look at the value of f in the 
applet window. 


04 02 03 04f05 06 o7 08 3. Click “Draw Samples” 9 more times, so that you have a 


| | | | Mean= 0.480 Std Dev= 0.000 total of 10 sample results. Look at the dotplot of your p-values. 
$22] alan What is the mean of your 10 sample proportions? What is their 
f=0.48 Current Sarnple: 1 standard deviation? 
i T Count Samples... 4. To take many more samples quickly, enter 390 in the 
sample size: [25 I Plot Normal Curve “number of samples” box. Click on the Animate box to turn 
number of samples: [1 the animation off. Then click “Draw Samples.” You have 
iiaihaiidh now taken a total of +00 samples of 25 candies from the ma- 
Draw Samples chine. Describe the shape, center, and spread of the approxi- 
= mate sampling distribution of shown in the dotplot. 
FIGURE 7.10 The result of 5. How would the sampling distribution of the sample proportion p change 


taking one SRS of 25 candies if the machine dispensed n = 50 candies each time instead of 25? “Reset” the 
from a large candy machine in applet. ‘Take 400 samples of 50 candies. Describe the shape, center, and spread 
which 45% of the candies are of the approximate sampling distribution. 


Lee: 6. How would the sampling distribution of 6 change if the proportion of orange 
candies in the machine was p = 0.15 instead of p = 0.45? Does your answer 
depend on whether n = 25 or n = 50? Use the applet to investigate these 
questions. Then write a brief summary of what you learned. 


7. For what combinations of n and p is the sampling distribution of f approxi- 
mately Normal? Use the applet to investigate. 


Figure 7.11 shows one set of possible results from Step 4 of “The Candy 
Machine” Activity. Let’s describe what we see. 


Shape: Roughly symmetric and somewhat bell-shaped. It looks 
as though a Normal curve would approximate this distribution 
fairly well. 


Center: The mean of the 400 sample proportions is 0.449. This 
is quite close to the actual population proportion, p = 0.45. 


01 02 03 04805 06 O7 O8 


Mean = 0.449 Std Dev= 0.105 Spread: The standard deviation of the 400 values of f from these 
ase scint = samples is 0.105. 
f=0.40 6 The dotplot in Figure 7.11] is the approximate sampling dis- 
=0. | urrent Sample: 400 tribution of #. If we took all possible SRSs of n = 25 candies 
‘i fos Ci p P 
“ie Count Samples... 


from the machine and graphed the value of f for each sample, 
then we'd have the sampling distribution of 6. We can get an 
idea of its shape, center, and spread from Figure 7.11. 


sample size: |25 [~ Plot Normal Curve 


number of samples: |400 
[ Animate 


Dept sampiss | FIGURE 7.11 The result of taking 400 SRSs of 25 candies from a large candy machine in which 
__Reset__| 45% of the candies are orange. The dotplot shows the approximate sampling distribution of A. 
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Sampling Candies 


Effect of n and p on shape, center, and spread 


02 03 04 06 0.6 oF 


Mean= 0.446 Std Dev= 0.070 


f=0.50 Current Sample: 400 


n: foas 
sample size: [50 
400 

[~ Animate 


Draw Samples 
Reset 


FIGURE 7.12 The approximate sampling distribution of p 
for 400 SRSs of 50 candies from a population in which 
p = 0.45 of the candies are orange. 


[~ Count Samples... 
I~ Plot Normal Curve 
number of samples: 


51: n 
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Mean= 0.143 Std Dev= 0.069 
q sel al 


p= 0.08 Current Sample: 400 


Tw: 10.15 
sample size: [25 


400 
l Animate 


Draw Samples 
Reset 


[~ Count Samples... 
[~ Plot Normal Curve 
number of samples: 


In a similar way, we can explore the sampling distribution of 
f when n = 50 (Step 5 of the Activity). As Figure 7.12 shows, 
the dotplot is once again roughly symmetric and somewhat 
bell-shaped. This graph is also centered at about 0.45. With 
samples of size 50, however, there is less spread in the values 
of p. The standard deviation in Figure 7.12 is 0.070. For the 
samples of size 25 in Figure 7.1], it is 0.105. ‘To repeat what 
we said earlier, larger samples give the sampling distribution 
a smaller spread. 


What if the actual proportion of orange candies in the machine 
were p = 0.15? Figure 7.13(a) shows the approximate sampling 
distribution of 6 when n = 25. Notice that the dotplot is slightly 
right-skewed. The graph is centered close to the population pa- 
rameter, p = 0.15. As for the spread, it’s similar to the standard 
deviation in Figure 7.12, where n = 50 and p = 045. If we 
increase the sample size to n = 50, the sampling distribution of 
p should show less variability. The standard deviation in Figure 
7.13(b) confirms this. Note that we can’t just visually compare 
the graphs because the horizontal scales are different. The dot- 
plot is more symmetrical than the graph in Figure 7.13(a) and is 
once again centered at a value that is close to p = 0.15. 


o O14 02 O03 
Mean=0.148 Std Dev= 0.051 


ol 1 


fp=0.14 


Current Sample: 400 


n: fos 
sample size: [so 
400 
[~ Animate 


_Draw Samples | 
Reset 


! Count Samples... 
[~ Plot Normal Curve 
number of samples: 


FIGURE 7.13 The result of taking 400 SRSs of (a) size n = 25 and (b) size n = 50 candies from a large candy machine 
in which 15% of the candies are orange. The dotplots show the approximate sampling distribution of p in each case. 


THINK 
ABOUT IT 
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What have we learned so far about the sampling distribution of jh? 


Shape: In some cases, the sampling distribution of f can be approximated by a 
Normal curve. This seems to depend on both the sample size n and the popula- 
tion proportion p. 

Center: The mean of the distribution is ju, = p. This makes sense because the 
sample proportion f is an unbiased estimator of p. 


Spread: For a specific value of p, the standard deviation aj gets smaller as n gets 
larger. The value of oj depends on both n and p. 


To sort out the details of shape and spread, we need to make an important con- 
nection between the sample proportion f and the number of “successes” X in the 
sample. 

In the candy machine example, we started by taking repeated SRSs of n = 25 
candies from a population with proportion p = 0.45 of orange candies. For any 
such sample, we can think of each candy that comes out of the machine as a trial 
of this chance process. A “success” occurs when we get an orange candy. Let X = 
the number of orange candies obtained. As long as the number of candies in the 
machine is very large, X will have close to a binomial distribution with n = 25 
and p = 0.45. (Refer to the 10% condition on page 401.) The sample proportion 
of successes is closely related to X: 


count of successes insample X 


p= 


size of sample n 


How is the sampling distribution of 6 related to the binomial 
count X? From Chapter 6, we know that the mean and standard deviation of 
a binomial random variable X are 


ix=np and ox=Vnp(l — p) 


Because p = X/n = (1/n)X, we're just multiplying the random variable X by a 
constant (1/7) to get the random variable fp. Recall from Chapter 6 that multiply- 
ing by a constant multiplies both the mean and the standard deviation of the new 
random variable by that constant. We have 


] 
bg = | (np) = p (confirming that f is an unbiased estimator of p) 


np(1l — 1 - 
05 = VT =p) = P.e=w 


(as sample size increases, spread decreases) 


That takes care of center and spread. What about shape? Multiplying a random 
variable by a constant doesn’t change the shape of the probability distribution. So 
the sampling distribution of f will have the same shape as the distribution of the 
binomial random variable X. 

If you studied the optional material in Chapter 6 about the Normal approxima- 
tion to a binomial distribution, then you already know the punch line. Whenever 
np and n(1 — p) are at least 10, a Normal distribution can be used to approximate 
the sampling distribution of p. 


eee 
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SAMPLING DISTRIBUTIONS 


Here’s a summary of the important facts about the sampling distribution of . 


SAMPLING DISTRIBUTION OF A SAMPLE PROPORTION 


Figure 7.14 displays the facts in a form that helps you recall the big idea of 
a sampling distribution. The mean of the sampling distribution of f is the true 
value of the population proportion p. The standard deviation of f gets smaller as 
the sample size n increases. In fact, because the sample size n is under the square 
root sign, we’d have to take a sample four times as large to cut the standard devia- 
tion in half. 


SRS size n 


> 


Standard 


SRS sizen 4 ae 
—__—>P deviation 
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FIGURE 7.14 Select a large SRS from a population in which proportion p are successes. The 
sampling distribution of the proportion p of successes in the sample is approximately Normal. 
The mean is p and the standard deviation is Vp(1 — p)/n. 


The two conditions in the preceding box are very important. (1) Large Counts 
condition: If we assume that the sampling distribution of p is approximately 
Normal when it isn’t, any calculations we make using a Normal distribution will 
be flawed. (2) 10% condition: When we’re sampling without replacement from a 
(finite) population, the observations are not independent, because knowing the 
outcome of one trial helps us predict the outcome of future trials. But the standard 
deviation formula assumes that the observations are independent. If we sample too 
large a fraction of the population, our calculated value of ag will be inaccurate. 
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Because larger random samples give better information, it sometimes makes 
sense to sample more than 10% of a population. In such a case, there’s a more 
accurate formula for calculating the standard deviation gj. It uses something 
called a finite population correction (FPC). We'll avoid situations that require 
the FPC in this text. 


CHECK YOUR UNDERSTANDING 


About 75% of young adult Internet users (ages 18 to 29) watch online videos. Suppose that 
a sample survey contacts an SRS of 1000 young adult Internet users and calculates the 
proportion f in this sample who watch online videos. 


1. What is the mean of the sampling distribution of 6? Explain. 

2. Find the standard deviation of the sampling distribution of f. Check that the 10% 
condition is met. 

3. Is the sampling distribution of 6 approximately Normal? Check that the Large 
Counts condition is met. 


4. Ifthe sample size were 9000 rather than 1000, how would this change the sampling 
distribution of p? 


Using the Normal Approximation for p 


Inference about a population proportion p is based on the sampling distribution of f. 
When the sample size is large enough for np and n(1 — p) to both be at least 10 (the 
Large Counts condition), the sampling distribution of f is approximately Normal. In 
that case, we can use a Normal distribution to calculate the probability of obtaining 
an SRS in which f lies in a specified interval of values. Here is an example. 


Going to College 


Normal calculations involving 6 


A polling organization asks an SRS of 1500 first-year college students how far 
away their home is. Suppose that 35% of all first-year students attend college 
within 50 miles of home. 


PROBLEM: Find the probability that the random sample of 1500 students will give a result 
within 2 percentage points of this true value. Show your work. 


SOLUTION: 


Step 1: State the distribution and the values of interest. We want to find the 
probability that p falls between 0.33 and 0.37 (within 2 percentage points, or 0.02, of 0.35). In 
symbols, that’s P.0.33 = p = 0.37). We have an SRS of size n = 1500 drawn from a population in 
which the proportion p = 0.35 attend college within 50 miles of home. What do we know about the 
sampling distribution of p? 

* Itsmeanis j1p = p = 0.35. 
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¢ What about the standard deviation? We need to check the 10% 
condition. To use the standard deviation formula we derived, the 
N(0.35,0.0123) population must contain at least 10(1500) = 15,000 people. 
There are over 1.7 million first-year college students, so 


Pe a (0.35)(0.65) 
= n 1500 


Op= 0.0123 


= 0.0123 


* Can we use a Normal distribution to approximate the sampling 
distribution of p? Check the Large Counts condition: 

np = 1500(0.35) = 525 and n(1 — p) = 1500(0.65) = 975. 
Both are much larger than 10, so the Normal approximation will be 
quite accurate. 


Figure 7.15 shows the Normal distribution that we'll use with the 


Gat Meador 03 area of interest shaded and the mean, standard deviation, and bound- 


9.33 Value ofp 0.37 ary values labeled. 
FIGURE 7.15 The Normal approximation to the sampling Step 2: Perform calculations—show your work! The 
distribution of p. standardized scores for the two boundary values are 
(0) 5) = (0) 15) (O)eH/ = (05) 
= = —1.63 and z= ————— = 1,63 
0.0123 0.0123 


Figure 7.16 shows the area under the standard Normal curve 
corresponding to these standardized values. Using Table A, the 
desired probability is 
NO29= 9 =0:57)— i\— loa 2= 1.65) 
= 0.9484 — 0.0516 = 0.8968 


Probability = 0.0516 


Probability = 0.8968 


Using technology: The command normalcdf (lower:0.33, 
eee aia upper:0.37, £:0.35, 0:0.0123) givesanarea of 

— -\- = 0.8961. 
ig ae cy Step 3: Answer the question. About 90% of all SRSs of size 


FIGURE 7.16 Probabilities as areas under the standard 1500 will give a result within 2 percentage points of the truth about 
Normal curve. the population. 


For Practice Try Exercise 


Summary 


e =When we want information about the population proportion p of successes, 
we often take an SRS and use the sample proportion f to estimate the un- 
known parameter p. The sampling distribution of f describes how the sam- 
ple proportion varies in all possible samples from the population. 


e The mean of the sampling distribution of f is equal to the population propor- 
tion p. That is, p is an unbiased estimator of p. 


Dilke 


28. 


29. 
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e The standard deviation of the sampling distribution of f is Vp(1 — p)/n 
for an SRS of size n. This formula can be used if the population is at 
least 10 times as large as the sample (the 10% condition). The standard de- 
viation of f gets smaller as the sample size n gets larger. Because of the square 
root, a sample four times larger is needed to cut the standard deviation in half. 


e When the sample size n is large, the sampling distribution of f is close to 
a Normal distribution with mean p and standard deviation Vp(1 — p)/n. 
In practice, use this Normal approximation when both np = 10 and 
n(1 — p) = 10 (the Large Counts condition). 


The candy machine Suppose a large candy machine 
has +5% orange candies. Use Figures 7.1] and 7.12 
(pages 441 and 442) to help answer the following 
questions. 


Would you be surprised if a sample of 25 candies 
from the machine contained 8 orange candies (that’s 
32% orange)? How about 5 orange candies (20% 
orange)? Explain. 


Which is more surprising: getting a sample of 25 
candies in which 32% are orange or getting a sample 
of 50 candies in which 32% are orange? Explain. 


The candy machine Suppose a large candy 
machine has 15% orange candies. Use Figure 7.13 
(page 442) to help answer the following questions. 


Would you be surprised if a sample of 25 candies 
from the machine contained 8 orange candies (that’s 
32% orange)? How about 5 orange candies (20% 
orange)? Explain. 


Which is more surprising: getting a sample of 
25 candies in which 32% are orange or getting a 
sample of 50 candies in which 32% are orange? 
Explain. 


The candy machine Suppose a large candy ma- 
chine has 45% orange candies. Imagine taking an 
SRS of 25 candies from the machine and observing 
the sample proportion f of orange candies. 


What is the mean of the sampling distribution of f? 
Why? 


Find the standard deviation of the sampling dis- 
tribution of p. Check to see if the 10% condition 
is met. 


Exercises 


31. 


Is the sampling distribution of f approximately 
Normal? Check to see if the Large Counts 
condition is met. 


If the sample size were 100 rather than 25, how 
would this change the sampling distribution of f? 


The candy machine Suppose a large candy 
machine has 15% orange candies. Imagine taking an 
SRS of 25 candies from the machine and observing 
the sample proportion f of orange candies. 


What is the mean of the sampling distribution of f? 
Why? 


Find the standard deviation of the sampling distribu- 
tion of p. Check to see if the 10% condition is met. 


Is the sampling distribution of f approximately 
Normal? Check to see if the Large Counts condition 
is met. 


If the sample size were 225 rather than 25, how 
would this change the sampling distribution of f? 


Airport security The ‘Transportation Security Ad- 
ministration (‘I’SA) is responsible for airport safety. 
On some flights, TSA officers randomly select pas- 
sengers for an extra security check before boarding. 
One such flight had 76 passengers— 12 in first class 
and 64 in coach class. TSA officers selected an SRS 
of 10 passengers for screening. Let f be the propor- 
tion of first-class passengers in the sample. 


Is the 10% condition met in this case? Justify your 
answer. 


Is the Large Counts condition met in this case? 
Justify your answer. 
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32. Scrabble In the game of Scrabble, each player 
begins by drawing 7 tiles from a bag containing 
100 tiles. There are 42 vowels, 56 consonants, and 
2 blank tiles in the bag. Cait chooses an SRS of 
7 tiles. Let f be the proportion of vowels in her 
sample. 


(a) Is the 10% condition met in this case? Justify your 
answer. 


(b) Is the Large Counts condition met in this case? 
Justify your answer. 


In Exercises 33 and 34, explain why you cannot use the 
methods of this section to find the desired probability. 


33. Hispanic workers A factory employs 3000 union- 
ized workers, of whom 30% are Hispanic. The 
15-member union executive committee contains 
3 Hispanics. What would be the probability of 
3 or fewer Hispanics if the executive committee 
were chosen at random from all the workers? 


34. Studious athletes A university is concerned about 

the academic standing of its intercollegiate athletes. 
A study committee chooses an SRS of 50 of the 316 
athletes to interview in detail. Suppose that 40% of 
the athletes have been told by coaches to neglect 
their studies on at least one occasion. What is the 
probability that at least 15 in the sample are among 
this group? 


35. Do you drink the cereal milk? A USA Today Poll 
asked a random sample of 1012 U.S. adults what 
they do with the milk in the bowl after they have 
eaten the cereal. Let f be the proportion of people 
in the sample who drink the cereal milk. A spokes- 
man for the dairy industry claims that 70% of all 
U.S. adults drink the cereal milk. Suppose this 


claim is true. 


(a) What is the mean of the sampling distribution of f? 
Why? 


(b) Find the standard deviation of the sampling distribution 
of p. Check to see if the 10% condition is met. 


(c) Is the sampling distribution of f approximately 
Normal? Check to see if the Large Counts condi- 
tion is met. 


(d) Ofthe poll respondents, 67% said that they drink 
the cereal milk. Find the probability of obtaining 
a sample of 1012 adults in which 67% or fewer 
say they drink the cereal milk if the milk industry 
spokesman’s claim is true. Does this poll give con- 
vincing evidence against the claim? Explain. 


36. Do you go to church? ‘The Gallup Poll asked 
a random sample of 1785 adults whether they 


SH 


38. 


40. 


ale 


attended church during the past week. Let fp be the 
proportion of people in the sample who attended 
church. A newspaper report claims that 40% of all 
U.S. adults went to church last week. Suppose this 
claim is true. 


What is the mean of the sampling distribution of p? 
Why? 


Find the standard deviation of the sampling distribution 
of p. Check to see if the 10% condition is met. 


Is the sampling distribution of p approximately 
Normal? Check to see if the Large Counts 
condition is met. 


Of the poll respondents, 44% said they did attend 
church last week. Find the probability of obtaining 
a sample of 1785 adults in which 44% or more say 
they attended church last week if the newspaper 
report’s claim is true. Does this poll give convincing 
evidence against the claim? Explain. 


Do you drink the cereal milk? What sample size 
would be required to reduce the standard deviation 
of the sampling distribution to one-half the value you 
found in Exercise 35(b)? Justify your answer. 


Do you go to church? What sample size would be 
required to reduce the standard deviation of the sam- 
pling distribution to one-third the value you found in 
Exercise 36(b)? Justify your answer. 


Students on diets A sample survey interviews an 
SRS of 267 college women. Suppose that 70% of col- 
lege women have been on a diet within the past 12 
months. What is the probability that 75% or more of 
the women in the sample have been on a diet? Show 
your work. 


Who owns a Harley? Harley-Davidson motorcycles 
make up 14% of all the motorcycles registered in 
the United States. You plan to interview an SRS of 
500 motorcycle owners. How likely is your sample to 
contain 20% or more who own Harleys? Show your 
work. 


On-time shipping A mail-order company adver- 
tises that it ships 90% of its orders within three 
working days. You select an SRS of 100 of the 5000 
orders received in the past week for an audit. The 
audit reveals that 86 of these orders were shipped 
on time. 


If the company really ships 90% of its orders on time, 
what is the probability that the proportion in an SRS 
of 100 orders is 0.86 or less? Show your work. 


A critic says, “Aha! You claim 90%, but in your 
sample the on-time percentage is lower than that. 


So the 90% claim is wrong.” Explain in simple 
language why your probability calculation in 
(a) shows that the result of the sample does not 
refute the 90% claim. 


42. Underage drinking The Harvard College Alcohol 
Study finds that 67% of college students support 
efforts to “crack down on underage drinking.” Does 
this result hold at a large local college? To find out, 
college administrators survey an SRS of 100 students 
and find that 62 support a crackdown on underage 
drinking. 


(a) Suppose that the proportion of all students attending 
this college who support a crackdown is 67%, the 
same as the national proportion. What is the prob- 
ability that the proportion in an SRS of 100 students 
is 0.62 or less? Show your work. 


(b) A writer in the college’s student paper says that 
“support for a crackdown is lower at our school 
than nationally.” Write a short letter to the editor 
explaining why the survey does not support this 
conclusion. 


Multiple choice: Select the best answer for Exercises 43 
to 46. Exercises 43 to 45 refer to the following setting. ‘The 
magazine Sports Illustrated asked a random sample of 750 
Division I college athletes, “Do you believe performance- 
enhancing drugs are a problem in college sports?” Sup- 
pose that 30% of all Division | athletes think that these 
drugs are a problem. Let f be the sample proportion who 
say that these drugs are a problem. 


43. Which of the following are the mean and standard 
deviation of the sampling distribution of the sample 
proportion p? 


(a) Mean = 0.30, SD = 0.017 
(b) Mean = 0.30, SD = 0.55 
(c) Mean = 0.30, SD = 0.0003 
(d) Mean = 225,SD = 12.5 
(e) Mean = 225, SD = 157.5 


44. Decreasing the sample size from 750 to 375 would 
multiply the standard deviation by 


@) 2. (c) 1/2. (e) none of these. 

(b) V2. (dy 1/2. 

45. ‘The sampling distribution of f is approximately 
Normal because 


(a) there are at least 7500 Division I college athletes. 


RV 
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np = 225 and n(1 — p) = 525 are both at least 10. 
a random sample was chosen. 

the athletes’ responses are quantitative. 

the sampling distribution of f always has this shape. 


In a congressional district, 55% of the registered 
voters are Democrats. Which of the following is 
equivalent to the probability of getting less than 50% 
Democrats in a random sample of size 100? 


0.50 — 0.55 
ez ze 100 ) 
0.50 — 0.55 
0.55(0.45) 
100 


0.55 — 0.50 
0.55(0.45) 
100 


oz e005 ) 
100(0.55)(0.45) 


(z.< 0.55 — 0.50 ) 

V100(0.55)(0.45) 
Sharing music online (5.2) A sample survey reports 
that 29% of Internet users download music files 
online, 21% share music files from their computers, 
and 12% both download and share music.’ Make a 
Venn diagram that displays this information. What 


percent of Internet users neither download nor share 
music files? 


California’s endangered animals (4.1) ‘The Califor- 
nia Department of Fish and Game publishes a list of 
the state’s endangered animals. The reptiles on the 
list are given below. 


Desert tortoise Southern rubber boa 

Olive Ridley sea turtle Loggerhead sea turtle 

Island night lizard Barefoot banded gecko 

Flat-tailed horned lizard — Coachella Valley fringe-toed lizard 
Green sea turtle Blunt-nosed leopard lizard 
Leatherback sea turtle Giant garter snake 

Alameda whip snake San Francisco garter snake 


(a) 


(b) 


Describe how you would use ‘Table D at line 111 to 
choose an SRS of 3 of these reptiles to study. 


Use your method from part (a) to select your sample. 
Identify the reptiles you chose. 


450 CHAPTER 7 SAMPLING DISTRIBUTIONS 


73 Sample Means 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Find the mean and standard deviation of the sampling e If appropriate, use a Normal distribution to calculate 
distribution of a sample mean x. Check the 10% condi- probabilities involving x. 


tion before calculating oy. 

Explain how the shape of the sampling distribution of x 
is affected by the shape of the population distribution 
and the sample size. 


Sample proportions arise most often when we are interested in categorical vari- 
ables. We then ask questions like “What proportion of U.S. adults have watched 
Survivor?” or “What percent of the adult population attended church last week?” 
But when we record quantitative variables—household income, lifetime of car 
brake pads, blood pressure —we are interested in other statistics, such as the me- 
dian or mean or standard deviation of the variable. The sample mean x is the 
most common statistic computed from quantitative data. This section describes 
the sampling distribution of the sample mean. The following Activity and the 
subsequent example give you a sense of what lies ahead. 


ACTIVITY | Penny for Your Thoughts 


MATERIALS: Your teacher will assemble a large population of pennies of various ages.° In this 
Large container with several Activity, your class will investigate the sampling distribution of the mean year x 
hundred pennies in a sample of pennies for SRSs of several different sizes. Then, you will compare 
these distributions of the mean year with the population distribution. 
1. Your teacher will provide a dotplot of the population distribution of 
penny years. 
2. Have each member of the class take an SRS of 5 pennies from the 
population and record the year on each penny. Be sure to replace 
these coins in the container before the next student takes a sample. 


If your class has fewer than 25 students, have each person take two 
samples. 


3. Calculate the mean year X of the 5 pennies in your sample. 


4. Make a class dotplot of the sample mean years for SRSs of size 5 
using the same scale as you did for the population distribution. Use 

X’s instead of dots when making the graph. 

5. Repeat the process in Steps 2 to 4 for samples of size 25. Use the same 
scale for your dotplot and place it beside the graph for samples of size 5. 
6. Compare the population distribution with the two approximate sam- 
pling distributions of x. What do you notice about shape, center, and 
spread as the sample size increases? 
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Making Money 


A first look at the sampling distribution of x 


Figure 7.17(a) is a histogram of the earnings of a population of 61,742 households 
that had earned income greater than zero in a recent year.’ 


As we expect, the distribution of earned incomes is strongly skewed to the right 
and very spread out. The right tail of the distribution is even longer than the his- 
togram shows because there are too few high incomes for their bars to be visible 
on this scale. We cut off the earnings scale at $400,000 to save space. The mean 
earnings for these 61,742 households was pu = $69,750. 


‘Take an SRS of 100 households. The mean earnings in this sample is ¥ = $66,807. 
That’s less than the mean of the population. Take another SRS of size 100. The 
mean for this sample is x = $70,820. That’s higher than the mean of the popula- 
tion. What would happen if we did this many times? Figure 7.17(b) is a histogram 
of the mean earnings for 500 samples, each of size n = 100. The scales in Figures 
7.17(a) and 7.17(b) are the same, for easy comparison. Although the distribution 
of individual earnings is skewed and very spread out, the distribution of sample 
means is roughly symmetric and much less spread out. Both distributions are cen- 


tered at pp = $69,750. 


Percent of households 


A 


Household earnings (thousands of dollars) 


100 
mn 


N 
n 
! 


Because both histograms 
use the same scales, you can 
directly compare this graph 
with the one to the left. 


i) 
=] 
l 


Percent of samples 


T A T T T T 
200 300 400 100 200 300 400 


Mean household earnings for samples of size 100 
(thousands of dollars) 


(b) 


FIGURE 7.17 (a) The distribution of earned income in a population of 61,472 households. (b) The 
distribution of the mean earnings x for 500 SRSs of n = 100 households from this population. 


This example illustrates an important fact that we will make precise in this 
section: averages are less variable than individual observations. 


The Sampling Distribution of x: Mean and 
Standard Deviation 


Figure 7.17 suggests that when we choose many SRSs from a population, the sam- 
pling distribution of the sample mean is centered at the population mean yu and is 
less spread out than the population distribution. Here are the facts. 
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MEAN AND STANDARD DEVIATION OF 
THE SAMPLING DISTRIBUTION OF x 


Suppose that x is the mean of an SRS of size n drawn from a large popula- 
tion with mean yu and standard deviation o. Then: 


e ‘The mean of the sampling distribution of x is juz = p. 


e ‘The standard deviation of the sampling distribution of x is 


as long as the 10% condition is satisfied: n < jy N. 


The behavior of ¥ in repeated samples is much like that of the sample pro- 


AP® EXAM TIP Notation ee: 
portion p: 


matters. The symbols p, x, 
D, Ll, O, Lp, Tp, Lx, ANd ox 
all have specific and different e The values of ¥ are less spread out for larger samples. Their standard deviation 
meanings. Either use notation decreases at the rate Vn, so you must take a sample four times as large to cut 
correctly—or don’t use it at all. the standard deviation of the distribution of x in half. 

You can expect to lose credit if 7 
you use incorrect notation. 


e The sample mean X is an unbiased estimator of the population mean p. 


You should use the formula ¢/Vn for the standard deviation of ¥ only when the 
population is at least 10 times as large as the sample (the 10% condition). 


Notice that these facts about the mean and standard deviation of x are true no 
matter what shape the population distribution has. 


This Wine Stinks 


Mean and standard deviation of x 


Sulfur compounds such as dimethyl sulfide (DMS) are sometimes present in wine. 
DMS causes “offodors” in wine, so winemakers want to know the odor threshold, 
the lowest concentration of DMS that the human nose can detect. Extensive stud- 
ies have found that the DMS odor threshold of adults follows a distribution with 
mean {4 = 25 micrograms per liter and standard deviation o = 7 micrograms per 
liter. Suppose we take an SRS of 10 adults and determine the mean odor threshold 
X for the individuals in the sample. 


PROBLEM: 

(a) What is the mean of the sampling distribution of x? Explain. 

(b) What is the standard deviation of the sampling distribution of x? Check that the 10% condition 
is met. 

SOLUTION: 

(a) Because x is an unbiased estimator of j1, jug = [4 = 25 micrograms per liter. 


or 7 
(b) The standard deviationis og = —— = ——— = 2.214 because there are at least 10(10) = 100 
Vn V10 


adults in the population. e 1 


For Practice Try Exercise 


THINK 
ABOUT IT 


ACTIVITY 
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Can we confirm the formulas for the mean and standard 
deviation of X? Choose an SRS of size n from a population, and measure a 
variable X on each individual in the sample. Call the individual measurements 
Xj, X2,..., Xy. If the population is large relative to the sample, we can think of 
these X;’s as independent random variables, each with mean yz and standard devia- 
tion o. Because 


=X LG) 
we can use the rules for random variables from Chapter 6 to find the mean and 
standard deviation of x. If we let T= X; + X, + --- + X,, then x = — 
Using the addition rules for means and variances, we get 
Ler = bx, + pix, Fo + py, = et pte + p= np 
of = of, toh, ts tok Hot tot te et += no" 

= op = Vno? = oVn 

Because x is just a constant multiple of the random variable T, 


1 ] 
by = 


Ser = 7 (ne) = 
l l n l l os 
Ox = Tor = —(oVn) = Ve a 


Sampling from a Normal Population 


We have described the mean and standard deviation of the sampling distribution 
of a sample mean x but not its shape. That’s because the shape of the distribution 
of x depends on the shape of the population distribution. In one important case, 
there is a simple relationship between the two distributions. The following Activ- 
ity shows what we mean. 


Exploring the Sampling Distribution of 


X for a Normal Population 


MATERIALS: 

Computer with Internet 
access—one for the class or 
one per pair of students 


Professor David Lane of Rice University has developed a wonderful applet for 
investigating the sampling distribution of x. It’s dynamic, and it’s fun to play with. 
In this Activity, you’ll use Professor Lane’s applet to explore the shape of the sam- 
pling distribution when the population is Normally distributed. 

1. Search for “online statbook sampling distributions applet” and go to the Web 
site. When the BEGIN button appears on the left side of the screen, click on it. 
You will then see a yellow page entitled “Sampling Distributions” like the one in 
the following figure. 
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OB Sampling Distrlbotions Miley 2. There are choices for the population distribution: 

cir 1600 Normal, uniform, skewed, and custom. The default is 
Normal. Click the “Animated” button. What happens? 
Click the button several more times. What do the black 
boxes represent? What is the blue square that drops 
down onto the plot below? What does the red horizon- 
tal band under the population histogram tell us? 

Look at the left panel. Important numbers are 
displayed there. Did you notice that the colors of the 
numbers match up with the objects to the right? As you 
make things happen, the numbers change accordingly, 
like an automatic scorekeeper. 

3. Click on “Clear lower 3” to start clean. Then click 
on the “1,000” button under “Sample:” repeatedly until 
you have simulated taking 10,000 SRSs of size n = 5 
from the population (look for “Reps = 10000” on the 
left panel in black letters). Answer these questions: 


¢ Does the approximate sampling distribution (blue bars) have a 
recognizable shape? Click the box next to “Fit normal.” 


¢ Compare the mean of the approximate sampling distribution with the 
mean of the population. 
¢ How is the standard deviation of the approximate sampling distribution 
related to the standard deviation of the population? 
4. Click “Clear lower 3.” Use the drop-down menus to set up the bottom graph 
to display the mean for samples of size n = 20. Then sample 10,000 times. How 
do the two distributions of x compare: shape, center, and spread? 
5. What have you learned about the shape of the sampling distribution of x 
when the population has a Normal shape? 


As the previous Activity demonstrates, if the population distribution is Normal, 
then so is the sampling distribution of x. This is true no matter what the sample 
size 1s. 


SAMPLING DISTRIBUTION OF A SAMPLE 
MEAN FROM A NORMAL POPULATION 


We already knew the mean and standard deviation of the sampling distribu- 
tion. All we have added is the Normal shape. Now we have enough informa- 
tion to calculate probabilities involving * when the population distribution is 
Normal. 
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Young Women’s Heights 


Finding probabilities involving the sample mean 
PROBLEM: The height of young women follows a Normal distribution with mean 1 = 64.5 inches 
and standard deviation o = 2.5 inches. 

(a) Find the probability that a randomly selected young woman is taller than 66.5 inches. Show 
your work. 


(b) Find the probability that the mean height of an SRS of 10 young women exceeds 66.5 inches. 
Show your work. 


SOLUTION: 


(a) Step 1: State the distribution and the values of interest. Let X be the height ofa 

randomly selected young woman. The random variable X follows a Normal distribution with 1 = 64.5 

inches and o = 2.5 inches. We want to find P(X > 66.5). Figure 7.18 shows the distribution (purple 

curve) with the area of interest shaded and the mean, standard deviation, and boundary value labeled. 

Step 2: Perform calculations—show your work! The standardized score for the boundary 

66.5 — 64.5 
TRS) 


valueis z = 
0.2119. 


= 0.80. Using Table A, P(X > 66.5) = P(Z> 0.80) = 1 — 0.7881 = 


Using technology: The command normalcdf (lower:66.5, upper:10000, :64.5, 
o:2.5) givesanareaof0.2119. 


Sampling 
distribution 
of x 


Step 3: Answer the question. The probability of choosing 
a young woman at random whose height exceeds 66.5 inches is 
about 0.21. 


(b) Step 1: State the distribution and the values of 
interest. Foran SRS of 10 young women, the sampling distribu- 
tion of their sample mean height x will have mean wz = 44 = 64.5 


Population 

distribution 
inches. The 10% condition is met because there are at least 10(10) = 
100 young women in the population. So the standard deviation is 


= 64.5 in. 66.5 in. Ox 


| oe 72) 
Va V10 


0.79. Because the population distribution 


FIGURE 7.18 The sampling distribution of the mean height x 
for SRSs of 10 young women compared with the population 
distribution of young women’s heights. 


Figure 7.18 compares the population 
distribution and the sampling 
distribution of x. It also shows 

the areas corresponding to the 
probabilities that we computed. You 
can see that it is much less likely for 
the average height of 10 randomly 
selected young women to exceed 
66.5 inches than it is for the height of 
one randomly selected young woman 
to exceed 66.5 inches. 


is Normal, the values of x will follow an N(64.5, 0.79) distribution. We 
want to find P(x > 66.5) inches. Figure 7.18 shows the distribution 
(blue curve) with the area of interest shaded and the mean, standard 
deviation, and boundary value labeled. 


Step 2: Perform calculations—show your work! The standardized score for the boundary 

ae _ 66.5 — 64.5 
0.79 

Using Table A. P(x > 66.5) = P(Z> 2.53) = 1 — 0.9943 = 0.0057. 


Using technology: The command normalcdf (lower:66.5, upper:10000, w:64.5, 
o:0.79) givesan area of 0.0057. 


Zz = Gee. 


Step 3: Answer the question. It is very unlikely (less than a 1% chance) that we would choose 
an SRS of 10 young women whose average height exceeds 66.5 inches. 


For Practice Try Exercise 
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The fact that averages of several observations are less variable than indi- 
vidual observations is important in many settings. For example, it is common 
practice to repeat a measurement several times and report the average of the 
results. Think of the results of n repeated measurements as an SRS from the 
population of outcomes we would get if we repeated the measurement forever. 
The average of the n results (the sample mean X) is less variable than a single 
measurement. 


CHECK YOUR UNDERSTANDING 

The length of human pregnancies from conception to birth varies according to a distribu- 
tion that is approximately Normal with mean 266 days and standard deviation 16 days. 

1. Find the probability that a randomly chosen pregnant woman has a pregnancy that 
lasts for more than 270 days. Show your work. 


Suppose we choose an SRS of 6 pregnant women. Let x = the mean pregnancy length 
for the sample. 


2. What is the mean of the sampling distribution of x? Explain. 


3. Compute the standard deviation of the sampling distribution of x. Check that the 
10% condition is met. 


4. Find the probability that the mean pregnancy length for the women in the sample 
exceeds 270 days. Show your work. 


The Central Limit Theorem 


Most population distributions are not Normal. The household incomes in 
Figure 7.17(a) on page 451, for example, are strongly skewed. Yet Figure 7.17(b) 
suggests that the distribution of means for samples of size 100 is approximately 
Normal. What is the shape of the sampling distribution of x when the popula- 
tion distribution isn’t Normal? The following Activity sheds some light on this 
question. 


ACTIVITY | Exploring the Sampling Distribution of 


X for a Non-Normal Population 


MATERIALS: Let’s use the sampling distributions applet from the previous Activity (page 453) to 
Computer with Internet investigate what happens when we start with a non-Normal population distribution. 
access —one for the class or 1. Go to the Web site and launch the applet. Select “Skewed” population. Set 
one per pair of students the bottom two graphs to display the mean—one for samples of size 2 and the 


other for samples of size 5. Click the Animated button a few times to be sure you 
see what’s happening. Then “Clear lower 3” and take 10,000 SRSs. Describe 
what you see. 


2. Change the sample sizes ton = 10 and n = 16 and repeat Step 1. What do 
you notice? 


Parent populston (can be changed with the mouse) 
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3. Now change the sample sizes ton = 20 andn = 25 
and take 10,000 more samples. Did this confirm what 
Clear lower 3 you saw in Step 2? 


Skewed 


4. Clear the page, and select “Custom” distribution. 
Click on a point on the population graph to insert a 


— bar of that height. Or click on a point on the horizon- 
5 tal axis, and drag up to define a bar. Make a distribu- 
1.000 tion that looks as strange as you can. (Note: You can 
10,000 . . . . 
shorten a bar or get rid of it completely by clicking 
Distribution of Means, N«2 . : 
on the top of the bar and dragging down to the axis.) 
——s Then repeat Steps | to 3 for your custom distribution. 
F Fitnormal Cool, huh? 


Mean ’ 
N=5 - 
 Fitnormal 


of Means, N=5 


5. Summarize what you learned about the shape of 
the sampling distribution of x. 


It is a remarkable fact that as the sample size increases, the sampling distri- 
bution of x changes shape: it looks less like that of the population and more 
like a Normal distribution. When the sample size is large enough, the sampling 
distribution of x is very close to Normal. This is true no matter what shape the 
population distribution has, as long as the population has a finite standard devia- 
tion a. This famous fact of probability theory is called the central limit theorem 
(sometimes abbreviated as CLT). 


DEFINITION: Central limit theorem (CLT) 


Draw an SRS of size n from any population with mean , and finite standard deviation o. 
The central limit theorem (CLT) says that when nis large, the sampling distribution 
of the sample mean x is approximately Normal. 


How large a sample size n is needed for the sampling distribution of x to be 
close to Normal depends on the population distribution. More observations are 
required if the shape of the population distribution is far from Normal. In that 
case, the sampling distribution of x will also be very non-Normal if the 
sample size is small. Be sure you understand what the CLT does—and @ 


doesn’t—say. 


A Strange Population Distribution 
The CLT in action 


We used the sampling distribution applet to create a population distribution with 
a very strange shape. See the graph at the top of the next page. 
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Figure 7.19 below shows the approximate 
sampling distribution of the sample mean 
x for SRSs of size (a) n = 2, (b) n = 5, 
(c) n = 10, and (d) n = 25. As n increases, 
the shape becomes more Normal. For SRSs 
of size 2, the sampling distribution is very 
non-Normal. The distribution of x for 10 


observations is slightly skewed to the right 
but already resembles a Normal curve. By 
n = 25, the sampling distribution is even more Normal. The contrast between 
the shapes of the population distribution and the distribution of the mean when 
n = 10 or 25 is striking. 


Distribution of Means, N=2 


204 304 Distribution of Means, N=5 
745 
596 
447 
298 
149 
0 2 
u + 


Distnbution of Means, N=10 Distribution of Means, N=25 
1560 1812 
1300 1510 
1040 1208 
780 906 
520 604 
260 302 
0 74 0 v4 


FIGURE 7.19 The central limit theorem in action: the distribution of sample means x from a 
strongly non-Normal population becomes more Normal as the sample size increases. (a) The 
distribution of x for samples of size 2. (b) The distribution of x for samples of size 5. (c) The distri- 
bution of x for samples of size 10. (d) The distribution of x for samples of size 25. 


As the previous example illustrates, even when the population distribution 
is very non-Normal, the sampling distribution of ¥ often looks approximately 
Normal with sample sizes as small as n = 25. To be safe, we'll require that n be 
at least 30 to invoke the CLT. With that issue settled, we can now state the Normal/ 
Large Sample condition for sample means. 


NORMAL/LARGE SAMPLE CONDITION FOR SAMPLE MEANS 


e Ifthe population distribution is Normal, then so is the sampling distribu- 
tion of x. This is true no matter what the sample size nis. 


e If the population distribution is not Normal, the central limit theorem 
tells us that the sampling distribution of x will be approximately Normal 
in most cases ifn = 30. 


The central limit theorem allows us to use Normal probability calculations to 
answer questions about sample means from many observations even when the 
population distribution is not Normal. 
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Servicing Air Conditioners 
Calculations using the CLT 


Your company has a contract to perform preventive maintenance on thousands 
of air-conditioning units in a large city. Based on service records from the past 
year, the time (in hours) that a technician requires to complete the work follows a 
strongly right-skewed distribution with = 1 hour and o = | hour. In the coming 
week, your company will service an SRS of 70 air-conditioning units in the city. 
You plan to budget an average of 1.1 hours per unit for a technician to complete 
the work. Will this be enough? 


PROBLEM: Whatis the probability that the average maintenance time x for 70 units exceeds 
1.1 hours? Show your work. 


SOLUTION: 


Step 1: State the distribution and the values of interest. The sampling distribution 
of the sample mean time x spent working on 70 units has 


O (Ae = jh = 1 hour 
(1,0.1 iati 
° standard deviation o; 


0.12 because the 


(el il 
V70 ~V70 


10% condition is met (there are more than 10(70) = 700 air- 
conditioning units in the population) 

* an approximately Normal shape because the Normal/Large Sample 
condition is met: n= 70 = 30 

The distribution of x is therefore approximately N(1, 0.12). We want to 
find P(x > 1.1). Figure 7.20 shows the Normal curve with the area of 


interest shaded and the mean, standard deviation, and boundary value 
Average maintenance time (hours) beled 


1 44 


FIGURE 7.20 The Normal approximation from the central Step 2: Perform calculations—show your work! The stan- 
limit theorem for the average time needed to maintain an air —dardized score for the boundary value is 
conditioner. 11-1 
z= = 0.83 
0.12 
Using TableA, Ax > 1.1) = P(Z> 0.83) = 1 — 0.7967 = 0.2033. 


Using technology: The command normalcdf (lower:1.1, upper:10000, w:1, 
a:0.12) gives anarea of 0.2023. 

Step 3: Answer the question. Ifyou budget 1.1 hours per unit, there is about a 20% chance 
that the technicians will not complete the work within the budgeted time. You will have to decide if 
this risk is worth taking or if you should schedule more time for the work. 


For Practice Try Exercise 


Figure 7.21 on the next page summarizes the facts about the sampling distri- 
bution of x. It reminds us of the big idea of a sampling distribution. Keep taking 
random samples of size n from a population with mean ju. Find the sample mean 
X for each sample. Collect all the x’s and display their distribution: the sampling 
distribution of x. Sampling distributions are the key to understanding statistical 
inference. Keep this figure in mind for future reference. 
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SRS size n 


SRS size n 


SRS size n 


Population 
Mean uU 
Std. dev. 


FIGURE 7.21 The sampling distribution of a sample mean x has mean ,. and standard deviation 
o/\V/n. \t has a Normal shape if the population distribution is Normal. If the population distribu- 
tion isn’t Normal, the sampling distribution of x is approximately Normal if the sample size is 
large enough. 


Building Better Batteries 


— MH ) Refer to the chapter-opening Case Study on page 421. Assum- 

ing the process is working properly, the population distribution of 
: battery lifetimes has mean yp = 17 hours and standard deviation 
a = 0.8. We don’t know the shape of the population distribution. 


4 ‘Wie 1. Make an appropriate graph to display the sample data. 
ip Describe what you see. 

2. Assume that the battery production process is working prop- 
© e N erly. Describe the shape, center, and spread of the sampling 
= distribution of x for random samples of 50 batteries. Justify 
your answers. 


For the random sample of 50 batteries, the average lifetime was x = 16.718 hours. 


3. Find the probability of obtaining a random sample of 50 batter- 
ies with a mean lifetime of 16.718 hours or less if the production 
process is working properly. Show your work. Based on your an- 
swer, do you believe that the process is working properly? Why 

or why not? 


The plant manager also wants to know what proportion p of all the batter- 
ies produced that day lasted less than 16.5 hours, which he has declared “un- 
suitable.” From past experience, about 27% of batteries made at the plant are 
unsuitable. If the manager does not find convincing evidence that the propor- 
tion of unsuitable batteries p produced that day is greater than 0.27, the whole 
batch of batteries will be shipped to customers. 
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4, Assume that the actual proportion of unsuitable batteries pro- 
duced that day is p = 0.27. Describe the shape, center, and spread 
of the sampling distribution of / for random samples of 50 batter- 
ies. Justify your answers. 


For the random sample of 50 batteries, the sample proportion with lifetimes 
less than 16.5 hours was p = 0.32. 


5. Find the probability of obtaining a random sample of 50 batteries 
in which 32% or more of the batteries are unsuitable if p = 0.27. 
Show your work. Based on your answer, should the entire batch 
of batteries be shipped to customers? Why or why not? 


49. 
452 


Summary 


e = When we want information about the population mean ju for some variable, 
we often take an SRS and use the sample mean X to estimate the unknown 
parameter pu. The sampling distribution of x describes how the statistic x 
varies in all possible samples of the same size from the population. 

e The mean of the sampling distribution is yz, so xX is an unbiased estimator 
of pu. 

e The standard deviation of the sampling distribution of ¥ is ¢/Vn for an SRS 
of size n if the population has standard deviation a. That is, averages are less 
variable than individual observations. This formula can be used if the popu- 
lation is at least 10 times as large as the sample (10% condition). 

e Choose an SRS of size n from a population with mean py and standard devia- 
tion a. If the population distribution is Normal, then so is the sampling dis- 
tribution of the sample mean x. If the population distribution is not Normal, 
the central limit theorem (CLT) states that when n is large, the sampling 
distribution of < is approximately Normal. 

e We can use a Normal distribution to calculate approximate probabilities for 
events involving ¥ whenever the Normal/Large Sample condition is met: 


e — Ifthe population distribution is Normal, so is the sampling distribution of x. 


e =6Ifn = 30, the CLT tells us that the sampling distribution of x will be 
approximately Normal in most cases. 


Exercises 


Songs on an iPod David’s iPod has about 10,000 Suppose we choose an SRS of 10 songs from this 
songs. The distribution of the play times for these population and calculate the mean play time x of 
songs is heavily skewed to the right with a mean of these songs. What are the mean and the standard 


225 seconds and a standard deviation of 60 seconds. deviation of the sampling distribution of x? Explain. 
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CHAPTER 7 


Making auto parts A grinding machine in an auto 
parts plant prepares axles with a target diameter 

je = 40.125 millimeters (mm). 'The machine has 
some variability, so the standard deviation of the 
diameters is ¢ = 0.002 mm. The machine operator 
inspects a random sample of 4 axles each hour for 
quality control purposes and records the sample 
mean diameter x. Assuming that the process is work- 
ing properly, what are the mean and standard devia- 
tion of the sampling distribution of x? Explain. 


Songs on an iPod Refer to Exercise 49. How many 
songs would you need to sample if you wanted the 
standard deviation of the sampling distribution of x 
to be 30 seconds? Justify your answer. 


Making auto parts Refer to Exercise 50. How many 
axles would you need to sample if you wanted the 
standard deviation of the sampling distribution of x 
to be 0.0005 mm? Justify your answer. 


Larger sample Suppose that the blood cholesterol 
level of all men aged 20 to 34 follows the Normal 
distribution with mean js = 188 milligrams per deci- 
liter (mg/dl) and standard deviation o = 41 mg/dl. 


Choose an SRS of 100 men from this population. 
Describe the sampling distribution of x. 


Find the probability that x estimates jz within 
+3 mg/dl. (This is the probability that x takes a 
value between 185 and 191 mg/dl.) Show your work. 


Choose an SRS of 1000 men from this population. 
Now what is the probability that x falls within 

+3 mg/dl of 4? Show your work. In what sense is the 
larger sample “better”? 


Dead battery? A car company has found that the 
lifetime of its batteries varies from car to car according 
to a Normal distribution with mean jz = 48 months 
and standard deviation o = 8.2 months. The company 
installs a new brand of battery on an SRS of 8 cars. 


If the new brand has the same lifetime distribution 
as the previous type of battery, describe the sampling 
distribution of the mean lifetime x. 


The average life of the batteries on these 8 cars turns 
out to be X = 42.2 months. Find the probability that 
the sample mean lifetime is 42.2 months or less if the 
lifetime distribution is unchanged. What conclusion 
would you draw? 


Bottling cola A bottling company uses a filling ma- 
chine to fill plastic bottles with cola. The bottles are 
supposed to contain 300 milliliters (ml). In fact, the 
contents vary according to a Normal distribution with 
mean jt = 298 ml and standard deviation o = 3 ml. 


What is the probability that a randomly selected 
bottle contains less than 295 ml? Show your work. 


(b) 


mike 
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What is the probability that the mean contents of six 
randomly selected bottles are less than 295 ml? Show 
your work. 


. Cereal A company’s cereal boxes advertise 


9.65 ounces of cereal. In fact, the amount of cereal 
in a randomly selected box follows a Normal distri- 
bution with mean yp = 9.70 ounces and standard 
deviation o = 0.03 ounces. 


What is the probability that a randomly selected 
box of the cereal contains less than 9.65 ounces of 
cereal? Show your work. 


Now take an SRS of 5 boxes. What is the probability 
that the mean amount of cereal x in these boxes is 
9.65 ounces or less? Show your work. 


What does the CLT say? Asked what the central 
limit theorem says, a student replies, “As you 

take larger and larger samples from a population, the 
histogram of the sample values looks more and more 
Normal.” Is the student right? Explain your answer. 


What does the CLT say? Asked what the central limit 
theorem says, a student replies, “As you take larger and 
larger samples from a population, the spread of the 
sampling distribution of the sample mean decreases.” 
Is the student right? Explain your answer. 


Songs on an iPod Refer to Exercise 49. 


Explain why you cannot safely calculate the probabil- 
ity that the mean play time X is more than 4 minutes 
(240 seconds) for an SRS of 10 songs. 


Suppose we take an SRS of 36 songs instead. Explain 
how the central limit theorem allows us to find the prob- 
ability that the mean play time is more than 240 seconds. 
Then calculate this probability. Show your work. 


Lightning strikes The number of lightning strikes 
on a square kilometer of open ground in a year has 
mean 6 and standard deviation 2.4. ‘The National 
Lightning Detection Network (NLDN) uses auto- 
matic sensors to watch for lightning in a random 
sample of 10 one-square-kilometer plots of land. 


What are the mean and standard deviation of the 
sampling distribution of x, the sample mean number 
of strikes per square kilometer? 


Explain why you cannot safely calculate the prob- 
ability that x <5 based on a sample of size 10. 


Suppose the NLDN takes a random sample of 

n = 50 square kilometers instead. Explain how the 
central limit theorem allows us to find the prob- 
ability that the mean number of lightning strikes per 
square kilometer is less than 5. Then calculate this 
probability. Show your work. 


61. 
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Airline passengers get heavier In response to the 
increasing weight of airline passengers, the Federal 
Aviation Administration (FAA) told airlines to as- 
sume that passengers average 190 pounds in the 
summer, including clothes and carry-on baggage. 
But passengers vary, and the FAA did not specify a 
standard deviation. A reasonable standard deviation 
is 35 pounds. Weights are not Normally distributed, 
especially when the population includes both men 
and women, but they are not very non-Normal. A 
commuter plane carries 30 passengers. 


Explain why you cannot calculate the probability 
that a randomly selected passenger weighs more than 
200 pounds. 


Find the probability that the total weight of 30 ran- 
domly selected passengers exceeds 6000 pounds. Show 
your work. (Hint: ‘To apply the central limit theorem, 
restate the problem in terms of the mean weight.) 


How many people in a car? A study of rush-hour 
traffic in San Francisco counts the number of people 
in each car entering a freeway at a suburban inter- 
change. Suppose that this count has mean 1.5 and 
standard deviation 0.75 in the population of all cars 
that enter at this interchange during rush hours. 


Could the exact distribution of the count be 
Normal? Why or why not? 


‘Traffic engineers estimate that the capacity of the 
interchange is 700 cars per hour. Find the probability 
that 700 randomly selected cars at this freeway en- 
trance will carry more than 1075 people. Show your 
work. (Hint: Restate this event in terms of the mean 
number of people x per car.) 


More on insurance An insurance company claims 
that in the entire population of homeowners, the 
mean annual loss from fire is 42 = $250 and the 
standard deviation of the loss is 7 = $1000. The 
distribution of losses is strongly right-skewed: many 
policies have $0 loss, but a few have large losses. An 
auditor examines a random sample of 10,000 of the 
company’s policies. If the company’s claim is correct, 
what's the probability that the average loss from fire in 
the sample is no greater than $275? Show your work. 


Bad carpet ‘The number of flaws per square yard in 
a type of carpet material varies with mean 1.6 flaws 
per square yard and standard deviation 1.2 flaws per 
square yard. The population distribution cannot be 
Normal, because a count takes only whole-number 
values. An inspector studies a random sample of 200 
square yards of the material, records the number of 
flaws found in each square yard, and calculates x, 
the mean number of flaws per square yard inspected. 
Find the probability that the mean number of flaws 
exceeds 1.8 per square yard. Show your work. 
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Multiple choice: Select the best answer for Exercises 65 
to 68. 


65. 


Scores on the mathematics part of the SAT exam in 
a recent year were roughly Normal with mean 515 
and standard deviation 114. You choose an SRS of 
100 students and average their SAT’ Math scores. 
Suppose that you do this many, many times. Which 
of the following are the mean and standard deviation 
of the sampling distribution of x? 


Mean = 515, SD = 114 

Mean = 515, SD = 114/V100 
Mean = 515/100, SD = 114/100 
Mean = 515/100, SD = 114/100 


Cannot be determined without knowing the 100 
scores. 


. Why is it important to check the 10% condition 


before calculating probabilities involving x? 


To reduce the variability of the sampling distribution 
of xX. 


To ensure that the distribution of x is approximately 
Normal. 


To ensure that we can generalize the results to a 
larger population. 


To ensure that x will be an unbiased estimator of 1. 


To ensure that the observations in the sample are 
close to independent. 


. Anewborn baby has extremely low birth weight 


(ELBW) if it weighs less than 1000 grams. A study 
of the health of such children in later years exam- 
ined a random sample of 219 children. Their mean 
weight at birth was x = 810 grams. This sample 
mean is an unbiased estimator of the mean weight 
ein the population of all ELBW babies, which 
means that 


in all possible samples of size 219 from this 
population, the mean of the values of x will 
equal 810. 


in all possible samples of size 219 from this 
population, the mean of the values of x will equal ju. 


as we take larger and larger samples from this popula- 
tion, x will get closer and closer to pu. 


in all possible samples of size 219 from this 
population, the values of x will have a distribution 
that is close to Normal. 


the person measuring the children’s weights does so 
without any error. 
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68. ‘The number of hours a lightbulb burns before failing 
varies from bulb to bulb. The population distribution 
of burnout times is strongly skewed to the right. The 
central limit theorem says that 


(a) as we look at more and more bulbs, their average 
burnout time gets ever closer to the mean yp for all 


bulbs of this type. 


(b) the average burnout time of a large number of bulbs 
has a sampling distribution with the same shape 
(strongly skewed) as the population distribution. 


(c) the average burnout time of a large number of bulbs 
has a sampling distribution with similar shape but 
not as extreme (skewed, but not as strongly) as the 
population distribution. 


(d) the average burnout time of a large number of bulbs 
has a sampling distribution that is close to Normal. 


(e) the average burnout time of a large number of bulbs 
has a sampling distribution that is exactly Normal. 


Exercises 69 to 72 refer to the following setting. In the lan- 
guage of government statistics, you are “in the labor force” 
if you are available for work and either working or actively 
seeking work. The unemployment rate is the proportion of 
the labor force (not of the entire population) who are unem- 
ployed. Here are data from the Current Population Survey 


= Ve Ws 


as 
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for the civilian population aged 25 years and over in a recent 
year. lhe table entries are counts in thousands of people. 


Highest education Total population Inlabor force Employed 


Didn’t finish high 27,669 12,470 11,408 
school 

High school but no 59,860 37,834 35,857 
college 

Less than bachelor’s 47,556 34,439 32,977 
degree 

College graduate 51,582 40,390 39,293 


Unemployment (1.1) Find the unemployment rate 
for people with each level of education. How does 
the unemployment rate change with education? 


Unemployment (5.1) What is the probability that a 
randomly chosen person 25 years of age or older is in 
the labor force? Show your work. 


Unemployment (5.3) If you know that a randomly 
chosen person 25 years of age or older is a college 
graduate, what is the probability that he or she is in 
the labor force? Show your work. 


Unemployment (5.3) Are the events “in the labor 
force” and “college graduate” independent? Justify 
your answer. 


Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


The principal of a large high school is concerned about 
the number of absences for students at his school. To in- 
vestigate, he prints a list showing the number of absences 
during the last month for each of the 2500 students at the 
school. For this population of students, the distribution of 
absences last month is skewed to the right with a mean of 
p = 1.1 anda standard deviation of o = 1.4. 

Suppose that a random sample of 50 students is selected 
from the list printed by the principal and the sample mean 
number of absences is calculated. 


(a) What is the shape of the sampling distribution of 
the sample mean? Explain. 


(b) What are the mean and standard deviation of the 
sampling distribution of the sample mean? 

(c) What is the probability that the mean number of 
absences in a random sample of 50 students is less 
than 1? 

(d) Because the population distribution is skewed, the 
principal is considering using the median number 
of absences last month instead of the mean number 
of absences to summarize the distribution. Describe 
how the principal could use a simulation to estimate 
the standard deviation of the sampling distribution 
of the sample median for samples of size 50. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 


Chapter Review 


Section 7.1: What Is a Sampling Distribution? 


In this section, you learned the “big ideas” of sampling 
distributions. ‘The first big idea is the difference between 
a statistic and a parameter. A parameter is a number that 
describes some characteristic of a population. A statistic 
estimates the value of a parameter using a sample from 
the population. Making the distinction between a statistic 
and a parameter will be crucial throughout the rest of the 
course. 

The second big idea is that statistics vary. For example, 
the mean weight in a sample of high school students is 
a variable that will change from sample to sample. This 
means that statistics have distributions, but parameters do 
not. The distribution of a statistic in all possible samples 
of the same size is called the sampling distribution of the 
statistic. 

The third big idea is the distinction between the distri- 
bution of the population, the distribution of the sample, 
and the sampling distribution of a sample statistic. Review- 
ing the illustration on page 428 will help you understand 
the difference between these three distributions. When you 
are writing your answers, be sure to indicate which distri- 
bution you are referring to. Don’t make ambiguous state- 
ments like “the distribution will become less variable.” 

The fourth big idea is how to describe a sampling dis- 
tribution. ‘To adequately describe a sampling distribution, 
you need to address shape, center, and spread. If the center 
(mean) of the sampling distribution is the same as the value 
of the parameter being estimated, then the statistic is called 
an unbiased estimator. An estimator is unbiased if it doesn’t 
consistently under- or overestimate the parameter in many 
samples. Ideally, the spread of a sampling distribution will 
be very small, meaning that the statistic provides precise 
estimates of the parameter. Larger sample sizes result in 
sampling distributions with smaller spreads. 


Section 7.2: Sample Proportions 

In this section, you learned about the shape, center, and 
spread of the sampling distribution of a sample propor- 
tion. When the Large Counts condition (np = 10 and 
n(1 —p) = 10) is met, the sampling distribution of f will 


be approximately Normal. The mean of the sampling 
distribution of p is y4j= p, the population proportion. 
As a result, the sample proportion f is an unbiased es- 
timator of the population proportion p. When the 10% 


l 
condition (x < aN) is met, the standard deviation 


of the sampling distribution of the sample proportion is 


ee a eat 
| ear This formula tells us that the variability 
of the distribution of f is smaller when the sample size 
is larger. 


Section 7.3: Sample Means 


In this section, you learned about the shape, center, and 
spread of the sampling distribution of a sample mean. 
When the population is Normal, the sampling distribu- 
tion of x will also be Normal for any sample size. When 
the population is not Normal and the sample size is small, 
the sampling distribution of x will resemble the popula- 
tion shape. However, the central limit theorem says that 
the sampling distribution of x will become approximately 
Normal for larger sample sizes (typically when n = 30), 
no matter what the population shape. When you are using 
a Normal distribution to calculate probabilities involving 
the sampling distribution of x, make sure that the Normal/ 
Large Sample condition is met. 

The mean of the sampling distribution of x is uz = pu, 
the population mean. As a result, the sample mean <x is 
an unbiased estimator of the population mean py. When 


l 
the 10% condition (: < aN) is met, the standard de- 


viation of the sampling distribution of the sample mean is 


= —°_. This formula tells us that the variability of the 
Vn 


distribution of x is smaller when the sample size is larger. 

Finally, when you are using a Normal distribution to 
calculate probabilities involving the sampling distribution 
of f or x, make sure that you (1) state the distribution and 
values of interest, (2) perform calculations—show your 
work, and (3) answer the question. 
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What Did You Learn? 


Learning Objective Section Related Example 


on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Distinguish between a parameter and a statistic. al 425 R7.1 


Use the sampling distribution of a statistic to evaluate a claim about 

a parameter. Wall 427 R7.5, R7.7 
Distinguish among the distribution of a population, the distribution 

of a sample, and the sampling distribution of a statistic. Gall Discussion on 428 RY 
Determine whether or not a statistic is an unbiased estimator of a Discussion on 

population parameter. Fall 430-431; 435 R7.3 
Describe the relationship between sample size and the variability of 

a Statistic. Gall 432 RS 


Find the mean and standard deviation of the sampling distribu- 
tion of a sample proportion p. Check the 10% condition before 


calculating os. UP 445 R7.4 
Determine if the sampling distribution of 6 is approximately Normal. Ue 445 R7.4 
If appropriate, use a Normal distribution to calculate probabilities 

involving /. UL 445 R7.4, R7.5 


Find the mean and standard deviation of the sampling distri- 
bution of a sample mean x. Check the 10% condition before 


calculating o;. 3 452 R7.6 
Explain how the shape of the sampling distribution of x is affected 

by the shape of the population distribution and the sample size. 3 457 R7.6, R7.7 
If appropriate, use a Normal distribution to calculate probabilities 


involving x. R7.6, R7.7 


Chapter 7 Chapter Review Exercises 


These exercises are designed to help you review the impor- nation. Unknown to the producer, 3% of all eggs 
tant ideas and methods of the chapter. shipped had salmonella. Identify the population, 


the parameter, the sample, and the statistic. 
R7.1 Bad eggs Sale of eggs that are contaminated with 


salmonella can cause food poisoning in consum- Exercises R7.2 and R7.3 refer to the following setting. 
ers. A large egg producer takes an SRS of 200 eggs Researchers in Norway analyzed data on the birth 
from all the eggs shipped in one day. The laboratory weights of 400,000 newborns over a 6-year period. The 
reports that 9 of these eggs had salmonella contami- distribution of birth weights is approximately Normal 
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with a mean of 3668 grams and a standard deviation of 
511 grams.* In this population, the range (maximum — 
minimum) of birth weights is 3+17 grams. We used Fathom 
software to take 500 SRSs of size n = 5 and calculate 

the range (maximum — minimum) for each sample. The 
dotplot below shows the results. 


200 
9% 90500°%000? 


8250 
i 
$8 
3h 
38 
83 
8 
aft 
80s 
8s 
#33 
Boe 
809 
CT 
83° 
o> 
63 


OREO 2 
2 ODS D0 OD 00 OO D_0000 00: 


0 500 1000 1500 2000 2500 3000 
Sample range 


R7.2 Birth weights 


(a) Sketch a graph that displays the distribution of birth 
weights for this population. 

(b) Sketch a possible graph of the distribution of birth 
weights for an SRS of size 5. 

(c) In the graph above, there is a dot at approximately 
2750. Explain what this value represents. 


R7.3 Birth weights 


(a) Is the sample range an unbiased estimator of the 
population range? Give evidence from the graph 
above to support your answer. 

(b) Explain how we could decrease the variability of the 
sampling distribution of the sample range. 


R7.4. Do you jog? The Gallup Poll once asked a random 
sample of 1540 adults, “Do you happen to jog?” 
Suppose that the true proportion of all adults who 
jog isp =0.15. 

(a) What is the mean of the sampling distribution of f? 
Justify your answer. 

(b) Find the standard deviation of the sampling dis- 
tribution of 6. Check that the 10% condition is 
met. 

(c) Is the sampling distribution of / approximately 
Normal? Justify your answer. 

(d) Find the probability that between 13% and 17% of a 
random sample of 1540 adults are joggers. 

R7.5 Bag check Thousands of travelers pass through 
the airport in Guadalajara, Mexico, each day. 
Before leaving the airport, each passenger must pass 
through the Customs inspection area. Customs 
agents want to be sure that passengers do not bring 
illegal items into the country. But they do not have 
time to search every traveler’s luggage. Instead, they 
require each person to press a button. Either a red 


(a) 


R7.6 


(a) 


(b) 


(c) 


(d) 


R7.7 
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or a green bulb lights up. If the red light shows, 
the passenger will be searched by Customs agents. 
A green light means “go ahead.” Customs agents 
claim that the proportion of all travelers who will 
be stopped (red light) is 0.30, because the light 
has probability 0.30 of showing red on any push of 
the button. To test this claim, a concerned citizen 
watches a random sample of 100 travelers push the 
button. Only 20 get a red light. 


Assume that the Customs agents’ claim is true. 

Find the probability that the proportion of travelers 
who get a red light is as small as or smaller than the 
result in this sample. Show your work. 

Based on your results in (a), do you believe the 
Customs agents’ claim? Explain. 

IQ tests The Wechsler Adult Intelligence Scale 
(WAIS) is acommon “IQ test” for adults. The 
distribution of WAIS scores for persons over 16 years 
of age is approximately Normal with mean 100 and 
standard deviation 15. 

What is the probability that a randomly chosen 
individual has a WAIS score of 105 or higher? Show 
your work. 

Find the mean and standard deviation of the sam- 
pling distribution of the average WAIS score x for 
an SRS of 60 people. 

What is the probability that the average WAIS score 
of an SRS of 60 people is 105 or higher? Show your 
work. 

Would your answers to any of parts (a), (b), or (c) be 
affected if the distribution of WAIS scores in the adult 
population were distinctly non-Normal? Explain. 
Detecting gypsy moths ‘The gypsy moth is a serious 
threat to oak and aspen trees. A state agriculture 
department places traps throughout the state to 
detect the moths. Each month, an SRS of 50 traps 
is inspected, the number of moths in each trap is 
recorded, and the mean number of moths is cal- 
culated. Based on years of data, the distribution of 
moth counts is discrete and strongly skewed, with a 
mean of 0.5 and a standard deviation of 0.7. 
Explain why it is reasonable to use a Normal distri- 
bution to approximate the sampling distribution of 
x for SRSs of size 50. 

Estimate the probability that the mean number of 
moths in a sample of size 50 is greater than or equal 
to 0.6. 

In a recent month, the mean number of moths 

in an SRS of size 50 was 0.6. Based on this result, 
should the state agricultural department be worried 
that the moth population is getting larger in their 
state? Explain. 
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Chapter 7 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


T7.1 A study of voting chose 663 registered voters at ran- 17.5 ‘The number of undergraduates at Johns Hopkins 


dom shortly after an election. Of these, 72% said they 
had voted in the election. Election records show that 
only 56% of registered voters voted in the election. 
Which of the following statements is true about the 
boldface numbers? 


(a) 
(b) 72% and 56% are both statistics. 

(c) 72% is a statistic and 56% is a parameter. 
(d) 72% is a parameter and 56% is a statistic. 
(e) 


17.2 The Gallup Poll has decided to increase the 
size of its random sample of voters from about 
1500 people to about 4000 people right before 
an election. The poll is designed to estimate the 
proportion of voters who favor a new law banning 
smoking in public buildings. The effect of this 
increase is to 

(a) reduce the bias of the estimate. 

(b) increase the bias of the estimate. 

(c) reduce the variability of the estimate. 

(d 

( 


e 


) increase the variability of the estimate. 

) reduce the bias and variability of the estimate. 

T7.3 Suppose we select an SRS of size n = 100 from a 
large population having proportion p of successes. 
Let fp be the proportion of successes in the sample. 
For which value of p would it be safe to use the 
Normal approximation to the sampling distribu- 
tion of p? 

(a) 0.01 (b) 0.09 (c) 0.85 (d) 0.975 (e) 0.999 

T7.4 The central limit theorem is important in statistics 
because it allows us to use the Normal distribution to 
find probabilities involving the sample mean 

(a) if the sample size is reasonably large (for any 
population). 

(b) if the population is Normally distributed and the 
sample size is reasonably large. 

(c) if the population is Normally distributed (for any 

sample size). 


(d) if the population is Normally distributed and the 
population standard deviation is known (for any 
sample size). 


(e) if the population size is reasonably large (whether the 
population distribution is known or not). 


University is approximately 2000, while the number 
at Ohio State University is approximately 60,000. At 
both schools, a simple random sample of about 3% 
of the undergraduates is taken. Each sample is used 
to estimate the proportion f of all students at that 
university who own an iPod. Suppose that, in fact, 

p = 0.80 at both schools. Which of the following is 


the best conclusion? 

(a) The estimate from Johns Hopkins has less sampling 
variability than that from Ohio State. 

(b) The estimate from Johns Hopkins has more sampling 
variability than that from Ohio State. 

(c) ‘The two estimates have about the same amount of 
sampling variability. 

(d) It is impossible to make any statement about the 


sampling variability of the two estimates because the 
students surveyed were different. 


(e) None of the above. 


17.6 A researcher initially plans to take an SRS of 


size n from a population that has mean 80 and 
standard deviation 20. If he were to double his 
sample size (to 27), the standard deviation of the 
sampling distribution of the sample mean would 
be multiplied by 


(a) V2. (b) 1/VZ. (©) 2. (d) 12. (e) 1/Vin. 


T7.7 The student newspaper at a large university asks an 


SRS of 250 undergraduates, “Do you favor eliminat- 
ing the carnival from the term-end celebration?” 

All in all, 150 of the 250 are in favor. Suppose that 
(unknown to you) 55% of all undergraduates favor 
eliminating the carnival. If you took a very large 
number of SRSs of size n = 250 from this popula- 
tion, the sampling distribution of the sample propor- 
tion p would be 


(a) exactly Normal with mean 0.55 and standard devia- 


tion 0.03. 


(b) approximately Normal with mean 0.55 and standard 
deviation 0.03. 
(c) exactly Normal with mean 0.60 and standard devia- 


tion 0.03. 


(d) approximately Normal with mean 0.60 and standard 
deviation 0.03. 


(e) heavily skewed with mean 0.55 and standard devia- 
tion 0.03. 


17.8 Which of the following statements about the 
sampling distribution of the sample mean is 
incorrect? 


(a) The standard deviation of the sampling distribution 
will decrease as the sample size increases. 


(b) The standard deviation of the sampling distribution 
is a measure of the variability of the sample mean 
among repeated samples. 


(c) ‘The sample mean is an unbiased estimator of the 
population mean. 


(d) The sampling distribution shows how the sample 
mean will vary in repeated samples. 


(e) The sampling distribution shows how the sample 
was distributed around the sample mean. 


T7.9 A machine is designed to fill 16-ounce bottles of 
shampoo. When the machine is working prop- 
erly, the amount poured into the bottles follows 
a Normal distribution with mean 16.05 ounces 
and standard deviation 0.1 ounce. Assume that 
the machine is working properly. If four bottles 
are randomly selected and the number of ounces 
in each bottle is measured, then there is about 
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a 95% chance that the sample mean will fall in 
which of the following intervals? 


(a) 16.05 to 16.15 ounces (d) 15.90 to 16.20 ounces 
(b) 16.00 to 16.10 ounces (e) 15.85 to 16.25 ounces 
(c) 15.95 to 16.15 ounces 


17.10 Suppose that you are a student aide in the 
library and agree to be paid according to the 
“random pay” system. Each week, the librar- 
ian flips a coin. If the coin comes up heads, 
your pay for the week is $80. If it comes up 
tails, your pay for the week is $40. You work for 
the library for 100 weeks. Suppose we choose 
an SRS of 2 weeks and calculate your average 
earnings x. The shape of the sampling distribu- 
tion of x will be 


a) Normal. 


(a) 

(b) approximately Normal. 

(c) right-skewed. 

(d) 
) 


d) left-skewed. 


(e) symmetric but not Normal. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T7.11 Below are histograms of the values taken by three 
sample statistics in several hundred samples from 
the same population. The true value of the popula- 
tion parameter is marked with an arrow on each 
histogram. 


Which statistic would provide the best estimate of 
the parameter? Justify your answer. 


17.12 The amount that households pay service providers 
for access to the Internet varies quite a bit, but the 


mean monthly fee is $38 and the standard devia- 
tion is $10. The distribution is not Normal: many 
households pay a base rate for low-speed access, 
but some pay much more for faster connections. A 
sample survey asks an SRS of 500 households with 
Internet access how much they pay. Let x be the 
mean amount paid. 


— 
~ 
ma 


Explain why you can’t determine the probability 
that the amount a randomly selected household 
pays for access to the Internet exceeds $39. 


(b) What are the mean and standard deviation of the 
sampling distribution of x? 


(c) What is the shape of the sampling distribution of x? 
Justify your answer. 


(d) Find the probability that the average fee paid by 
the sample of households exceeds $39. Show your 
work. 


17.13 According to government data, 22% of American 
children under the age of six live in households 
with incomes less than the official poverty level. 

A study of learning in early childhood chooses 

an SRS of 300 children. Find the probability that 
more than 20% of the sample are from poverty-level 
households. Be sure to check that you can use the 
Normal approximation. 
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SAMPLING DISTRIBUTIONS 


Cumulative AP® Practice Test 2 


Section I: Multiple Choice Choose the best answer for each question. 


AP2.1 The five-number summary for a data set is given 
by min = 5, Q; = 18, median = 20, Q; = 40, 
max = 75. If you wanted to construct a boxplot for 
the data set (that is, one that would show outliers, if 
any existed), what would be the maximum possible 
length of the right-side “whisker”? 
(a) 33 (b) 35 (c) 45 (d) 53 (e) 55 


AP2.2 The probability distribution for the number of 
heads in four tosses of a coin is given by 


Number of heads: 0 1 2 3 4 
Probability: 0.0625 0.2500 0.3750 0.2500 0.0625 


The probability of getting at least one tail in four 
tosses of a coin is 


(a) 0.2500. (c) 0.6875. 
(b) 0.3125. (d) 0.9375. 


AP2.3 Ina certain large population of adults, the distribu- 
tion of IQ scores is strongly left-skewed with a mean 
of 122 and a standard deviation of 5. Suppose 200 
adults are randomly selected from this population 
for a market research study. The distribution of the 
sample mean of IQ scores is 


(e) 0.0625. 


left-skewed with mean 122 and standard deviation 
0.35. 

(b) exactly Normal with mean 122 and standard 
deviation 5. 


— 
fp 
<7 


(c) exactly Normal with mean 122 and standard 
deviation 0.35. 

(d) approximately Normal with mean 122 and standard 
deviation 5. 

(e) approximately Normal with mean 122 and standard 
deviation 0.35. 


AP2.4 A 10-question multiple-choice exam offers 5 
choices for each question. Jason just guesses the 
answers, so he has probability 1/5 of getting any one 
answer correct. You want to perform a simulation to 
determine the number of correct answers that Jason 
gets. One correct way to use a table of random 
digits to do this is the following: 

(a) One digit from the random digit table simulates one 
answer, with 5 = right and all other digits = wrong. 
‘Ten digits from the table simulate 10 answers. 


S 


One digit from the random digit table simulates one an- 
swer, with 0 or 1 = right and all other digits = wrong. 
‘Ten digits from the table simulate 10 answers. 


(c 


WH 


One digit from the random digit table simulates 
one answer, with odd = right and even = wrong. 
Ten digits from the table simulate 10 answers. 


(d) One digit from the random digit table simulates 
one answer, with 0 or | = right and all other 
digits = wrong, ignoring repeats. Ten digits from 
the table simulate 10 answers. 

(e) ‘Two digits from the random digit table simulate 
one answer, with 00 to 20 = right and 21 to 99 = 
wrong. Ten pairs of digits from the table simulate 10 
answers. 


AP2.5 Suppose we roll a fair die four times. The probabil- 
ity that a 6 occurs on exactly one of the rolls is 


IN 2f5Ne TNE /S\2 Noe 
wy) o)G) ©48)() 
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wa) o@@ 
AP2.6 You want to take an SRS of 50 of the 816 students 
who live in a dormitory on a college campus. You 


label the students 001 to 816 in alphabetical order. 
In the table of random digits, you read the entries 


95592 94007 69769 33547 72450 16632 81194 14873 


The first three students in your sample have labels 
(ay 953, 929 00! (a): 929° 400, 769. 
(b) 400, 769, 769. (e) 400, 769, 335. 
(c)) 359, 294, 007, 


AP2.7 The number of unbroken charcoal briquets in a 
20-pound bag filled at the factory follows a Nor- 
mal distribution with a mean of 450 briquets and 
a standard deviation of 20 briquets. ‘The company 
expects that a certain number of the bags will be 
underfilled, so the company will replace for free 
the 5% of bags that have too few briquets. What is 
the minimum number of unbroken briquets the 
bag would have to contain for the company to avoid 
having to replace the bag for free? 

(a) 404 (ball «=6((e) 418 (dd) 425) (a) 448 

AP2.8 You work for an advertising agency that is preparing a 
new television commercial to appeal to women. You 
have been asked to design an experiment to compare 
the effectiveness of three versions of the commercial. 
Each subject will be shown one of the three ver- 
sions and then asked about her attitude toward the 
product. You think there may be large differences 
between women who are employed and those who 
are not. Because of these differences, you should use 

(a) a block design, but not a matched pairs design. 
(b) a completely randomized design. 
(c) a matched pairs design. 


(d) 
(e) 
AP2.9 


(a) 


(b) 


(e) 


a simple random sample. 


a stratified random sample. 


Suppose that you have torn a tendon and are fac- 
ing surgery to repair it. The orthopedic surgeon 
explains the risks to you. Infection occurs in 3% of 
such operations, the repair fails in 14%, and both 
infection and failure occur together 1% of the 
time. What is the probability that the operation is 
successful for someone who has an operation that 
is free from infection? 


0.8342 (c) 0.8600 
0.8400 (d) 0.8660 


Social scientists are interested in the association 
between high school graduation rate (HSGR, 
measured as a percent) and the percent of U.S. 
families living in poverty (POV). Data were 
collected from all 50 states and the District of Co- 
lumbia, and a regression analysis was conducted. 


(e) 0.9900 


The resulting least-squares regression line is given 
— 

by POV = 59.2 — 0.620(HSGR) with r? = 0.802. 

Based on the information, which of the following 

is the best interpretation for the slope of the least- 

squares regression line? 


For each 1% increase in the graduation rate, the 
percent of families living in poverty is predicted to 
decrease by approximately 0.896. 

For each 1% increase in the graduation rate, the 
percent of families living in poverty is predicted to 
decrease by approximately 0.802. 

For each 1% increase in the graduation rate, the 
percent of families living in poverty is predicted to 
decrease by approximately 0.620. 

For each 1% increase in the percent of families 
living in poverty, the graduation rate is predicted 
to increase by approximately 0.802. 

For each 1% increase in the percent of families 
living in poverty, the graduation rate is predicted 
to decrease by approximately 0.620. 


Here is a dotplot of the adult literacy rates in 177 countries in 
a recent year, according to the United Nations. For example, 
the lowest literacy rate was 23.6%, in the African country of 
Burkina Faso. Mali had the next lowest literacy rate at 24.0%. 
Use the graph to answer Questions AP2.11 to AP2.13. 
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The overall shape of this distribution is 

clearly skewed to the right. 

clearly skewed to the left. 

roughly symmetric. 

uniform. 

There is no clear shape. 

The mean of this distribution (don’t try to find it) 
will be 

very close to the median. 

greater than the median. 

less than the median. 

You can’t say, because distribution isn’t symmetric. 
You can’t say, because the distribution isn’t Normal. 
Based on the shape of this distribution, what 
measures of center and spread would be most ap- 
propriate to report? 

The mean and standard deviation 

The mean and the interquartile range 

The median and the standard deviation 

The median and the interquartile range 

The mean and the range 

The correlation between the age and height of 
children under the age of 12 is found to be 

r = 0.60. Suppose we use the age x of a child to 
predict the height y of the child. What can we 
conclude? 

The height is generally 60% of a child’s weight. 
About 60% of the time, age will accurately predict 
height. 

‘Thirty-six percent of the variation in height is account- 
ed for by the linear model relating height to age. 

For every | year older a child is, the regression line 
predicts an increase of 0.6 feet in height. 

Thirty-six percent of the time, the least-squares re- 
gression line accurately predicts height from age. 


An agronomist wants to test three different types 
of fertilizer (A, B, and C) on the yield of a new 
variety of wheat. The yield will be measured in 
bushels per acre. Six l-acre plots of land were 
randomly assigned to each of the three fertilizers. 
The treatment, experimental unit, and response 
variable are, respectively, 

a specitic fertilizer, bushels per acre, a plot of land. 
a plot of land, bushels per acre, a specific fertilizer. 
random assignment, a plot of land, wheat yield. 

a specific fertilizer, a plot of land, wheat yield. 

a specific fertilizer, the agronomist, wheat yield. 
According to the U.S. Census, the proportion of 


adults in a certain county who owned their own 
home was 0.71. An SRS of 100 adults in a certain 
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section of the county found that 65 owned their 
home. Which one of the following represents the 
approximate probability of obtaining a sample 

of 100 adults in which fewer than 65 own their 
home, assuming that this section of the county 
has the same overall proportion of adults who own 
their home as does the entire county? 


100 06s 071 
7029)" (dye 
a (oe) y(0.29)— (d) (0.61035) 
100 
100 0165 = 0.71 
(b) ( gs) 029)(0.71)" (ep Piz = (0.70.29) 
V100 
ce) pf z <0 =O 
(0.71)(0.29) 
100 


AP2.17 Which one of the following would be a correct inter- 
pretation if you have a z-score of +2.0 on an exam? 


It means that you missed two questions on the exam. 


It means that you got twice as many questions cor- 
rect as the average student. 


It means that your grade was 2 points higher than 
the mean grade on this exam. 

It means that your grade was in the upper 2% of 
all grades on this exam. 


It means that your grade is 2 standard deviations 
above the mean for this exam. 


Records from a random sample of dairy farms 
yielded the information below on the number of 
male and female calves born at various times of 


the day. 
Day Evening Night Total 
Males 129 15 anv 261 
Females 118 18 116 252 
Total 247 33 233 513 


What is the probability that a randomly selected 


calf was born in the night or was a female? 


(a) 369 (b) 485 116 116 116 
Pee ee SOE ea 
AP2.19 When people order books from a popular online 


source, they are shipped in standard-sized boxes. 
Suppose that the mean weight of the boxes is 1.5 
pounds with a standard deviation of 0.3 pounds, 
the mean weight of the packing material is 0.5 
pounds with a standard deviation of 0.1 pounds, 
and the mean weight of the books shipped is 12 
pounds with a standard deviation of 3 pounds. 


SAMPLING DISTRIBUTIONS 


Assuming that the weights are independent, what 
is the standard deviation of the total weight of the 
boxes that are shipped from this source? 


(a) 1.84 (c) 3.02 (e) 9.10 
(b) 2.60 (d) 3.40 


AP2.20 A grocery chain runs a prize game by giving each 
customer a ticket that may win a prize when the 
box is scratched off. Printed on the ticket is a 
dollar value ($500, $100, $25) or the statement 
“This ticket is not a winner.” Monetary prizes can 
be redeemed for groceries at the store. Here is the 
probability distribution of the amount won on a 
randomly selected ticket: 


Amount won: 
Probability: 


$500 
0.01 


$100 
0.05 


$25 
0.20 


$0 
0.74 


Which of the following are the mean and standard 


deviation, respectively, of the winnings? 


(a) $15.00, $2900.00 
(b) $15.00, $53.85 
(c) $15.00, $26.93 
(d) $156.25, $53.85 
(e) $156.25, $26.93 
AP2.21 A large company is interested in improving the 
efficiency of its customer service and decides to 
examine the length of the business phone calls 
made to clients by its sales staff. A cumulative 
relative frequency graph is shown below from 
data collected over the past year. According to the 
graph, the shortest 80% of calls will take how long 
to complete? 
Length of Phone Calls 
100 
80 
« 0 
de 
20 
0 
Cte nA TEL J NTDLC © TNLLU EL AdTG 
Minutes 


(a) Less than 10 minutes 
(b) At least 10 minutes 

( 
(d) At least 5.5 minutes 


(e) Less than 5.5 minutes 


c) Exactly 10 minutes 
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Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


AP2.22 


AP2.23 


A health worker is interested in determining if 
omega-3 fish oil can help reduce cholesterol in 
adults. She obtains permission to examine the 
health records of 200 people in a large medical 
clinic and classifies them according to whether 

or not they take omega-3 fish oil. She also obtains 
their latest cholesterol readings and finds that the 
mean cholesterol reading for those who are taking 
omega-3 fish oil is 18 points lower than the mean 
for the group not taking omega-3 fish oil. 


Is this an observational study or an experiment? 
Justify your answer. 


Explain the concept of confounding in the context 
of this study and give one example of a variable 
that could be confounded with whether or not 
people take omega-3 fish oil. 


Researchers find that the 18-point difference in 
the mean cholesterol readings of the two groups 
is statistically significant. Can they conclude that 
omega-3 fish oil is the cause? Why or why not? 


There are four major blood types in humans: O, 
A, B, and AB. Ina study conducted using blood 
specimens from the Blood Bank of Hawaii, indi- 
viduals were classified according to blood type and 
ethnic group. The ethnic groups were Hawaiian, 
Hawaiian-White, Hawaiian-Chinese, and White. 
Suppose that a blood bank specimen is selected at 
random. 


Ethnic Group 


Hawaiian- 
Chinese 


Hawaiian- 


Hawaiians White White Total 


Total 


1903 
2490 
178 
99 
4670 


4469 
4671 
606 
236 
9982 


2206 
2368 
568 
243 
5385 


53,759 
50,008 
16,252 
5001 
125,020 


62,337 
59,537 
17,604 
5579 
145,057 


Find the probability that the specimen contains 
type O blood or comes from the Hawaiian-Chinese 
ethnic group. Show your work. 

What is the probability that the specimen contains 
type AB blood, given that it comes from the Hawai- 
ian ethnic group? Show your work. 

Are the events “type B blood” and “Hawaiian 
ethnic group” independent? Give appropriate 
statistical evidence to support your answer. 

Now suppose that two blood bank specimens are 
selected at random. Find the probability that at 


AP2.24 


least one of the specimens contains type A blood 
from the White ethnic group. 


Every 17 years, swarms of cicadas emerge from 
the ground in the eastern United States, live for 
about six weeks, and then die. (‘There are sev- 

eral different “broods,” so we experience cicada 
eruptions more often than every 17 years.) There 
are so many cicadas that their dead bodies can 
serve as fertilizer and increase plant growth. Ina 
study, a researcher added 10 dead cicadas under 
39 randomly selected plants in a natural plot of 
American bellflowers on the forest floor, leaving 
other plants undisturbed. One of the response 
variables measured was the size of seeds produced 
by the plants. Here are the boxplots and summary 
statistics of seed mass (in milligrams) for 39 cicada 
plants and 33 undisturbed (control) plants: 


Variable: 
Cicada plants: 


Control plants: 33 


Cicada Plants 4 


—_-}+— 


a — + r r 
0.15 0.20 0.25 0.30 0.35 


Seed Mass (milligrams) 
n Minimum Q, Median Q3; Maximum 
39 On 7 Ox.22° 0825: 10.218 0235 
0.14 ORTOT N02 5: HOR2'6 OR29 


(a) Write a few sentences comparing the distributions 


SNS 


of seed mass for the two groups of plants. 

Based on the graphical displays, which distribu- 
tion has the larger mean? Justify your answer. 
Explain the purpose of the random assignment in 
this study. 


Name one benefit and one drawback of only using 
American bellflowers in the study. 


In a city library, the mean number of pages in 

a novel is 525 with a standard deviation of 200. 
Approximately 30% of the novels have fewer than 
400 pages. Suppose that you randomly select 50 
novels from the library. 


What is the probability that the total number of 
pages is fewer than 25,000? Show your work. 


What is the probability that at least 20 of the nov- 
els have fewer than 400 pages? Show your work. 


Chapter 


Confidence Intervals: 
The Basics 


Estimating a Population 
Proportion 


Estimating a Population Mean 
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Estimating with 
Confidence 


Need Help? Give Us a Call! 


If your cable television goes out, you phone the cable company to get it fixed. Does a real person an- 
swer your call? These days, probably not. It is far more likely that you will get an automated response. 
You will probably be offered several options, such as: to order cable service, press 1; for questions 


about your bill, press 2; to add new channels, 
press 3; (and finally) to speak with a customer ser- 
vice agent, press 4. Customers will get frustrated 
if they have to wait too long before speaking to a 
live person. So companies try hard to minimize 
the time required to connect to a customer ser- 
vice representative. 

A large bank decided to study the call response 
times in its customer service department. The 
bank’s goal was to have a representative answer an 
incoming call in less than 30 seconds. Figure 8.1 is a 
histogram of the response times in a random sample 
of 241 calls to the bank’s customer service center in 
a given month. What does the graph suggest about 
how well the bank is meeting its goal? 


Frequency 


0.0 75 15.0 225 30.0 37:5 45.0 
Call response time (seconds) 
FIGURE 8.1 Histogram showing the response time (in seconds) 
at a bank’s customer service center for a random sample of 
241 calls. 
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ACTIVITY 


MATERIALS: 


TI-83/84 or TI-89 with 
display capability 


NORMAL FIX2 AUTO REAL RADIAN CL f 


mean(randNorm(M,20,16)) 


Introduction 


How long does a new model of laptop battery last? What proportion of college 
undergraduates have engaged in binge drinking? How much does the weight of a 
quarter-pound hamburger at a fast-food restaurant vary after cooking? These are 
the types of questions we would like to be able to answer. 

It wouldn’t be practical to determine the lifetime of every laptop battery, to ask all 
college undergraduates about their drinking habits, or to weigh every burger after 
cooking. Instead, we choose a sample of individuals (batteries, college students, 
burgers) to represent the population and collect data from those individuals. Our 
goal in each case is to use a sample statistic to estimate an unknown population 
parameter. From what we learned in Chapter 4, if we randomly select the sample, 
we should be able to generalize our results to the population of interest. 

We cannot be certain that our conclusions are correct—a different sample 
would probably yield a different estimate. Statistical inference uses the language 
of probability to express the strength of our conclusions. Probability allows us to 
take chance variation due to random selection or random assignment into ac- 
count. The following Activity gives you an idea of what lies ahead. 


The Mystery Mean 


In this Activity, each team of three to four students will try to estimate the mystery 
value of the population mean ju that your teacher entered before class.! 


1. Before class, your teacher will store a value of ju (represented by M) in the 
display calculator. The teacher will then clear the home screen so you can’t see 
the value of M. 


2. With the class watching, the teacher will execute the following command: 
mean (randNorm(M,20,16)). 


This tells the calculator to choose an SRS of 16 observations from a Normal popu- 
lation with mean M and standard deviation 20 and then compute the mean x of 
those 16 sample values. Is the sample mean shown likely to be equal to the mys- 
tery mean M? Why or why not? 


3. Now for the challenge! Your group must determine an interval of reason- 
able values for the population mean ju. Use the result from Step 2 and what you 
learned about sampling distributions in the previous chapter. 


4. Share your team’s results with the class. 


In this chapter and the next, we will meet the two most common types of for- 
mal statistical inference. Chapter 8 concerns confidence intervals for estimating 
the value of a parameter. Chapter 9 presents significance tests, which assess the 
evidence for a claim about a parameter. Both types of inference are based on the 
sampling distributions of statistics. That is, both report probabilities that state what 
would happen if we used the inference method many times. 
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Section 8.1 examines the idea of a confidence interval. We start by pre- 
senting the reasoning of confidence intervals in a general way that applies to 
estimating any unknown parameter. In Section 8.2, we show how to estimate 
a population proportion. Section 8.3 focuses on confidence intervals for a 
population mean. 


Confidence Intervals: 
The Basics 


WHAT YOU WILL LEARN By the end of the section, you should be able to: 


Determine the point estimate and margin of error from e Describe how the sample size and confidence level 
a confidence interval. affect the length of a confidence interval. 


Interpret a confidence interval in context. Explain how practical issues like nonresponse, under- 
Interpret a confidence level in context. coverage, and response bias can affect the interpreta- 
tion of a confidence interval. 


Mr. Schiel’s class did the mystery mean Activity from the Introduction. The TI 
screen shot displays the information that the students received about the unknown 


population mean jj. Here is a summary of what the class said about the calculator 


mean(randNorm(M, 20, eth output: 


e The population distribution is Normal and its standard deviation is 0 = 20. 

e Asimple random sample ofn = 16 observations was taken from this population. 
e The sample mean is * = 240.80. 

If we had to give a single number to estimate the value of M that Mr. Schiel 
chose, what would it be? Such a value is known as a point estimate. How about 
240.80? That makes sense, because the sample mean x is an unbiased estimator 


of the population mean j. We are using the statistic X as a point estimator of the 
parameter LU. 


DEFINITION: Point estimator and point estimate 


A point estimator is a statistic that provides an estimate of a population parameter. 
The value of that statistic from a sample is called a point estimate. 


As we saw in Chapter 7, the ideal point estimator will have no bias and 
low variability. Here’s an example involving some of the more common point 
estimators. 
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From Batteries to Smoking 


Point estimators 


PROBLEM: Ineach of the following settings, determine the point estimator you would use and 
calculate the value of the point estimate. 


(a) Quality control inspectors want to estimate the mean lifetime 1 of the AA batteries produced 
inan hour at a factory. They select a random sample of 50 batteries during each hour of production 

and then drain them under conditions that mimic normal use. Here are the lifetimes (in hours) of the 
batteries from one such sample: 

16.73 15.60 16.31 17.57 16.14 17.28 16.67 17.28 17.27 17.50 15.46 16.50 16.19 

15.59 17.54 16.46 15.63 16.82 17.16 16.62 16.71 16.69 17.98 16.36 17.80 16.61 

15.99 15.64 17.20 17.24 16.68 16.55 17.48 15.58 17.61 15.98 16.99 16.93 16.01 

17.54 17.41 16.91 16.60 16.78 15.75 17.31 16.50 16.72 17.55 16.46 


(b) What proportion pofU.S. high school students smoke? The 2011 Youth Risk Behavioral Survey 
questioned a random sample of 15,425 students in grades 9 to 12. Of these, 2792 said they had 
smoked cigarettes at least one day in the past month. 


PE VV ey 
ALKALINE 


(c) The quality control inspectors in part (a) want to investigate the variability in battery lifetimes 


by estimating the population variance 0”. 


SOLUTION: 


(a) Usethe sample mean x asa point estimator for the population mean ju. For these data, our 
point estimate is X = 16.716 hours. 


(b) Use the sample proportion p as a point estimator for the population proportion p. For this 


: Be IE? 
survey, our point estimate is p = even 0.181. 


(c) Use the sample variance sf as a point estimator for the population variance 07. For the battery 
life data, our point estimate is 5; = 0.441 hours”. 


For Practice Try Exercises and 


The Idea of a Confidence Interval 
Is the value of the population mean yu that Mr. Schiel entered in his calculator 


mean(randNorm(M,20,16)) exactly 240.80? Probably not. Because x = 240.80, we guess that uz is “somewhere 
eiiaeomo renee: ove 240289. around 240.80.” How close to 240.80 is pz likely to be? 
To answer this question, we ask another: How would the sample mean X vary if 
we took many SRSs of size 16 from this same population? 
The sampling distribution of x describes how the values of ¥ vary in repeated 
samples. Recall the facts about this sampling distribution from Chapter 7: 


Shape: Because the population distribution is Normal, so is the sampling distribu- 
tion of x. Thus, the Normal/Large Sample condition is met. 

Center: The mean of the sampling distribution of x is the same as the unknown 
mean of the entire population. That is, uw; = pu. 

Spread: The standard deviation of the sampling distribution of x for samples of 
sizen = l6is 
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because the 10% condition is met—we are sampling from an infinite population 
in this case. 
Figure 8.2 summarizes these facts. 


Sampling 


distribution 
of x 
2\6, ¥=240.80 
o=20 sRst ‘< Ll 
SRS N= xX = 246.05 (unknown) 
SRS n= 16, ¥ = 248.85 
1 
u ~— Values of x —~ 
Population 


FIGURE 8.2 The sampling distribution of the mean score xX for SRSs of 16 observations from a 
Normally distributed population with unknown mean y and standard deviation o = 20. 


The next example gives the reasoning of statistical estimation in a nutshell. 


The Mystery Mean 


Moving beyond a point estimate 
When Mr. Schiel’s class discussed the results of the mystery mean Activity, stu- 


PT TTT dents used the following logic to come up with an “interval estimate” for the un- 
240.88 known population mean ju. 

1. The sample mean x = 240.80 is our point estimate for j1. We don’t expect X 

to be exactly equal to jz, so we want to say how precise this estimate is. 


2. In repeated samples, the values of * follow a Normal distribution with mean ju 
and standard deviation 5, as in Figure 8.2. 


3. The 95 part of the 68—95—99.7 rule for Normal distributions says that ¥ is with- 
in 2(5) = 10 (that’s 2 standard deviations) of the popula- 
tion mean p in about 95% of all samples of size n = 16. 


Sampling See Figure 8.3. 
distribution 
of x 


4. Whenever x is within 10 points of ju, 44 is within 10 
points of x. This happens in about 95% of all possible sam- 
ples. So the interval from ¥ — 10 to x + 10 “captures” the 
population mean yz in about 95% of all samples of size 16. 


Probability ~ 0.95 5. If we estimate that yu lies somewhere in the interval 
from 


x — 10 = 240.80 — 10 = 230.80 


to 


oa Z +10 = 240.80 + 10 = 250.80 


FIGURE 8.3 In about 95% of all samples, xX lies within +10 | We'd be calculating this interval using a method that cap- 


of the unknown population mean jc. So jz also lies within +10 _ tures the true yz in about 95% of all possible samples of 
of x in those samples. this size. 
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A confidence interval is sometimes 
referred to as an interval estimate. This 
is consistent with our earlier use of the 
term point estimate. 


The big idea is that the sampling distribution of x tells us how close to yu the 
sample mean X is likely to be. Statistical estimation just turns that information 
around to say how close to x the unknown population mean 1 is likely to be. In 
the mystery mean example, the value of ju is usually within 2(5) = 10 of X for 
SRSs of size 16. Because the class’s sample mean was x = 240.80, the interval 
240.80 + 10 gives an approximate 95% confidence interval for ju. 

There are several ways to write a confidence interval. We can give the interval for 
Mr. Schiel’s mystery mean as 240.80 + 10, as 230.80 to 250.80, or as (230.80, 250.80). 


All the confidence intervals we will meet have a form similar to this: 
point estimate + margin of error 


The point estimate (x = 240.80 in our example) is our best guess for the value of 
the unknown parameter. The margin of error, 10, shows how close we believe our 
guess is, based on the variability of the estimate in repeated SRSs of size 16. We 
say that our confidence level is about 95% because the interval x + 10 catches 
the unknown parameter in about 95% of all possible samples. 


TT 


DEFINITION: Confidence interval, margin of error, confidence level 


A C% confidence interval gives an interval of plausible values for a parameter. The 
interval is calculated from the data and has the form 


point estimate + margin of error 


The difference between the point estimate and the true parameter value will be less 
than the margin of error in C% of all samples. 


The confidence level C gives the overall success rate of the method for calculating 
the confidence interval. That is, in C% of all possible samples, the method would 
yield an interval that captures the true parameter value. 


The interval from 230.80 to 250.80 gives the set of plausible values for 
Mr. Schiel’s mystery mean jz with 95% confidence. We wouldn’t be surprised if 
any of the values in this interval turned out to be the actual value of yu. 

Plausible does not mean the same thing as possible. You could argue 
that just about any value of a parameter is possible. A plausible value of a @ 
parameter is a reasonable or believable value based on the data. 

There is a trade-off between the confidence level and the amount of informa- 
tion provided by a confidence interval, as the cartoon below illustrates. We usually 
choose a confidence level of 90% or higher because we want to be quite sure of 
our conclusions. The most common confidence level is 95%. 


TAKING A LOOK AT THE HIGH TEMPERATURE 
TOMORKOW'S WEATHER... WILL BE BETWEEN 40 
BELOW ZEKO ANP 
200 ABOVE / 


THIS GUY'S 
NEVEK WRONG 


woo'pjee6-mmm 


Zl-@ S6ANVO Wal 
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Interpreting Confidence Intervals 
and Confidence Levels 


Our 95% confidence interval for Mr. Schiel’s mystery mean was (230.80, 250.80). 
How do we interpret this interval? We say, “We are 95% confident that the interval 
from 230.80 to 250.80 captures the mystery mean chosen by Mr. Schiel.” 


INTERPRETING CONFIDENCE INTERVALS 


‘To interpret a C% confidence interval for an unknown parameter, say, “We 
are C% confident that the interval from to captures the [param- 
eter in context].” 


Here’s an example that involves interpreting a confidence interval for a 
proportion. 


Who Will Win the Election? 


Interpreting a confidence interval 


‘Two weeks before a presidential election, a polling organization asked a random 
sample of registered voters the following question: “If the presidential election 
were held today, would you vote for candidate A or candidate B?” Based on this 
poll, the 95% confidence interval for the population proportion who favor candi- 


date A is (0.48, 0.54). 


PROBLEM: 
(a) Interpret the confidence interval. 
(b) What is the point estimate that was used to create the interval? What is the margin of error? 


(c) Based on this poll, a political reporter claims that the majority of registered voters favor candi- 
date A. Use the confidence interval to evaluate this claim. 


SOLUTION: 

(a) Weare 95% confident that the interval from 0.48 to 0.54 captures the true proportion of all 
registered voters who favor candidate A in the election. 

(b) Aconfidence interval has the form 


point estimate + margin of error 


The point estimate is at the midpoint of the interval. Here, the point estimate is p = 0.51. The mar- 
gin of error gives the distance from the point estimate to either end of the interval. So the margin of 
error for this interval is 0.03. 


(c) Any value from 0.48 to 0.54 is a plausible value for the population proportion pthat favors can- 
didate A. Because there are plausible values of pless than 0.50, the confidence interval does not give 
convincing evidence to support the reporter's claim that a majority (more than 50%) of registered 
voters favor candidate A. 


For Practice Try Exercise 9 | 
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ESTIMATING WITH CONFIDENCE 


The following Activity gives you a chance to explore the meaning of the confi- 
dence level. 


ACTIVITY | The Confidence Intervals Applet 


MATERIALS: 


Computer with Internet 
connection and display 


capability 


Confidence Level (C). 


Sample Size (n) 


SAMPLE 25 


Hit Total 


Percent hit 


pile, 


The Confidence Intervals applet at the book’s Web site will quickly generate many 
confidence intervals. In this Activity, you will use the applet to investigate the idea 
of a confidence level. 


1. Go to www.whfreeman.com/tps5e and launch the applet. Use the default set- 
tings: confidence level 95% and sample size n = 20. 
2. Click “Sample” to choose an SRS and display the resulting confidence 
interval. Did the interval capture the population mean ju (what the applet calls a 
“hit”)? Do this a total of 10 times. How many of the intervals captured the popu- 
lation mean ju? Note: So far, you have used the applet to take 10 SRSs, each of 
size n = 20. Be sure you understand the 
difference between sample size and the 
number of samples taken. 
3. Reset the applet. Click “Sample 25” 
twice to choose 50 SRSs and display 
the confidence intervals based on those 
samples. How many captured the param- 
eter 1? Keep clicking “Sample 25” and 
observe the value of “Percent hit.” What 
do you notice? 
4. Repeat Step 3 using a 90% confidence 
level. 
5. Repeat Step 3 using an 80% confi- 
dence level. 
6. Summarize what you have learned 
about the relationship between confidence 
level and “Percent hit” after taking many 
samples. 


We will investigate the effect of changing the sample size later. 


As the Activity confirms, the confidence level is the overall capture rate if the meth- 
od is used many times. Figure 8.4 illustrates the behavior of the confidence inter- 
val X + 10 for Mr. Schiel’s mystery mean. Starting with the population, imagine 
taking many SRSs of 16 observations. The first sample has x = 240.80, the second 
has x = 246.05, the third has x = 248.85, and so on. The sample mean varies 
from sample to sample, but when we use the formula x + 10 to get an interval 
based on each sample, about 95% of these intervals capture the unknown popula- 
tion mean [L. 
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210, ¥+10=240.80+ 10 


n= 
nel About 95% of these intervals 
spsn=lo, X¥ + 10 = 246.05 + 10 + capture the unknown 

SRS n=16 mean LU of the population. 


xX + 10 = 248.85 + 10 


Many SRSs_ Many confidence 
intervals 


Population 


FIGURE 8.4 To say that x + 10 is a 95% confidence interval for the population mean ju is to say 
that, in repeated samples, about 95% of these intervals capture ju. 


—____~——- Figure 8.5 illustrates the idea of a confidence interval in 
UA een eEe a different form. It shows the result of drawing many SRSs 
from the same population and calculating a 95% confidence 
interval from each sample. The center of each interval is at 


x and therefore varies from sample to sample. The sampling 
distribution of x appears at the top of the figure to show the 


~—— Values of ¥ —> long-term pattern of this variation. The 95% confidence inter- 
a vals from 25 SRSs appear below. 
a Here’s what you should notice: 


e The center X of each interval is marked by a dot. 


This interval e The distance from the dot to either endpoint of the inter- 
misses the true Hl. val is the margin of error. 


The oth ll 
et : e 24 of these 25 intervals (that’s 96%) contain the true value 


of js. If we took many samples, about 95% of the resulting 
confidence intervals would capture ju. 


Figures 8.4 and 8.5 give us the insight we need to interpret 


FIGURE 8.5 Twenty-five samples of the same size from 4 bemtilence level, 


the same population gave these 95% confidence intervals. 
In the long run, about 95% of samples give an interval that 
captures the population mean ju. 


INTERPRETING CONFIDENCE LEVELS 


‘To say that we are 95% confident is shorthand for “If we take many samples 
of the same size from this population, about 95% of them will result in an 
interval that captures the actual parameter value.” 


The confidence level tells us how likely it is that the method we are using 
will produce an interval that captures the population parameter if we use it many 
times. However, in practice we tend to calculate only a single confidence inter- 
val for a given situation. The confidence level does not tell us the chance 
that a particular confidence interval captures the population parameter. og 
Instead, the confidence interval gives us a set of plausible values for the 
parameter. 
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The Mystery Mean 


Interpreting a confidence level 


The confidence level in the mystery mean 


example—roughly 95% —tells us that if we take 
many SRSs of size 16 from Mr. Schiel’s mystery 
population, the interval x + 10 will capture 
the population mean ju for about 95% of those 
samples. 


Be sure you understand the basis for our confidence. There are only two possibilities: 


1. The interval from 230.80 to 250.80 contains the population mean ju. 


2. The interval from 230.80 to 250.80 does not contain the population mean ju. Our 
SRS was one of the few samples for which xX is not within 10 points of the true ju. 
Only about 5% of all samples result in a confidence interval that fails to capture 1. 


We cannot know whether our sample is one of the 95% for which the interval x + 10 
catches js or whether it is one of the unlucky 5%. The statement that we are “95% 
confident” that the unknown p lies between 230.80 and 250.80 is shorthand for say- 
ing, “We got these numbers by a method that gives correct results 95% of the time.” 


THINK What’s the probability that our 95% confidence interval cap- 
tures the parameter? It’s not 95%! Before we execute the command 
ABOUT IT mean (randNorm(M, 20,16) ), we have a 95% chance of getting a sample 
mean that’s within 20; of the mystery jz, which would lead to a confidence inter- 
val that captures fz. Once we have chosen a random sample, the sample mean X 
either is or isn’t within 20; of jz. And the resulting confidence interval either does 
or doesn’t contain yu. After we construct a confidence interval, the probability that 
it captures the population parameter is either | (it does) or 0 (it doesn’t). 


x».—S$S __ooe_q_uq->qe_x_~é_wue— FS 


We interpret confidence intervals and confidence levels in much the same way 
. whether we are estimating a population mean, proportion, or some other parameter. 


Do You Use Twitter? 


Interpreting a confidence interval and a 
confidence level 


The Pew Internet and American Life Project asked a random sample of 
2253 U.S. adults, “Do you ever . . . use Twitter or another service to share 
updates about yourself or to see updates about others?” Of the sample, 
19% said “Yes.” According to Pew, the resulting 95% confidence interval is 
(OMG, 0) 2113), 
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a 


PROBLEM: Interpret the confidence interval and the confidence level. 
SOLUTION: 


Confidence interval: We are 95% confident that the interval from 0.167 to 0.213 captures the 
true proportion p of all U.S. adults who use Twitter or another service for updates. 


Confidence level: \f many samples of 2253 U.S. adults were taken, the resulting confidence intervals 
would capture the true proportion of all U.S. adults who use Twitter or another service for updates 
for about 95% of those samples. 


For Practice Try Exercise 


Confidence intervals are statements about parameters. In the previous example, it 
would be wrong to say, “We are 95% confident that the interval from 0.167 to 0.213 
contains the sample proportion of U.S. adults who use Twitter.” Why? Because we 
know that the sample proportion, p = 0.19, is in the interval. Likewise, in the mys- 
tery mean example, it would be wrong to say that “95% of the values are between 


230.80 and 250.80,” whether we are referring to the sample or the popula- 1 


AP® EXAM TIP Onagiven 
problem, you may be asked to 
interpret the confidence interval, 
the confidence level, or both. 

Be sure you understand the 
difference: the confidence interval 
gives a set of plausible values for 
the parameter and the confidence 
level describes the long-run 
capture rate of the method. 


tion. All we can say is, “Based on Mr. Schiel’s sample, we believe that the 
population mean is somewhere between 230.80 and 250.80.” 

When interpreting a confidence interval, make it clear that you are describing 
a parameter and nota statistic. And be sure to include context. 


CHECK YOUR UNDERSTANDING 


How much does the fat content of Brand X hot dogs vary? ‘To find out, researchers mea- 
sured the fat content (in grams) of a random sample of 10 Brand X hot dogs. A 95% confi- 
dence interval for the population standard deviation @ is 2.84 to 7.55. 


1. Interpret the confidence interval. 
2. Interpret the confidence level. 


3. ‘True or false: The interval from 2.84 to 7.55 has a 95% chance of containing the 
actual population standard deviation o. Justify your answer. 


Constructing a Confidence Interval 


Why settle for 95% confidence when estimating an unknown parameter? Do 
larger random samples yield “better” intervals? The Confidence Intervals applet 
might shed some light on these questions. 


pheno, 


ACTIVITY | The Confidence Intervals Applet & 


MATERIALS: In this Activity, you will use the applet to explore the relationship between the 
Computer with Internet confidence level, the sample size, and the confidence interval. 

connection and display 1. Go to www.whfreeman.com/tps5e and launch the Confidence Intervals ap- 
capability plet. Use the default settings: confidence level 95% and sample size n = 20. 


Click “Sample 25.” 
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NORMAL FIX2 AUTO REAL RADIAN CL f 


mean(randNorm(M, 20,16)) 
240. 


2. Change the confidence level to 99%. What happens to the length of the 
confidence intervals? 


3. Now change the confidence level to 90%. What happens to the length of the 
confidence intervals? 

4. Finally, change the confidence level to 80%. What happens to the length of 
the confidence intervals? 

5. Summarize what you learned about the relationship between the confidence 
level and the length of the confidence intervals for a fixed sample size. 


6. Reset the applet and change the confidence level to 95%. What happens to 
the length of the confidence intervals as you increase the sample size? 


7. Does increasing the sample size increase the capture rate (percent hit)? Use 
the applet to investigate. 


As the Activity illustrates, the price we pay for greater confidence is a wider 
interval. If we’re satisfied with 80% confidence, then our interval of plausible 
values for the parameter will be much narrower than if we insist on 90%, 95%, 
or 99% confidence. But we'll also be much less confident in our estimate. 
Taking the idea of confidence to an extreme, what if we want to estimate with 
100% confidence the proportion p of all U.S. adults who use Twitter? That’s 
easy: we’re 100% confident that the interval from 0 to 1 captures the true 
population proportion! 

The activity also shows that we can get a more precise estimate of a parameter 
by increasing the sample size. Larger samples yield narrower confidence intervals. 
This result holds for any confidence level. 

Let’s look a bit more closely at the method we used earlier to calculate an ap- 
proximate 95% confidence interval for Mr. Schiel’s mystery mean. We started 
with 


point estimate + margin of error 


Our point estimate came from the sample statistic X = 240.80. What about the 
margin of error? Because the population distribution is Normal, so is the sam- 
pling distribution of x. About 95% of the values of x will lie within 2 standard 
deviations (20;) of the mystery mean pu. See the figure below. We could rewrite 
our interval as 


240.80 +2-5=x 42:0; 


Sampling 
distribution 
of € This leads to the more general formula for a confidence 
interval: 
Ll statistic + (critical value) - (standard deviation of statistic) 
(unknown) 


The critical value is a multiplier that makes the interval 
wide enough to have the stated capture rate. The critical value 
depends on both the confidence level C and the sampling 


+— Values of ¥ —— distribution of the statistic. 


Recall that oz = 7 and 
Vn 


1- 
op = mK 7 Pg as the sample 


size n increases, the standard deviation 
of the statistic decreases. 
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CALCULATING A CONFIDENCE INTERVAL 


The confidence interval for estimating a population parameter has the form 
statistic + (critical value) - (standard deviation of statistic) 


where the statistic we use is the point estimator for the parameter. 


The confidence interval for the mystery mean py of Mr. Schiel’s population 
illustrates several important properties that are shared by all confidence intervals 
in common use. The user chooses the confidence level, and the margin of error 
follows from this choice. We would like high confidence and also a small margin 
of error. High confidence says that our method almost always gives correct an- 
swers. A small margin of error says that we have pinned down the parameter quite 
precisely. 

Our general formula for a confidence interval is 


Margin of error 


(critical value) - (standard deviation of statistic) 


We can see that the margin of error depends on the critical value and the standard 
deviation of the statistic. The critical value is tied directly to the confidence level: 
greater confidence requires a larger critical value. The standard deviation of the 
statistic depends on the sample size n: larger samples give more precise estimates, 
which means less variability in the statistic. 

So the margin of error gets smaller when: 


Statistic + 


e The confidence level decreases. There is a trade-off between 
the confidence level and the margin of error. To obtain a 


80% confidence . 
smaller margin of error from the same data, you must be 


* 95% confidence willing to accept lower confidence. Earlier, we found that 
a 95% confidence interval for Mr. Schiel’s mystery mean ju 


Gap S45: Sadlbae Sak SADA OU Be SIs B50. Gah Se is 230.80 to 250.80. The 80% confidence interval for pu is 
FIGURE 8.6 80% and 95% confidence intervals for 234.39 to 247.21. Figure 8.6 compares these two intervals. 


Mr. Schiel’s mystery mean. Higher confidence requires a ¢ The sample size n increases. Increasing the sample size n 


longer interval. 


reduces the margin of error for any fixed confidence level. 


Using Confidence Intervals Wisely 


Our goal in this section has been to introduce you to the big ideas of confi- 
dence intervals without getting bogged down in details. You may have noticed 
that we only calculated intervals in a contrived setting: estimating an unknown 
population mean ys when we somehow knew the population standard devia- 
tion o. In practice, when we don’t know yu, we don’t know o either. We'll learn 
to construct confidence intervals for a population mean in this more realistic 
setting in Section 8.3. First, we will study confidence intervals for a population 
proportion fp in Section 8.2. Although it is possible to estimate other param- 
eters, confidence intervals for means and proportions are the most common 
tools in everyday use. 
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Here are two important cautions to keep in mind when constructing and inter- 
preting confidence intervals. 


© Our method of calculation assumes that the data come from an SRS of 4 
size n from the population of interest. Other types of random samples 
(stratified or cluster, say) might be preferable to an SRS in a given 
setting, but they require more complex calculations than the ones we'll use. 


¢ The margin of error in a confidence interval covers only chance variation due 
to random sampling or random assignment. The margin of error is 
obtained from the sampling distribution. It indicates how close our @ 
estimate is likely to be to the unknown parameter if we repeat the ran- 
dom sampling or random assignment process many times. Practical difficul- 
ties, such as undercoverage and nonresponse in a sample survey, can lead to 
additional errors that may be larger than this chance variation. Remember this 
unpleasant fact when reading the results of an opinion poll or other sample 
survey. The way in which a survey or experiment is conducted influences the 
trustworthiness of its results in ways that are not included in the announced 
margin of error. 


Sram Summary 


e To estimate an unknown population parameter, start with a statistic that pro- 
vides a reasonable guess. The chosen statistic is a point estimator for the 
parameter. The specific value of the point estimator that we use gives a point 
estimate for the parameter. 


e AC% confidence interval uses sample data to estimate an unknown popula- 
tion parameter with an indication of how precise the estimate is and of how 
confident we are that the result is correct. 


e <A confidence interval gives an interval of plausible values for the parameter. 
The interval is computed from the data and has the form 


point estimate + margin of error 
When calculating a confidence interval, it is common to use the form 
statistic + (critical value) - (standard deviation of statistic) 


e To interpret a C% confidence interval, say, “We are C% confident that the 
interval from to captures the [parameter in context].” Be sure that 
your interpretation describes a parameter and not a statistic. 


e The confidence level C is the success rate of the method that produces the 
interval. If you use 95% confidence intervals often, in the long run about 
95% of your intervals will contain the true parameter value. You don’t know 
whether a 95% confidence interval calculated from a particular set of data 
actually captures the true parameter value. 
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e Other things being equal, the margin of error of a confidence interval gets 


smaller as 


e the confidence level C decreases. 


e the sample size n increases. 


e Remember that the margin of error for a confidence interval includes 
only chance variation, not other sources of error like nonresponse and 


undercoverage. 


Ysainmesae EXercises 


In Exercises | to +, determine the point estimator you would 
use and calculate the value of the point estimate. 
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Got shoes? How many pairs of shoes, on average, 
do female teens have? To find out, an AP® Statistics 
class conducted a survey. They selected an SRS of 
20 female students from their school. Then they 
recorded the number of pairs of shoes that each 
student reported having. Here are the data: 


50 


2679265 Si 57 19 24 2257 23) 38 
50 = 13°—CO84—C 2380) 49 85 


Got shoes? ‘The class in Exercise 1 wants to estimate 
the variability in the number of pairs of shoes that 
female students have by estimating the population 
variance o°. 


Going to the prom Tonya wants to estimate what 
proportion of the seniors in her school plan to attend 
the prom. She interviews an SRS of 50 of the 750 
seniors in her school and finds that 36 plan to go to 
the prom. 


Reporting cheating What proportion of students 
are willing to report cheating by other students? A 
student project put this question to an SRS of 172 
undergraduates at a large university: “You witness 
two students cheating on a quiz. Do you go to the 
professor?” Only 19 answered “Yes.”° 


NAEP scores Young people have a better chance 
of full-time employment and good wages if they 

are good with numbers. How strong are the quan- 
titative skills of young Americans of working age? 
One source of data is the National Assessment of 
Educational Progress (NAEP) Young Adult Literacy 
Assessment Survey, which is based on a nationwide 
probability sample of households. ‘The NAEP survey 
includes a short test of quantitative skills, covering 


mainly basic arithmetic and the ability to apply it to 
realistic problems. Scores on the test range from 0 
to 500. For example, a person who scores 233 can 
add the amounts of two checks appearing on a bank 
deposit slip; someone scoring 325 can determine the 
price of a meal from a menu; a person scoring 375 
can transform a price in cents per ounce into dollars 
per pound.* 

Suppose that you give the NAEP test to an SRS 
of 840 people from a large population in which the 
scores have mean 280 and standard deviation o = 60. 
The mean x of the 840 scores will vary if you take 
repeated samples. 


Describe the shape, center, and spread of the sam- 
pling distribution of x. 


Sketch the sampling distribution of x. Mark its mean 
and the values 1, 2, and 3 standard deviations on 
either side of the mean. 


According to the 68-95-99.7 rule, about 95% of all 
values of x lie within a distance m of the mean of the 
sampling distribution. What is m? Shade the region on 
the axis of your sketch that is within m of the mean. 


Whenever <x falls in the region you shaded, the 
population mean 4 lies in the confidence interval 

x + m. For what percent of all possible samples does 
the interval capture ju? 


Auto emissions Oxides of nitrogen (called NOX 

for short) emitted by cars and trucks are important 
contributors to air pollution. The amount of NOX 
emitted by a particular model varies from vehicle to 
vehicle. For one light-truck model, NOX emissions 
vary with mean js = 1.8 grams per mile and standard 
deviation o = 0.4 gram per mile. You test an SRS 

of 50 of these trucks. The sample mean NOX level x 
will vary if you take repeated samples. 
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Describe the shape, center, and spread of the sam- 
pling distribution of x. 


Sketch the sampling distribution of x. Mark its mean 
and the values 1, 2, and 3 standard deviations on 
either side of the mean. 


According to the 68-95—99.7 rule, about 95% of all 
values of x lie within a distance m of the mean of the 
sampling distribution. What is m? Shade the region 
on the axis of your sketch that is within m of the 
mean. 


Whenever x falls in the region you shaded, the 
unknown population mean su lies in the confidence 
interval x + m. For what percent of all possible 
samples does the interval capture ju? 


NAEP scores Refer to Exercise 5. Below your 
sketch, choose one value of x inside the shaded re- 
gion and draw its corresponding confidence interval. 
Do the same for one value of x outside the shaded 
region. What is the most important difference be- 
tween these intervals? (Use Figure 8.5, on page 483, 
as a model for your drawing.) 


Auto emissions Refer to Exercise 6. Below your 
sketch, choose one value of x inside the shaded re- 
gion and draw its corresponding confidence interval. 
Do the same for one value of x outside the shaded 
region. What is the most important difference be- 
tween these intervals? (Use Figure 8.5, on page 483, 
as a model for your drawing.) 


Prayer in school A New York Times/CBS News Poll 
asked a random sample of U.S. adults the question, 
“Do you favor an amendment to the Constitu- 

tion that would permit organized prayer in public 
schools?” Based on this poll, the 95% confidence 
interval for the population proportion who favor such 
an amendment is (0.63, 0.69). 


Interpret the confidence interval. 


What is the point estimate that was used to create the 
interval? What is the margin of error? 


Based on this poll, a reporter claims that more than 
two-thirds of U.S. adults favor such an amendment. 
Use the confidence interval to evaluate this claim. 


Losing weight A Gallup Poll asked a random sam- 
ple of U.S. adults, “Would you like to lose weight?” 
Based on this poll, the 95% confidence interval for 
the population proportion who want to lose weight is 


(0.56, 0.62).° 
Interpret the confidence interval. 


What is the point estimate that was used to create the 
interval? What is the margin of error? 


(c) 


11. 
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Based on this poll, Gallup claims that more than 
half of U.S. adults want to lose weight. Use the confi- 
dence interval to evaluate this claim. 


How confident? ‘The figure below shows the result 
of taking 25 SRSs from a Normal population and 
constructing a confidence interval for each sample. 


Which confidence level— 80%, 90%, 95%, or 99% — 
do you think was used? Explain. 


How confident? ‘The figure below shows the result 
of taking 25 SRSs from a Normal population and 
constructing a confidence interval for each sample. 


Which confidence level— 80%, 90%, 95%, or 99% — 
do you think was used? Explain. 


Prayer in school Refer to Exercise 9. The news 
article goes on to say: “The theoretical errors do not 
take into account - - - additional error resulting from 
the various practical difficulties in taking any survey 
of public opinion.” List some of the “practical diffi- 
culties” that may cause errors which are not included 
in the +3 percentage point margin of error. 


Losing weight Refer to Exercise 10. As Gallup 
indicates, the 3 percentage point margin of error for 
this poll includes only sampling variability (what they 
call “sampling error”). What other potential sources of 
error (Gallup calls these “nonsampling errors”) could 
affect the accuracy of the 95% confidence interval? 


Shoes The AP® Statistics class in Exercise | also 
asked an SRS of 20 boys at their school how many 


16. 


17. 


18. 


19: 


pairs of shoes they have. A 95% confidence interval for 
the difference in the population means (girls — boys) is 
10.9 to 26.5. Interpret the confidence interval and the 
confidence level. 


Lying online Many teens have posted profiles 

on sites such as Facebook. A sample survey asked 
random samples of teens with online profiles if they 
included false information in their profiles. Of 170 
younger teens (ages 12 to 14) polled, 117 said “Yes.” 
Of 317 older teens (ages 15 to 17) polled, 152 said 
“Yes.”° A 95% confidence interval for the difference 
in the population proportions (younger teens — older 
teens) is 0.120 to 0.297. Interpret the confidence 
interval and the confidence level. 


Shoes Refer to Exercise 15. Does the confidence 
interval give convincing evidence of a difference in 
the population mean number of pairs of shoes for 
boys and girls at the school? Justify your answer. 


Lying online Refer to Exercise 16. Does the confi- 
dence interval give convincing evidence of a differ- 
ence in the population proportions of younger and 
older teens who include false information in their 
profiles? Justify your answer. 


Explaining confidence A 95% confidence interval 
for the mean body mass index (BMI) of young 
American women is 26.8 + 0.6. Discuss whether 
each of the following explanations is correct. 


We are confident that 95% of all young women have 
BMI between 26.2 and 27.4. 


We are 95% confident that future samples of young 
women will have mean BMI between 26.2 and 27.4. 


Any value from 26.2 to 27.4 is believable as the true 
mean BMI of young American women. 


If we take many samples, the population mean BMI will 
be between 26.2 and 27.4 in about 95% of those samples. 


The mean BMI of young American women cannot 


be 28. 


Explaining confidence ‘The admissions director 
from Big City University found that (107.8, 116.2) is 
a 95% confidence interval for the mean IQ score of 
all freshmen. Discuss whether each of the following 
explanations is correct. 


There is a 95% probability that the interval from 
107.8 to 116.2 contains 2. 


There is a 95% chance that the interval (107.8, 
116.2) contains x. 


This interval was constructed using a method that 
produces intervals that capture the true mean in 95% 
of all possible samples. 
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(d) 


(e) 


If we take many samples, about 95% of them will 
contain the interval (107.8, 116.2). 


The probability that the interval (107.8, 116.2) cap- 
tures 1 is either 0 or 1, but we don’t know which. 


Multiple choice: Select the best answer for Exercises 21 
to 24. 

Exercises 21 and 22 refer to the following setting. A researcher 
plans to use a random sample of families to estimate the 
mean monthly family income for a large population. 


21. The researcher is deciding between a 95% confidence 
level and a 99% confidence level. Compared to a 95% 
confidence interval, a 99% confidence interval will be 

(a) narrower and would involve a larger risk of being 
incorrect. 

(b) wider and would involve a smaller risk of being 
incorrect. 

(c) narrower and would involve a smaller risk of being 
incorrect. 

(d) wider and would involve a larger risk of being incorrect. 

(e) wider and would have the same risk of being incorrect. 

22. The researcher is deciding between a sample of size 
n = 500 and a sample of size n = 1000. Compared 
to using a sample size of n = 500, a 95% confidence 
interval based on a sample size of n = 1000 will be 

(a) narrower and would involve a larger risk of being 
incorrect. 

(b) wider and would involve a smaller risk of being 
incorrect. 

(c) narrower and would involve a smaller risk of being 
incorrect. 

(d) wider and would involve a larger risk of being incorrect. 

(e) narrower and would have the same risk of being 
incorrect. 

23. Ina poll, 

I. Some people refused to answer questions. 

II. People without telephones could not be in the sample. 

III. Some people never answered the phone in several 
calls. 

Which of these possible sources of bias is included in 
the +2% margin of error announced for the poll? 

(a) I only (c) II only (e) None of these 

(b) II only (d) I, I, and Ill 

24. You have measured the systolic blood pressure of an 


SRS of 25 company employees. A 95% confidence 
interval for the mean systolic blood pressure for the 
employees of this company is (122, 138). Which of 
the following statements is true? 
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95% of the sample of employees have a systolic blood 
pressure between 122 and 138. 


95% of the population of employees have a systolic 
blood pressure between 122 and 138. 


If the procedure were repeated many times, 95% of 
the resulting confidence intervals would contain the 
population mean systolic blood pressure. 


If the procedure were repeated many times, 95% of 
the time the population mean systolic blood pressure 
would be between 122 and 138. 


If the procedure were repeated many times, 95% of 
the time the sample mean systolic blood pressure 
would be between 122 and 138. 


Power lines and cancer (4.2, 4.3) Does living 

near power lines cause leukemia in children? ‘The 
National Cancer Institute spent 5 years and $5 mil- 
lion gathering data on this question. The researchers 
compared 638 children who had leukemia with 620 
who did not. They went into the homes and mea- 
sured the magnetic fields in children’s bedrooms, in 
other rooms, and at the front door. They recorded 
facts about power lines near the family home and 
also near the mother’s residence when she was 


State and check the Random, 10%, and Large Counts 
conditions for constructing a confidence interval for a 


population proportion. 

Determine critical values for calculating a C% confi- 
dence interval for a population proportion using a table 
or technology. 


(a) 
(b) 


26. 


pregnant. Result: no connection between leukemia 
and exposure to magnetic fields of the kind produced 
by power lines was found.’ 


Was this an observational study or an experiment? 
Justify your answer. 


Does this study show that living near power lines 
doesn’t cause cancer? Explain. 


Sisters and brothers (3.1, 3.2) How strongly do physi- 


ap, cal characteristics of sisters and brothers correlate? Here 


are data on the heights (in inches) of 11 adult pairs:* 


Brother: 71 68 66 67 70 71 70 73 72 65 66 
Sister: 69 64 65 63 65 62 65 64 66 59 62 


(a) 


(b) 


Construct a scatterplot using brother’s height as the 
explanatory variable. Describe what you see. 


Use your calculator to compute the least-squares 
regression line for predicting sister’s height from 
brother’s height. Interpret the slope in context. 


Damien is 70 inches tall. Predict the height of his 
sister ‘Tonya. 


Do you expect your prediction in (c) to be very accu- 
tate? Give appropriate evidence to support your answer. 


Estimating a Population 
Proportion 


By the end of the section, you should be able to: 


Construct and interpret a confidence interval for a 
population proportion. 

Determine the sample size required to obtain a C% 
confidence interval for a population proportion with a 
specified margin of error. 


In Section 8.1, we saw that a confidence interval can be used to estimate an un- 
known population parameter. We are often interested in estimating the propor- 
tion p of some outcome in the population. Here are some examples: 


e¢ What proportion of U.S. adults are unemployed right now? 


e¢ What proportion of high school students have cheated on a test? 


ACTIVITY 


MATERIALS: 


Several thousand small 
plastic beads of at least two 
colors, container to hold 

all the beads, small cup 

for sampling, several small 
bowls 
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e¢ What proportion of pine trees in a national park are infested with beetles? 
e¢ What proportion of college students pray daily? 


¢ What proportion of a company’s laptop batteries last as long as the company 
claims? 


This section shows you how to construct and interpret a confidence interval for a 
population proportion. The following Activity gives you a taste of what lies ahead. 


The Beads 


Before class, your teacher will prepare a large population of different-colored 
beads and put them into a container that you cannot see inside. Your goal is to 
estimate the actual proportion of beads in the population that have a particular 
color (say, red). 

1. Asa class, discuss how to use the cup provided to get a simple random 
sample of beads from the container. Think this through carefully, because you 
will get to take only one sample. 


2. Have one student take an SRS of beads. Separate the beads 
into two groups: those that are red and those that aren’t. Count the 
number of beads in each group. 


3. Determine a point estimate for the unknown population pro- 
portion fp of red beads in the container. 


4. Now for the challenge: each team of three to four students 
will be given about 10 minutes to find a 95% confidence interval 
for the parameter p. Be sure to consider any conditions that are 
required for the methods you use. 


5. Compare the results with those of the other teams in the class. 
Discuss any problems you encountered and how you dealt with them. 


Conditions for Estimating p 


Before constructing a confidence interval for p, you should check some important 
conditions: 


e¢ Random: The data should come from a well-designed random sample or ran- 
domized experiment. Otherwise, there’s no scope for inference to a popula- 
tion (sampling) or inference about cause and effect (experiment). If we can’t 
draw conclusions beyond the data at hand, there’s not much point in con- 
structing a confidence interval! 
Another important reason for random selection or random assignment is to in- 
troduce chance into the data-production process. We can model chance behay- 
ior with a probability distribution, like the sampling distributions of Chapter 7. 
The probability distribution helps us calculate a confidence interval. 
© 10%: The procedure for calculating confidence intervals assumes that in- 


dividual observations are independent. Well-conducted studies that use 
random sampling or random assignment can help ensure independent 


494 CHAPTER 8 ESTIMATING WITH CONFIDENCE 


observations. However, our formula for the standard deviation of the 


. (1 — p) 
sampling distribution of p, oj = a acts as if we are sampling 
with replacement from a population. That’s rarely the case. When sam- 
pling without replacement from a finite population, be sure to check that 
n= rT . Sampling more than 10% of the population would give a more 
precise estimate of the parameter p but would require us to use a different 
formula to calculate the standard deviation of the sampling distribution. 


e¢ Large Counts: The method that we use to construct a confidence interval 

for p depends on the fact that the sampling distribution of f is approximately 

Normal. From what we learned in Chapter 7, we can use the Normal approxi- 

mation to the sampling distribution of f as long as np = 10 and n(1—p) = 10. 

In practice, of course, we don’t know the value of p. If we did, we wouldn't 

need to construct a confidence interval for it! So we cannot check if np and 

n(1—p) are at least 10. In large random samples, f will tend to be close to p. 

So we replace p by f in checking the Large Counts condition: np = 10 and 
n(1 — p) = 10. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE 
INTERVAL ABOUT A PROPORTION 


e Random: The data come from a well-designed random sample or ran- 
domized experiment. 
l 
© 10%: When sampling without replacement, check that n = 0. : 
e Large Counts: Both nf and n(1 — f) are at least 10. 


When Mr. Vignolini’s class did the beads Activity, they got 107 red beads and 
144 white beads. Their point estimate for the unknown proportion p of red beads 
in the population is 


~ _ 107 
p= 351 = 01.426 
. Let’s see how the conditions play out for Mr. Vignolini’s class. 


The Beads 


Checking conditions 


Mr. Vignolini’s class wants to construct a confidence interval for the true propor- 
tion p of red beads in the container. Recall that the class’s sample had 107 red 
beads and 144 white beads. 


PROBLEM: Check that the conditions for constructing a confidence interval for pare met. 
SOLUTION: There are three conditions to check: 
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Random: The class took an SRS of 251 beads from the container. 


° 10%: Because the class sampled without replacement, they need to check that there are 
at least 10(251) = 2510 beads in the population. Mr. Vignolini reveals that there are 
3000 beads in the container. 

Large Counts: To use a Normal approximation for the sampling distribution of p, we need both np 
and n(1 — p)to be at least 10. Because we don’t know p, we check 


. 107 , 107 144 
np = 251{ —— ] = 107 = 10 and n(1 — p) = 251(1 - ——] = 251 —_]=144 = 10 


pePLon, 


That is, the counts of successes (red beads) and failures (non-red beads) are both at least 10. 
All conditions are met, so it should be safe for the class to construct a confidence interval. 


For Practice Try Exercise 


Notice that nf and n(1 — f) should be whole numbers. You don’t really need 
to calculate these values since they are just the number of successes and failures 
in the sample. In the previous example, we could address the Large Counts condi- 
tion simply by saying, “The numbers of successes (107) and failures (144) in the 
sample are both at least 10.” 


What happens if one of the conditions is violated? If the data 
come from a convenience sample or a poorly designed experiment, there’s no 
point constructing a confidence interval for p. Violation of the Random condition 
severely limits our ability to make any inference beyond the data at hand. 

The figure below shows a screen shot from the Confidence Intervals for Proportions 
applet at the book’s Web site, www.whfreeman.com/tps5e. We set n = 20 and 
p = 0.25. The Large Counts condition is not met because np = 20(0.25) = 5. We 
used the applet to generate 1000 95% confidence intervals for p. Only 902 of those 
1000 intervals contained p = 0.25, a capture rate of 90.2%. When the Large Counts 
condition is violated, the capture rate will be lower than the one advertised by the 
confidence level if the method is used many times. 


Population Proportion (p). 0.25 


aa rT 


Confidence | evel (Cy 95 = 


| 


Sample Size (n): 20 ee 
e— 
htt 
SAMPLE 25 


Hit 902 = Total: 1000 os Eg) BR Soe] SEEN Fa 


Percent hit 0.902 


RESET 
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That leaves just the 10% condition when sampling without replacement from 
a finite population. Large random samples give more precise estimates than small 
random samples. So randomly selecting more than 10% of a population should 
be a good thing! Unfortunately, the formula for the standard deviation of f that we 


p(l — p) 


developed in Chapter 7, 03 = 1s not correct when the 10% condition 


is violated. The formula gives a value that is too large. Confidence intervals based 
on this formula are longer than they need to be. If many 95% confidence intervals 
for a population proportion are constructed in this way, more than 95% of them 
will capture p. The actual capture rate is greater than the reported confidence 
level when the 10% condition is violated. 


CHECK YOUR UNDERSTANDING 


In each of the following settings, check whether the conditions for calculating a confi- 
dence interval for the population proportion p are met. 


1. An AP® Statistics class at a large high school conducts a survey. They ask the first 100 
students to arrive at school one morning whether or not they slept at least 8 hours the night 
before. Only 17 students say “Yes.” 


2. A quality control inspector takes a random sample of 25 bags of potato chips from 
the thousands of bags filled in an hour. Of the bags selected, 3 had too much salt. 


Constructing a Confidence Interval for p 


When the conditions are met, the sampling distribution of f will be approximately 


p(l — p) 


Normal with mean pg = p and standard deviation of = 7 Figure 8.7 


Sampling Standard 
distribution deviation 


of p pC. — p)in 


Mean p 
FIGURE 8.7 Suppose that a population has proportion p of 
successes. When the conditions for inference are met, the 
sampling distribution of the proportion of successes p ina 
sample is approximately Normal with mean p and standard 


displays this distribution. Inference about a population pro- 
portion f is based on the sampling distribution of f. 

We can use the general formula from Section 8.1 to construct 
a confidence interval for an unknown population proportion p: 


statistic + (critical value) - (standard deviation of statistic) 


The sample proportion f is the statistic we use to estimate p. 
Doing so makes sense if the data came from a well-designed ran- 
dom sample or randomized experiment (the Random condition). 

The standard deviation of the sampling distribution of f is 


_ (pd=p) 
i re 


if the 10% condition is met. Because we don’t know the value 
of p, we replace it with the sample proportion fp. The result- 


1 = 
deviation PA P) ; ing quantity is called the standard error (SE) of the sample 
proportion fp. 
b(1 — p) 
Some books refer to oj as the SEs = ae 
“standard error” of 6 and to what ; hae: ; ; 
we call the standard error as the It describes how close the sample proportion fp will typically be to the population 


“estimated standard error.” proportion fp in repeated SRSs of size n. 
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DEFINITION: Standard error 


When the standard deviation of a statistic is estimated from data, the result is called 
the standard error of the statistic. 


Ses How do we get the critical value for our confidence interval? If 
Normal curve 


the Large Counts condition is met, we can use a Normal curve. 
For the approximate 95% confidence intervals of Section 8.1, 
we used a critical value of 2 based on the 68-95-99.7 rule for 
Normal distributions. We can get a more precise critical value 
from ‘Table A or a calculator. As Figure 8.8 shows, the central 95% 
of the standard Normal distribution is marked off by 2 points, 
z* = 1.96 and —z* = —1.96. We use the * to remind you that this 
is a critical value, not a standardized score that has been calculated 


Probability = 0.95 


Area = 0.025 Area = 0.025 


-7' =-1.96 0 2° = 1.96 from data. 
FIGURE 8.8 Finding the critical value for a 95% To find a level C confidence interval, we need to catch the 
confidence interval. The correct value is z* = 1.96, central C% under the standard Normal curve. Here’s an ex- 
which is more precise than the value of 2 we had been ample that shows how to get the critical value z* for a different 
using for 95% confidence. confidence level. 


80% Confidence 


Finding a critical value 

PROBLEM: Use Table A or technology to find the critical value z* for an 80% confidence interval. 
Assume that the Large Counts condition is met. 

SOLUTION: Foran &0% confidence level, we need to capture the 
central 80% of the standard Normal distribution. In capturing the 


Normal curve central 80%, we leave out 20%, or 10% in each tail. So the desired 
critical value 2* is the point with area 0.1 to its right under the stan- 
dard Normal curve. Figure 8.9 shows the details in picture form. 


Search the body of Table A to find the point — z* with area 0.1 toits 


Probability = 0.8 left. The closest entryis z= — 1.28. (See the excerpt from Table A 
below.) So the critical value we want is z* = 1.28. 


Z .07 .08 .09 
-Z" = -1.28 0 z*=128 —1.3  .0853 0838 = .0823 


—14 1210  .1190  ~=.1170 


FIGURE 8.9 Finding the critical value for an 80% confidence 
interval. 


Using technology: The command invNorm(area:0.1,{4:0, 0:1) givesz= —1.28. The 
critical value is z* = 1.28, which matches what we got from Table A. 


For Practice Try Exercise 
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Technically, the correct formula for a 
confidence interval is statistic + 
(critical value) - (standard error 

of statistic). We are following the 
convention used on the AP® Statistics 
exam formula sheet. 
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Once we find the critical value z*, our confidence interval for the population 
proportion p is 
statistic + (critical value) - (standard deviation of statistic) 
p(l — p) 


=fxt-* 
P n 


Notice that we replaced the standard deviation of f with the formula for its stan- 


dard error. The resulting interval is sometimes called a one-sample z interval for 
a population proportion. 


ONE-SAMPLE z INTERVAL FOR A POPULATION PROPORTION 


When the conditions are met, a C% confidence interval for the unknown 


proportion p is 
_., aD 
1 


where z* is the critical value for the standard Normal curve with C% of its 
area between —z* and z™. 


Now we can get the desired confidence interval for Mr. Vignolini’s class. 


The Beads 


Calculating a confidence interval for p 


PROBLEM: Mr. Vignolini’s class took an SRS of beads from the container and got 107 red beads 
and 144 white beads. 


(a) Calculate and interpret a 90% confidence interval for p. 


(b) Mr. Vignolini claims that exactly half of the beads in the container are red. Use your result from 
part (a) to comment on this claim. 


SOLUTION: We checked conditions for calculating the interval earlier. 
(a) Our confidence interval has the form 


statistic + (critical value) - (standard deviation of statistic) 
. [PU ?) 
ye a 


fn 


Area = 0.90 


Area = 0.05 


-Z"= -1.645 


0 


The sample statisticis p = 107/251 = 0.426. Nowlet’sfind the 
critical value. From Table A, we look for the point with area 0.05 to its left. 
As the excerpt from Table A shows, this point is between z= — 1.64 
and z= — 1.65. The calculator’s invNorm(area:0.05,[:0, 
o:1) givesz= — 1.645. So we use z* = 1.645 as our critical value. 


Standard 
Normal curve 


Area = 0.05 


z 03 04 05 
1.7 0418 .0409 .0401 
—1.6 .0516 {/10505)))/0485") 

z= 1.645 -1.5 0630 .0618  .0606 


Section 8.2 Estimating a Population Proportion 499 


The resulting 90% confidence interval is 


(0.426)(1 — 0.426) 
251 


= 0.426+1.645 i 


= 0.426+0.051 
= (0.375,0.477) 


We are 90% confident that the interval from 0.375 to 0.477 captures the true proportion of red 
beads in Mr. Vignolini’s container. 

(b) The confidence interval in part (a) gives a set of plausible values for the population proportion of 
red beads. Because 0.5 is not contained in the interval, it is not a plausible value for p. We have good 
reason to doubt Mr. Vignolini’s claim. 


For Practice Try Exercise 


CHECK YOUR UNDERSTANDING 


Alcohol abuse has been described by college presidents as the number one problem 
on campus, and it is an important cause of death in young adults. How common is 
it? A survey of 10,904 randomly selected U.S. college students collected information on 
drinking behavior and alcohol-related problems.’ The researchers defined “frequent binge 
drinking” as having five or more drinks in a row three or more times in the past two weeks. 
According to this definition, 2486 students were classified as frequent binge drinkers. 


1. Identify the parameter of interest. 
2. Check conditions for constructing a confidence interval for the parameter. 


3. Find the critical value for a 99% confidence interval. Show your method. Then 
calculate the interval. 


4. Interpret the interval in context. 


Putting It All Together: 
The Four-Step Process 


Taken together, the examples about Mr. Vignolini’s class and the beads Activity 
show you how to get a confidence interval for an unknown population proportion 
p. We can use the familiar four-step process whenever a problem asks us to con- 
struct and interpret a confidence interval. 


CONFIDENCE INTERVALS: A FOUR-STEP PROCESS 


oy 4 State: What parameter do you want to estimate, and at what confidence level? 
Plan: Identify the appropriate inference method. Check conditions. 
Do: If the conditions are met, perform calculations. 


Conclude: Interpret your interval in the context of the problem. 
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AP® EXAM TIP If a free- 
response question asks you 
to construct and interpret a 
confidence interval, you are 
expected to do the entire 
four-step process. That 
includes clearly defining the 
parameter, identifying the 
procedure, and checking 
conditions. 


Simulation studies have shown that a 
variation of our method for calculating 
a 95% confidence interval for p can 
result in closer to a 95% capture rate 
in the long run, especially for small 
sample sizes. This simple adjustment, 
first suggested by Edwin Bidwell 
Wilson in 1927, is sometimes called 
the “plus four” estimate. Just pretend 
we have four additional observations, 
two of which are successes and two 
of which are failures. Then calculate 
the “plus four interval” using the plus 
four estimate in place of 6 in our usual 
formula. 


The next example illustrates the four-step process in action. 


Teens Say Sex Can Wait : 


Confidence interval for p 


The Gallup Youth Survey asked a random sample of 439 U.S. teens aged 
13 to 17 whether they thought young people should wait to have sex until 
marriage.!? Of the sample, 246 said “Yes.” Construct and interpret a 95% 
confidence interval for the proportion of all teens who would say “Yes” if 
asked this question. 


STATE: Wewant to estimate the true proportion pofall 13- to 17-year-olds in the 
United States who would say that young people should wait to have sex until they get mar- 
ried with 95% confidence. 


PLAN: We should use a one-sample zinterval for pif the conditions are met. 


* Random: Gallup surveyed a random sample of U.S. teens. 


° 10%: Because Gallup is sampling without replacement, we need to check the 10% 
condition: there are at least 10(439) = 4390 U.S. teens aged 13 to 17. 
¢ Large Counts: We check the counts of “successes” and “failures”: 


ip = 240 = 10 and n(1— fp) = 193 = 10 
DO: The sample statistic is p = 246/439 = 0.56. A 95% confidence interval for pis given by 
P(1 — p) (0.56)(0.44) 


p= 7,|/———"" = 0.56 £1.96 
n 439 


= 0.56+0.046 
= (0.514, 0.606) 


CONCLUDE: Weare 95% confident that the interval from 0.514 to 0.606 captures the true 
proportion of 13- to 17-year-olds in the United States who would say that teens should wait until 
marriage to have sex. 


For Practice Try Exercise 


Remember that the margin of error in a confidence interval includes only 
sampling variability! There are other sources of error that are not taken into 
account. As is the case with many surveys, we are forced to assume that the 
teens answered truthfully. If they didn’t, then our estimate may be biased. Other 
problems like nonresponse and question wording can also affect the results of this or 
any other poll. Lesson: Sampling beads is much easier than sampling people! 

Your calculator will handle the “Do” part of the four-step process, as the follow- 
ing Technology Corner illustrates. 


AP® EXAM TIP You may use your calculator to compute a confidence interval on the AP® 
exam. But there’s a risk involved. If you just give the calculator answer with no work, you'll get 


either full credit for the “Do” step (if the interval is correct) or no credit (if it’s wrong). If you opt 
for the calculator-only method, be sure to name the procedure (e.g., one-proportion z interval) 
and to give the interval (e.g., 0.514 to 0.607). 
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CONFIDENCE INTERVAL FOR A 
CORNER. POPULATION PROPORTION 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


The T1-83/84 and TI-89 can be used to construct a confidence interval for an unknown population proportion. We'll 
demonstrate using the previous example. Of n = 439 teens surveyed, X = 246 said they thought that young people 
should wait to have sex until after marriage. ‘To construct a confidence interval: 


TI-83/84 TI-89 


e Press| STAT |, then choose TESTS e In the Statistics/List Editor, press ({F7]) 
and 1-PropZInt. and choose 1-PropZInt. 


e When the I-PropZInt screen appears, enter x = 246, n = 439, and confidence level 0.95. 


NORMAL FLOAT AUTO REAL RADIAN CL o 


1—PropZInt 
x?246 
ni439 Successes x¢ 246 


C-Level:.95 ne 
Calculate elaiai 


en as 
11st1={0,1,2,35,4,5,6,7,8,. 
TYPE * (ENTERI=K AND [ESCI=CRNCEL 


e Highlight “Calculate” and press |ENTER|, The 95% confidence interval for p is reported, along with the sample 
proportion f and the sample size, as shown here. 


NORMAL FLOAT AUTO REAL RADIAN CL n 


option 2 ints 
(.51393, .60679) r 
b=, 5603644647 ee 
n=439 M 0484S 
=429, 


1isti=(0,1,2,5,.4,5,6,7,8, 
MAIN RAD AUTO FUNC IE 


Choosing the Sample Size 


In planning a study, we may want to choose a sample size that allows us to estimate 

a population proportion within a given margin of error. National survey organi- 

zations like the Gallup Poll typically sample between 1000 and 1500 American 

adults, who are interviewed by telephone. Why do they choose such sample sizes? 
The margin of error (ME) in the confidence interval for p is 


p(l—~ Pp) 


i 


ME = z* 


Here, z* is the standard Normal critical value for the confidence level we want. 
Because the margin of error involves the sample proportion of successes f, we 
have to guess the value of f when choosing n. Here are two ways to do this: 


1. Use a guess for p based on a pilot study or on past experience with similar studies. 
You should do several calculations that cover the range of p-values you might get. 


2. Use p = 0.5 as the guess. The margin of error ME is largest when p = 0.5, so 
this guess is conservative in the sense that if we get any other f when we do 
our study, we will get a margin of error smaller than planned. 
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Once you have a guess for f, the formula for the margin of error can be solved to 
give the sample size n needed. 


SAMPLE SIZE FOR DESIRED MARGIN OF ERROR 


To determine the sample size n that will yield a C% confidence interval for 

a population proportion p with a maximum margin of error ME, solve the 

following inequality for n: 

pe?) 
n 


zZ = ME 


where f is a guessed value for the sample proportion. The margin of error 
will always be less than or equal to ME if you use fp = 0.5. 


Here’s an example that shows you how to determine the sample size. 


Customer Satisfaction 
Determining sample size 


A company has received complaints about its customer service. ‘The managers intend 
to hire a consultant to carry out a survey of customers. Before contacting the con- 
sultant, the company president wants some idea of the sample size that she will be 
required to pay for. One critical question is the degree of satisfaction with the com- 
pany’s customer service, measured on a 5-point scale. The president wants to estimate 
the proportion p of customers who are satisfied (that is, who choose either “somewhat 
satisfied” or “very satisfied,” the two highest levels on the 5-point scale). She decides 
that she wants the estimate to be within 3% (0.03) at a 95% confidence level. How 
large a sample is needed? 


PROBLEM: Determine the sample size needed to estimate pwithin 0.03 with 95% confidence. 


SOLUTION: The critical value for 95% confidence is z* = 1.96. We have no idea about the true 
proportion p of satisfied customers, so we decide to use p = 0.5 as our guess. Because the company 
president wants a margin of error of no more than 0.03, we need to solve the inequality 


0.5(1 — 0.5) 
196.) 0S 


for n. Multiplying both sides by Vn and then dividing both sides by 0.03 yields 
1.96 


—— 0007 
0.03 


Gal —0.5)<n 


Squaring both sides gives 


0.03 
1067.111 <n 
We round up to 1068 respondents to ensure that the margin of error is no more than 3%. 


For Practice Try Exercise 


Section 8.2 Estimating a Population Proportion 503 


Why not round to the nearest whole number—in this case, 1067? Because a 
smaller sample size will result in a larger margin of error, possibly more than the 
desired 3% for the poll. 

If you want a 2.5% margin of error rather than 3%, then 

1.96 


z 
— (<=) (0.5)(1 — 0.5) = 1536.64 > n = 1537 


For a 2% margin of error, the sample size you need is 


1.96\? 
a ee = = 
= (Fas) (0.5)(1 — 0.5) = 2401 
As usual, smaller margins of error call for larger samples. 

News reports frequently describe the results of surveys with sample sizes be- 
tween 1000 and 1500 and a margin of error of about 3%. These surveys generally 
use sampling procedures more complicated than a simple random sample, so the 
calculation of confidence intervals is more involved than what we have studied in 
this section. The calculations of the previous example still give you a rough idea 
of how such surveys are planned. 


CHECK YOUR UNDERSTANDING 


Refer to the previous example about the company’s customer satisfaction survey. 


1. In the company’s prior-year survey, 80% of customers surveyed said they were “some- 
what satisfied” or “very satisfied.” Using this value as a guess for f, find the sample size 
needed for a margin of error of 3% at a 95% confidence level. 


2. What if the company president demands 99% confidence instead? Determine how 
this would affect your answer to Question I. 


Sra olummary 


e The conditions for constructing a confidence interval about a population 
proportion are 
e Random: The data were produced by a well-designed random sample 
or randomized experiment. 
© 10%: When sampling without replacement, we check that the 
population is at least 10 times as large as the sample. 
e Large Counts: The sample is large enough that nf and n(1 — ), the 
counts of successes and failures in the sample, are both at least 10. 
e Confidence intervals for a population proportion p are based on the sampling 
distribution of the sample proportion p. When the conditions for inference 
are met, the sampling distribution of f is approximately Normal with mean p 


and standard deviation Vp(1 — p)/n. 
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e In practice, we use the sample proportion / to estimate the unknown param- 
eter p. We therefore replace the standard deviation of f with its standard error 
when constructing a confidence interval. The C% confidence interval for p is 

F p(l — p) 

p= Zz eZ 

n 

where z* is the standard Normal critical value with C% of its area between —z* 

andi 


e When constructing a confidence interval, follow the four-step process: 


STEP STATE: What parameter do you want to estimate, and at what confidence 
4 level? 
L. PLAN: Identify the appropriate inference method. Check conditions. 


DO: If the conditions are met, perform calculations. 
CONCLUDE: Interpret your interval in the context of the problem. 


e The sample size needed to obtain a confidence interval with approximate 
margin of error ME for a population proportion involves solving 


R16 

= p(1 — p) = ME 
n 

for n, where / is a guessed value for the sample proportion, and z” is the criti- 


cal value for the confidence level you want. Use p = 0.5 if you don’t have a 
good idea about the value of p. 


TECHNOLOGY 
CORNER 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


15. Confidence interval for a population proportion 


Sesame Exercises 


For Exercises 27 to 30, check whether each of the condi- at his college. Thirty-eight of those interviewed think 
tions is met for calculating a confidence interval for the tuition is too high. 


population proportion p. 29. AIDS and risk factors In the National AIDS Be- 


27. Rating school food Latoya wants to estimate what havioral Surveys sample of 2673 adult heterosexuals, 
m1) 494| proportion of the seniors at her boarding high school 0.2% had both received a blood transfusion and had 
& like the cafeteria food. She interviews an SRS of 50 a sexual partner from a group at high risk of AIDS. 

of the 175 seniors living in the dormitory. She finds We want to estimate the proportion p in the popula- 
that 14 think the cafeteria food is good. tion who share these two risk factors. 

28. High tuition costs Glenn wonders what proportion 30. Whelks and mussels ‘The small round holes you 
of the students at his school believe that tuition is too often see in sea shells were drilled by other sea 


high. He interviews an SRS of 50 of the 2400 students creatures, who ate the former dwellers of the shells. 


3B: 
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Whelks often drill into mussels, but this behavior 
appears to be more or less common in different 
locations. Researchers collected whelk eggs from 
the coast of Oregon, raised the whelks in the labora- 
tory, then put each whelk in a container with some 
delicious mussels. Only 9 of 98 whelks drilled into a 
mussel.'! The researchers want to estimate the pro- 
portion p of Oregon whelks that will spontaneously 
drill into mussels. 


98% confidence Find z* for a 98% confidence 
interval using Table A or your calculator. Show your 
method. 


93% confidence Find z* for a 93% confidence 
interval using ‘Table A or your calculator. Show your 
method. 


Going to the prom Tonya wants to estimate what 
proportion of her school’s seniors plan to attend the 
prom. She interviews an SRS of 50 of the 750 seniors 
in her school and finds that 36 plan to go to the 
prom. 


Identify the population and parameter of interest. 


Check conditions for constructing a confidence 
interval for the parameter. 


Construct a 90% confidence interval for p. Show 
your method. 


Interpret the interval in context. 


Reporting cheating What proportion of students 
are willing to report cheating by other students? A 
student project put this question to an SRS of 172 
undergraduates at a large university: “You witness 
two students cheating on a quiz. Do you go to the 
professor?” Only 19 answered “Yes.”!” 


Identify the population and parameter of interest. 


Check conditions for constructing a confidence 
interval for the parameter. 


Construct a 99% confidence interval for p. Show 
your method. 


Interpret the interval in context. 


Binge drinking In a recent National Survey of Drug 
Use and Health, 2312 of 5914 randomly selected 
full-time U.S. college students were classified as 
binge drinkers." 


Calculate and interpret a 99% confidence interval 
for the population proportion p that are binge 
drinkers. 

A newspaper article claims that 45% of full-time U.S. 


college students are binge drinkers. Use your result 
from part (a) to comment on this claim. 


37. 


38. 
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Ae 


. Teens’ texting A Pew Internet and American Life 


Project survey found that 392 of 799 randomly 
selected teens reported texting with their friends 
every day. 


Calculate and interpret a 95% confidence interval 
for the population proportion p that would report 
texting with their friends every day. 


Is it plausible that the true proportion of American 
teens who text with their friends every day is 0.45? 
Use your result from part (a) to support your answer. 


Binge drinking Describe a possible source of error 
that is not included in the margin of error for the 
99% confidence interval in Exercise 35. 


Teens’ texting Describe a possible source of error 
that is not included in the margin of error for the 
95% confidence interval in Exercise 36. 


How common is SAT coaching? A random sample 
of students who took the SAT college entrance 
examination twice found that 427 of the respondents 
had paid for coaching courses and that the remain- 
ing 2733 had not.'* Construct and interpret a 99% 
confidence interval for the proportion of coaching 
among students who retake the SAT’. 


2010 begins In January 2010 a Gallup Poll asked a 
random sample of adults, “In general, are you satisfied 
or dissatisfied with the way things are going in the 
United States at this time?” In all, 256 said that they 
were satisfied and the remaining 769 said they were 
not. Construct and interpret a 90% confidence inter- 
val for the proportion of adults who are satisfied with 
how things are going. 


Equality for women? Have efforts to promote equal- 
ity for women gone far enough in the United States? 
A poll on this issue by the cable network MSNBC 
contacted 1019 adults. A newspaper article about the 
poll said, “Results have a margin of sampling error of 
plus or minus 3 percentage points.”!” 


The news article said that 65% of men, but only 
43% of women, think that efforts to promote equality 
have gone far enough. Explain why we do not have 
enough information to give confidence intervals for 
men and women separately. 


Would a 95% confidence interval for women alone 
have a margin of error less than 0.03, about equal to 
0.03, or greater than 0.03? Why? (You see that the 
news article’s statement about the margin of error for 
poll results is a bit misleading.) 


ATV poll A television news program conducts a 
call-in poll about a proposed city ban on handgun 
ownership. Of the 2372 calls, 1921 oppose the ban. 
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The station, following recommended practice, makes 
a confidence statement: “81% of the Channel 13 
Pulse Poll sample opposed the ban. We can be 95% 
confident that the true proportion of citizens opposing 
a handgun ban is within 1.6% of the sample result.” Is 
the station’s conclusion justified? Explain. 


Can you taste PTC? PTC is a substance that has 

a strong bitter taste for some people and is tasteless 
for others. The ability to taste PT'C is inherited. About 
75% of Italians can taste PTC, for example. You want 
to estimate the proportion of Americans who have at 
least one Italian grandparent and who can taste PTC. 


How large a sample must you test to estimate the 
proportion of PTC tasters within 0.04 with 90% 
confidence? Answer this question using the 75% 
estimate as the guessed value for f. 


Answer the question in part (a) again, but this time 
use the conservative guess f = 0.5. By how much do 
the two sample sizes differ? 


School vouchers A national opinion poll found that 
44% of all American adults agree that parents should 
be given vouchers that are good for education at any 
public or private school of their choice. The result 
was based on a small sample. 


How large an SRS is required to obtain a margin 
of error of 0.03 (that is, +3%) in a 99% confidence 
interval? Answer this question using the previous 
poll’s result as the guessed value for p. 


Answer the question in part (a) again, but this time 
use the conservative guess / = 0.5. By how much do 
the two sample sizes differ? 


Election polling Gloria Chavez and Ronald Flynn 
are the candidates for mayor in a large city. We want to 
estimate the proportion p of all registered voters in the 
city who plan to vote for Chavez with 95% confidence 
and a margin of error no greater than 0.03. How large a 
random sample do we need? Show your work. 


Starting a nightclub A college student organization 
wants to start a nightclub for students under the age 
of 21. To assess support for this proposal, they will 
select an SRS of students and ask each respondent if 
he or she would patronize this type of establishment. 
What sample size is required to obtain a 90% conti- 
dence interval with an approximate margin of error 


of 0.04? Show your work. 


Teens and their T'V sets According to a Gallup Poll 
report, 64% of teens aged 13 to 17 have ‘TVs in their 
rooms. Here is part of the footnote to this report: 


These results are based on telephone interviews with a 
randomly selected national sample of 1028 teenagers 


48. 
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in the Gallup Poll Panel of households, aged 13 to 17. 
For results based on this sample, one can say .. . that 
the maximum error attributable to sampling and other 
random effects is +3 percentage points. In addition 

to sampling error, question wording and practical dif- 
ficulties in conducting surveys can introduce error or 
bias into the findings of public opinion polls.'° 


We omitted the confidence level from the footnote. 
Use what you have learned to determine the confi- 
dence level, assuming that Gallup took an SRS. 


Give an example of a “practical difficulty” that could 
lead to biased results for this survey. 


Gambling and the NCAA Gambling is an issue of 
great concern to those involved in college athletics. 
Because of this concer, the National Collegiate 
Athletic Association (NCAA) surveyed randomly 
selected student athletes concerning their gambling- 
related behaviors.!’ Of the 5594 Division I male 
athletes in the survey, 3547 reported participation 

in some gambling behavior. This includes playing 
cards, betting on games of skill, buying lottery tickets, 
betting on sports, and similar activities. A report of 
this study cited a 1% margin of error. 


The confidence level was not stated in the report. 
Use what you have learned to find the confidence 
level, assuming that the NCAA took an SRS. 


The study was designed to protect the anonymity 
of the student athletes who responded. As a result, 
it was not possible to calculate the number of stu- 
dents who were asked to respond but did not. How 
does this fact affect the way that you interpret the 
results? 


Multiple choice: Select the best answer for Exercises 49 
to 52. 


Os 


A Gallup Poll found that only 28% of American 
adults expect to inherit money or valuable posses- 
sions from a relative. The poll’s margin of error was 
+3 percentage points at a 95% confidence level. 
This means that 


the poll used a method that gets an answer within 3% 
of the truth about the population 95% of the time. 


the percent of all adults who expect an inheritance is 


between 25% and 31%. 


if Gallup takes another poll on this issue, the results 
of the second poll will lie between 25% and 31%. 


there’s a 95% chance that the percent of all adults 
who expect an inheritance is between 25% and 31%. 


Gallup can be 95% confident that between 25% and 
31% of the sample expect an inheritance. 


50. 


52. 


(a) 


Most people can roll their tongues, but many can’t. 
The ability to roll the tongue is genetically deter- 
mined. Suppose we are interested in determining 
what proportion of students can roll their tongues. 
We test a simple random sample of 400 students 
and find that 317 can roll their tongues. The mar- 
gin of error for a 95% confidence interval for the 
true proportion of tongue rollers among students is 
closest to 


0.0008. (c) 0.03. (e) 0.05. 
0.02. (d) 0.04. 


. You want to design a study to estimate the propor- 


tion of students at your school who agree with the 
statement, “The student government is an effective 
organization for expressing the needs of students to 
the administration.” You will use a 95% confidence 
interval, and you would like the margin of error to be 
0.05 or less. The minimum sample size required is 


22. (b) 271. (c) 385. (d) 769. (e) 1795. 


A newspaper reporter asked an SRS of 100 residents 
ina large city for their opinion about the mayor's 

job performance. Using the results from the sample, 
the C% confidence interval for the proportion of all 
residents in the city who approve of the mayor's job 


performance is 0.565 to 0.695. What is the value of C? 
B2 (P8651) 90 id) os fe) 98 


Exercises 53 and 54 refer to the following setting. The fol- 
lowing table displays the number of accidents at a factory 


during each hour of a 24-hour shift (1 = 1:00 a.m.). 


WHAT YOU WILL LEARN 


State and check the Random, 10%, and Normal/ 


Large Sample conditions for constructing a confidence 
interval for a population mean. 


Explain how the t distributions are different from the 
standard Normal distribution and why it is necessary 
to use a f distribution when calculating a confidence 
interval for a population mean. 
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Hour Number of accidents Hour Number of accidents 
1 5) 13 21 
2 8 14 12 
8} 17 1S 10 
4 oil 16 1 
5 24 17 0 
6 18 18 1 
il 2 19 3 
8 i 20 2 
9 1 21 23) 
10 0 22 18 
11 2 23 1 
12 14 24 2 
» 53. Accidents happen (1.2, 3.1) 
od (a) Construct a plot that displays the distribution of the 
number of accidents effectively. 

(b) Construct a plot that shows the relationship between 
the number of accidents and the time when they 
occurred. 

(c) Describe something that the plot in part (a) tells you 
about the data that the plot in part (b) does not. 

(d) Describe something that the plot in part (b) tells you 


about the data that the plot in part (a) does not. 


Accidents happen (1.3) Plant managers are 
concerned that the number of accidents may be 
significantly higher during the midnight to 8:00 a.m. 
shift than during the 4:00 p.m. to midnight shift. 
What would you tell them? Give appropriate statisti- 
cal evidence to support your conclusion. 


By the end of the section, you should be able to: 


Determine critical values for calculating a C% confidence 
interval for a population mean using a table or technology. 
Construct and interpret a confidence interval for a 
population mean. 

Determine the sample size required to obtain a C% 
confidence interval for a population mean with a 
specified margin of error. 
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mean(randNorm(M, 20,16)) 
240. 


FIGURE 8.10 The Normal sampling 
distribution of x for the mystery 
mean Activity. 


Inference about a population proportion usually arises when we study categorical 
variables. We learned how to construct and interpret confidence intervals for a 
population proportion p in Section 8.2. To estimate a population mean, we have 
to record values of a quantitative variable for a sample of individuals. It makes 
sense to try to estimate the mean amount of sleep that students at a large high 
school got last night but not their mean eye color! In this section, we'll examine 
confidence intervals for a population mean jL. 


The Problem of Unknown o 


Mr. Schiel’s class did the mystery mean Activity (page +76) and got a value of 
x = 240.80 from an SRS of size 16, as shown. 

Their task was to estimate the unknown population mean py. They knew that 
the population distribution was Normal and that its standard deviation was 
o = 20. Their estimate was based on the sampling distribution of x. Figure 
8.10 shows this Normal sampling distribution once again. 


Sampling 
distribution 
of x 


Lu 
(unknown) 


~— Values of x —— 


To calculate a 95% confidence interval for jz, we use our familiar formula: 
statistic + (critical value) - (standard deviation of statistic) 


The critical value, z* = 1.96, tells us how many standardized units we need to go 
out to catch the middle 95% of the sampling distribution. Our interval is 


7 ae «. 7 = 249,80 + 196-2 = 240.80 + 9.80 = 231.00, 250.6 
x2 Te 20 = LL. AG 80 + 9. (231.00, 250.60) 


We call such an interval a one-sample z interval for a population mean. 

This method isn’t very useful in practice, however. In most real-world settings, 
if we don’t know the population mean ju, then we don’t know the population stan- 
dard deviation o either. 

How do we estimate yu when the population standard deviation o is unknown? 
Our best guess for the value of o is the sample standard deviation s,. Maybe we 
could use the one-sample z interval for a population mean with s, in place of a: 
Sx 


He 


X= Zz 


S 


Let’s try it. 
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ACTIVITY | Calculator BINGO! 


MATERIALS: A farmer wants to estimate the mean weight (in grams) of all tomatoes grown on 

TI-83/84 for each student his farm. To do so, he will select a random sample of 4 tomatoes, calculate the 
mean weight (in grams), and use the sample mean X to create a 99% confidence 
interval for the population mean ju. Suppose that the weights of tomatoes on his 
farm are approximately Normally distributed with a mean of 100 grams and a 
standard deviation of 40 grams. 


1. Use your calculator to simulate taking an SRS of size 4 from this 


; = ae 10 
population and creating a one-sample z interval for pu: X + z* Pie 


4 
eee 226 a Enter the command shown below and press ENTER. 


randNorm(100,40,4)sL1:ZInterval 40,mean(L1),4,99 


Check to see whether the resulting interval captures 4p = 100. If it does 
not, shout “BINGO!” 


Keep pressing ENTER to generate more 99% confidence intervals. Check each 
To getthe randNormcommand, ; 5 “ _ 
press ancl airowitolPHe: interval to see whether it captures i = 100. Te at does not, shout “BINGO! 
The ZIntervalcommandisin lf this method of constructing confidence intervals is working properly, about 
the Catalog. To get the meanand ~— what percent of the time should you get a BINGO? Does the method seem to 
stdDev commands, press|2nd]_—_ be working? 
EIST) and alow to tig The method in Step 1 works well if we know the population standard deviation o. 
vee That’s rarely the case in real life. What happens if we use the sample standard devia- 

tion s, in place of o when calculating a confidence interval for the population mean? 


2. Use your calculator to simulate taking an SRS of size 4 from this population and 


eG 
Va ae ae 


creating a “modified” one-sample z interval for yu: ¥ + z* 
Enter the command shown below and press ENTER. 
randNorm(100,40,4)2L1:ZInterval stdDev (L1) ,mean(L1) ,4,99 


Check to see whether the resulting interval captures p14 = 100. If it does not, 
shout “BINGO!” 


Keep pressing ENTER to generate more 99% confidence intervals. Check each 
interval to see whether it captures 44 = 100. If it does not, shout “BINGO!” If this 
method of constructing confidence intervals is working properly, about what per- 
cent of the time should you get a BINGO? Does the method seem to be working? 


The figure on the next page shows the results of using an applet from www. 
rossmanchance.com to repeatedly construct confidence intervals as described in 
Step 2 of the Activity. Of the 1000 intervals constructed, only 923 (that’s 92.3%) 
captured the population mean. That’s far below our desired 99% confidence level. 
What went wrong? The intervals that missed (those in red) came from samples with 
small standard deviations s, and from samples in which X was far from the popula- 
tion mean ju. In those cases, multiplying s,/V4 by z* = 2.576 didn’t produce long 
enough intervals to reach : = 100. We need to multiply by a larger critical value to 
achieve a 99% capture rate. But what critical value should we use? 
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Recall that a statistic is a number 
computed from sample data. We 
know that the sample mean x is a 
statistic. So is the standardized value 


xXx— UE : weeks 
Z = ——. The sampling distribution 
o/Vn me 


of zshows the values this statistic 
takes in all possible SRSs of size n 
from the population. 
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Simulating Confidence Intervals 


Last Sample 


method: 


mean = 105, 
stdev = 23.39 


Means v aS 


zwith s v 


rv [roo 
n fs 
Intervals: [r00 
(“Sample —) 
oon level [ss * 


Intervals containing ja 


90/100=90.0% 


as 
0 100 200 
outcomes 


Sample Statistios 


mean = 99,30 
stdev = 20,74 


Running total 


923/1000 = 92.3% 


When o Is Unknown: The f Distributions 


When the sampling distribution of x is close to Normal, we can find probabilities 
involving x by standardizing: 
x — pb 
Zz => 
o/Vn 


Recall that the sampling distribution of x has mean y and standard deviation 
o/Vn, as shown in Figure 8.11(a). What are the shape, center, and spread of the 
sampling distribution of the new statistic z? 

From what we learned in Chapter 6, subtracting the constant jz from the values 
of the random variable x shifts the distribution left by 4 units, making the mean 0. 
This transformation doesn’t affect the shape or spread of the distribution. Dividing 


Sampling 
distribution Standard 
of X Normal curve 
X-u 
> xz 6 
oO vn 
vn 
a 
~— Values of ¥ ——> u=0 
(a) (0) 


FIGURE 8.11 (a) Sampling distribution of ¥ when the Normal/Large Sample condition is met. 
(b) Standardized values of x lead to the statistic z, which follows the standard Normal distribution. 
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by the constant ¢/Vn keeps the mean at 0, makes the standard deviation 1, and 
leaves the shape unchanged. As shown in Figure 8.11(b), z has the standard 
Normal distribution N(0, 1). Therefore, we can use Table A or a calculator to find 
the related probability involving z. That’s how we have gotten the critical values 
for our confidence intervals so far. 

When we don’t know g, we estimate it using the sample standard deviation s,. 
What happens now when we standardize? 


7=- Ft 
s,/Wn 
To find out, let’s start with a Normal population having mean ys = 100 and stan- 


dard deviation o = 5. We'll simulate choosing an SRS of size n = 4 and calculat- 
ing the sample mean xX. Then we will standardize the result in two ways: 


x — 100 d a x — 100 
= —__ an 2? = ——___ 
5/V4 s,/W4 


Figure 8.12 shows the results of taking 500 SRSs of size n = 4 and standard- 
izing the value of the sample mean x in both ways. The values of z follow a stan- 
dard Normal distribution, as expected. The standardized values we get, using the 
sample standard deviation s, in place of the population standard deviation o, show 
much greater spread. In fact, in a few samples, the statistic 


z 


x — 
2a7 Ft 


i s,/Vn 


took values below —6 or above 6. 

This statistic has a distribution that is new to us, called a t distribution. It has a 
different shape than the standard Normal curve: still symmetric with a single peak 
at 0, but with much more area in the tails. 


X— i ya 1 - 
jaa : os 3 < 
Sf e: in @ ® - 
x/ 8 2 8 2) 
Jn 37 1 
°o om oo °¢ ° ° i 0 o| 
4 #2 
8 -: , 2 
= W3) © eg a 3, 
Lim 4 ins ! 2 a | 
z=—_ 2:6 42 8 24 6 8 3 2 -1 «0 are. 4 
O/ t z 
/An 


FIGURE 8.12 Fathom simulation showing standardized values of the sample mean x in 500 SRSs. The 
statistic z follows a standard Normal distribution. Replacing o with s, yields a statistic with much greater vari- 
ability that doesn’t follow the standard Normal curve. The Normal probability plot for the t statistic shows the 
departure from Normality in the tails of the ¢ distribution. 


The statistic t has the same interpretation as any standardized statistic: it says 
how far x is from its mean sz in standard deviation units. There is a different t dis- 
tribution for each sample size. We specify a particular t distribution by giving its 
degrees of freedom (df). When we perform inference about a population mean ju 
using a t distribution, the appropriate degrees of freedom are found by subtracting 
1 from the sample size n, making df = n — 1. We will write the ¢ distribution with 
n — | degrees of freedom as t,,-; for short. 
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The ft distribution and the tinference 
procedures were invented by William 

S. Gosset (1876-1937). Gosset worked 
for the Guinness brewery, and his goal 
in life was to make better beer. He used 
his new t procedures to find the best 
varieties of barley and hops. Gosset’s 
statistical work helped him become 
head brewer. Because Gosset published 
under the pen name “Student,” you 

will often see the ft distribution called 
“Student’s t” in his honor. 


FIGURE 8.13 Density curves for the f distributions with 
2 and 9 degrees of freedom and the standard Normal 


THE t DISTRIBUTIONS; DEGREES OF FREEDOM 


Figure 8.13 compares the density curves of the standard Normal distribution 
and the t distributions with 2 and 9 degrees of freedom. The figure illustrates these 
facts about the t distributions: 


t distributions have 


more area in the tails 
than the standard a -— t,2 degrees 
Normal distribution. | /* s of freedom 


seen t, 9 degrees 
of freedom 

— Standard 
Normal 


) 
4 
‘* 
‘3 


distribution. Allare symmetric with center0.The tf ee 
distributions are somewhat more spread out. 0 


e The density curves of the t distributions are similar in shape to the standard 
Normal curve. They are symmetric about 0, single-peaked, and bell-shaped. 


e The spread of the ¢ distributions is a bit greater than that of the standard Nor- 
mal distribution. The t distributions in Figure 8.13 have more probability in 
the tails and less in the center than does the standard Normal. This is true 
because substituting the estimate s, for the fixed parameter o introduces more 
variation into the statistic. 


e As the degrees of freedom increase, the t density curve approaches the stan- 
dard Normal density curve ever more closely. This happens because s, esti- 
mates o more accurately as the sample size increases. So using s, in place of o 
causes little extra variation when the sample is large. 


‘Table B in the back of the book gives critical values t* for the ¢ distributions. 
Each row in the table contains critical values for the ¢ distribution whose degrees 
of freedom appear at the left of the row. For convenience, several of the more 
common confidence levels C are given at the bottom of the table. By looking 
down any column, you can check that the ¢ critical values approach the Normal 
critical values z* as the degrees of freedom increase. 

When you use Table B to determine the correct value of t* for a given confi- 
dence interval, all you need to know are the confidence level C and the degrees 
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of freedom (df). Unfortunately, Table B does not include every possible sample 

size. When the actual df does not appear in the table, use the greatest df available 

that is less than your desired df. This guarantees a wider confidence interval than 

you need to justify a given confidence level. Better yet, use technology to find an 
. accurate value of t* for any df. 


Finding t* Critical Values 
Using Table B 


PROBLEM: What critical value t* from Table B should be used in constructing a confidence 
interval for the population mean in each of the following settings? 

(a) A95% confidence interval based on an SRS of size n= 12. 

(b) A90% confidence interval from a random sample of 48 observations. 

SOLUTION: 

(a) In Table B, we consult the row corresponding to df = 12 — 1 = 11. We move across that row to 


the entry that is directly above 95% confidence level on the bottom of the chart. The desired critical 
value is ¢* = 2.201. 


(b) With 48 observations, we want to find the t* critical value for df = 48 — 1 = 47 and 90% 


confidence. There is no df = 47 rowin Table B, so we use the more conservative df = 40. The 
corresponding critical value is t* = 1.664. 


The bottom row of Table B gives 
Z* critical values and is labeled 
as df = 0. That’s because the t df = .05 02 01 025.02 
distributions approach the standard 10 1.812 2.359 2.764 2.042 2.147 
Normal distribution as the degrees 
of freedom approach infinity. 


Upper-tail probability p Upper-tail probability p 


12 1.782 
0 61.645 
90% 
Confidence level C Confidence level C 


For Practice Try Exercise 


For part (a) of the example, the corresponding standard Normal critical value 
for 95% confidence is z* = 1.96. We have to go out farther than 1.96 standard de- 
viations to capture the central 95% of the ¢ distribution with 11 degrees of freedom. 

Technology will quickly produce t* critical values for any sample size. 


CORNER INVERSE f ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Most newer T1-84 and TI-89 calculators allow you to find critical values t* using the inverse t command. As with the 
calculator’s inverse Normal command, you have to enter the area to the left of the desired critical value. Let’s use the 
inverse t command to find the critical values in parts (a) and (b) of the example. 
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TI-83/84 TI-89 


° Press [2nd][VARS] (DISTR) and choose invT (. e In the Statistics/List Editor, press |F5|, choose In- 
e For part (a), OS 2.55 or later: In the dialog box, NESS WeNipt IN Se 
enter these values: area: .025, df£:11, choose ¢ For part (a), enter Area: .025 and Deg of Freedom, 
Paste, and then press [ENTER| Older OS: Com- df: 11, and then press [ENTER |. 
plete the command invT(.025,11)and presse For part (b), use Area:.05 and df: 47. 


ENTER }. 


e For part (b), use the command invT(.05,47). 


NORMAL FLOAT AUTO REAL RADIAN CL o 


list3lil= 


Note that the ¢ critical values are t* = 2.201 and t* = 1.678, respectively. 


CHECK YOUR UNDERSTANDING 


Use Table B to find the critical value t* that you would use for a confidence interval for a 
population mean 4 in each of the following settings. If possible, check your answer with 
technology. 


1. A 96% confidence interval based on a random sample of 22 observations. 
2. A99% confidence interval from an SRS of 71 observations. 


Conditions for Estimating pu 


As with proportions, you should check some important conditions before con- 
structing a confidence interval for a population mean. Two of the conditions 
should be familiar by now. The Random condition is crucial for doing inference. If 
the data don’t come from a well-designed random sample or randomized experi- 
ment, you can’t draw conclusions about a larger population or about cause and 
effect. When sampling without replacement, the 10% condition ensures that our 
formula for the standard deviation of the statistic ¥ 


is approximately correct. 

The method we use to construct a confidence interval for jp depends on the fact 
that the sampling distribution of ¥ is approximately Normal. From Chapter 7, we 
know that the sampling distribution of x is Normal if the population distribution 
is Normal. When the population distribution is not Normal, the central limit 
theorem tells us that the sampling distribution of ¥ will be approximately Normal 
if the sample size is large enough (n = 30). Be sure to check this Normal/Large 
Sample condition before calculating a confidence interval. 
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CONDITIONS FOR CONSTRUCTING A 
CONFIDENCE INTERVAL ABOUT A MEAN 


Larger samples improve the accuracy of critical values from the t distributions 
when the population is not Normal. This is true for two reasons: 
1. The sampling distribution of x for large sample sizes is close to Normal. 
2. As the sample size n grows, the sample standard deviation s, will give a more 
. oO 


Vn Vn 


The Normal/Large Sample condition is obviously met if we know that the pop- 
ulation distribution is Normal or that the sample size is at least 30. What if 
we don’t know the shape of the population distribution and n < 30? In that case, 
we have to graph the sample data. Our goal is to answer the question, “Is it reason- 
able to believe that these data came from a Normal population?” 

How should graphs of data from small samples look if the population has a 
Normal distribution? The following Activity sheds some light on this question. 


accurate estimate of a. This is important because we use in place of 


when doing calculations. 


ACTIVITY | Sampling from a Normal Population 


MATERIALS: Let’s use the calculator to simulate choosing random samples of size n = 20 from 
TI-83/84 or TI-89 for each a Normal distribution with 44 = 100 and o = 15 and then to plot the data. 
student 


1. Choose an SRS of 20 observations from this Normal population. 


TI-83/84: Press |MATH |, arrow to PRB and choose randNorm (. Complete the 
command randNorm(100,15,20)-—>L1 and press |ENTER |, 
TI-89: Press |CATALOG]|F3|(Flash Apps), press |alpha|/2|(R) to jump 
to the r’s, and choose randNorm (... Complete the command 
tistat.randNorm(100,15,20)—list1 and press |ENTER |. 
2. Make a histogram, a boxplot, and a Normal probability plot of the data in 

LI /listl. Do you see any obvious departures from Normality in the graphs of the 
sample data? 


WwW 
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3. Repeat Steps | and 2 several times. Do the graphs of the sample data always 
look approximately Normal when the population distribution is Normal? 


4. Compare the results with those of your classmates. How easy do you think it 
will be to use a graph of sample data to determine whether or not a population 
has a Normal distribution? 


Did you expect that a random sample from a Normal population would yield a 
graph that looked Normal? Unfortunately, that’s usually not the case. The figure 
below shows boxplots from three different SRSs of size 20 chosen in Step 3 of the 
Activity. The left-hand graph is skewed to the right. The right-hand graph shows 
three outliers in the sample. Only the middle graph looks symmetric and has no 
outliers. 


IMORMAL FLOAT AUTO REAL RADIAN CL fl f} IMORMAL FLOAT AUTO REAL RADIAN CL of 
CL) —_ [re - Ir " . 
t 4 ‘ 4 + n i, 4 4 + 4 + i, 4 ‘ + + n + 


As the Activity shows, it is very difficult to use a graph of sample data to 
assess the Normality of a population distribution. If the graph has a skewed shape 
or if there are outliers present, it could be because the population distribution 
isn’t Normal. Skewness or outliers could also occur naturally in a ran- _ 
dom sample from a Normal population. To be safe, you should only use @ 
at distribution for small samples with no outliers or strong skewness. 

What constitutes strong skewness in a distribution? The following example 
gives you some idea. 


GPAs, Wood, and SATs 


Can we use t? 


PROBLEM: Determine if we can safely use a t* critical value to calculate a confidence interval for 
the population mean in each of the following settings. 


(a) To estimate the average GPA of students at your school, you randomly select 50 students from 
classes you take. Figure 8.14(a) is a histogram of their GPAs. 


(b) How much force does it take to pull wood apart? Figure 8.14(b) shows a stemplot of the force (in 
pounds) required to pull apart a random sample of 20 pieces of Douglas fir. 
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(c) Suppose you want to estimate the mean SAT Math score at a large high school. Figure 8.14(c) is 
a boxplot of the SAT Math scores for a random sample of 20 students at the school. 


= 


NY 2 Oo @Oo 


Key: 31/3 = 313 pounds 


Frequency of GPA 


—+-— 


20 25 30 35 4.0 400 500 600 700 800 
GPA SAT_Math 


FIGURE 8.14 Can we use a ft distribution for these data? (a) GPAs of 50 randomly selected stu- 
dents in your classes at school. (b) Force required to pull apart a random sample of 20 pieces of 
Douglas fir. (c) SAT Math scores for a random sample of 20 students at a large high school. 


SOLUTION: 


(a) No. Although the histogram is roughly symmetric with no outliers, the random sample of 50 
students was only from yourclasses and not from all students at your school. So we should not use 
these data to calculate a confidence interval for the mean GPA of all students at the school. 


(b) No. The graph is strongly skewed to the left with possible low outliers. We cannot trust a critical 
value from a t distribution with df = 19 in this case. 


(c) Yes. The distribution is only moderately skewed to the right and there are no outliers present. 


AP® EXAM TIP Ifa question on the AP® exam asks you to calculate a confidence interval, all 
the conditions should be met. However, you are still required to state the conditions and show 
evidence that they are met. 


For Practice Try Exercise 


THINK What’s the difference between “strongly skewed” and “mod- 
erately skewed”? Look at the stemplot in Figure 8.14(b) and the boxplot 
ABOUT IT in Figure 8.14(c). Compare the distance from the maximum to the median 
and from the median to the minimum in both graphs. In Figure 8.14(b), 
maximum —median = 336—319.5 = 16.5 and median—minimum = 319.5-230 = 
89.5. The half of the stemplot with smaller values is more than five times as long as 
the half of the stemplot with larger values. In Figure 8.14(c), maximum — median ~ 
775 — 525 = 250 and median — minimum ~ 525 — 375 = 150. The right half of 
the boxplot is not quite twice as long as the left half. 


r02.—S—O_>_ ee -—rHoeeoeewum ™ oe 


There is no accepted rule of thumb for identifying strong skewness. For that 
reason, we have chosen the data sets in examples and exercises to avoid borderline 
cases. You should be able to tell easily if strong skewness is present in a graph of 
data from a small sample. 
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As with proportions, some books 
refer to the standard deviation of the 
sampling distribution of x as the 
“standard error” and what we call 
the standard error of the mean as 
the “estimated standard error.” The 
standard error of the mean is often 
abbreviated SEM. 


ESTIMATING WITH CONFIDENCE 


Constructing a Confidence Interval for ps 


When the conditions are met, the sampling distribution of x has roughly a Nor- 
mal distribution with mean yp and standard deviation ¢/Vn. Because we don’t 
know o, we estimate it by the sample standard deviation s,. We then estimate the 


standard deviation of the sampling distribution with SE; = —— This value is 


n 
called the standard error of the sample mean <, or just the standard error of the 
mean. 


To construct a confidence interval for j, replace the standard deviation ¢/Vn 
of x by its standard error s,/Vn in the formula for the one-sample z interval for a 
population mean. Use critical values from the ¢ distribution with n — 1 degrees of 
freedom in place of the z critical values. That is, 


statistic + (critical value) - (standard deviation of statistic) 


Sx 


=xyitt 
n 
This one-sample t interval for a population mean is similar in both reasoning 
and computational detail to the one-sample z interval for a population proportion 
of Section 8.2. So we will now pay more attention to questions about using these 
methods in practice. 


THE ONE-SAMPLE f INTERVAL FOR A POPULATION MEAN 


The following example shows you how to construct a confidence interval 
for a population mean when a is unknown. By now, you should recognize the 
four-step process. 


df 
29 


40 


Upper-tail probability p 
05 
1.699 


1.684 
90% 
Confidence level C 
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Auto Pollution 
A one-sample t interval for ju 


STEP A 


Environmentalists, government officials, and vehicle manufacturers are all inter- 
ested in studying the auto exhaust emissions produced by motor vehicles. The 
major pollutants in auto exhaust from gasoline engines are hydrocarbons, carbon 
monoxide, and nitrogen oxides (NOX). Researchers collected data on the NOX 
levels (in grams/mile) for a random sample of 40 light-duty engines of the same 
type. The mean NOX reading was 1.2675 and the standard deviation was 0.3332. 


PROBLEM: 

(a) Construct and interpret a 95% confidence interval for the mean amount of NOX emitted by light- 
duty engines of this type. 

(b) The Environmental Protection Agency (EPA) sets a limit of 1.0 gram/mile for average NOX emis- 
sions. Are you convinced that this type of engine violates the EPA limit? Use your interval from (a) to 
Support your answer. 


SOLUTION: 


(a) STATE: We want to estimate the true mean amount ju of NOX emitted by all light-duty engines 
of this type at a 95% confidence level. 


PLAN: We should construct a one-sample t interval for /1 if the conditions are met. 


* Random: Thedata come froma random sample of 40 light-duty engines of this type. 
° 10%: We are sampling without replacement, so we need to assume that there are at least 
10(40) = 400 light-duty engines of this type. 
* Normal/Large Sample: We don’t know whether the population distribution of NOX emissions is 
Normal. Because the sample size is large (n = 40 = 30), we should be safe using a t distribution. 


DO: The formula for the one-sample t interval is 
x ote pte 
fn 
From the information given, x = 1.2675 g/miand 5, = 0.3332 g/mi. To find the critical value t*, 
we use the tdistribution with df = 40-1 = 39. Unfortunately, there is no row corresponding to 
39 degrees of freedom in Table B. We can’t pretend we have a larger sample size than we actually do, 
50 we use the more conservative df = 30. 


At a 95% confidence level, the critical value is t* = 2.042. So the 95% confidence interval for j1 is 


‘4.2675 + 2.0420 
Vig oe na 


Using technology: The command invT (.025,39) gives t = —2.023. Using the critical value 
t* = 2.023 for the 95% confidence interval gives 


Keane 


= 1.2675 = 0.1076 = (1.1599, 1.3751) 


S 5, 0.3332 
Kae ZOO == 2025 = 1.2675 = 0.1066 = (1.1609, 1.3741 
Vi V40 


This interval is slightly narrower than the one found using Table B. 


CONCLUDE: Weare 95% confident that the interval from 1.1609 to 1.37411 grams/mile 
captures the true mean level of nitrogen oxides emitted by this type of light-duty engine. 
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(b) The confidence interval from (a) tells us that any value from 1.1609 to 1.3741 g/miis a plausi- 
ble value of the mean NOX level pu for this type of engine. Because the entire interval exceeds 1.0, it 
appears that this type of engine violates EPA limits. 


For Practice Try Exercise 


Now that we’ve calculated our first confidence interval for a population mean 
ju, it’s time to make a simple observation. Inference for proportions uses z; in- 
ference for means uses t. That’s one reason why distinguishing categorical from 
quantitative variables is so important. 

Here is another example, this time with a smaller sample size. 


Video Screen Tension “A 
Constructing a confidence interval for L. 


A manufacturer of high-resolution video terminals must control the tension on 
the mesh of fine wires that lies behind the surface of the viewing screen. Too 
much tension will tear the mesh, and too little will allow wrinkles. The tension is 
measured by an electrical device with output readings in millivolts (mV). Some 
variation is inherent in the production process. Here are the tension readings from 
a random sample of 20 screens from a single day’s production: 


269.5 297.0 2696 283.3 3048 2804 233.5 2574 317.5 327.4 
264.7 307.7 3100 343.3 328.1 3426 3388 340.1 374.6 336.1 


Construct and interpret a 90% confidence interval for the mean tension yp of all 
the screens produced on this day. 


STATE: We want to estimate the true mean tension ju of all the video terminals produced this day 
with 90% confidence. 


PLAN: Ifthe conditions are met, we should use a one-sample t interval to estimate (1. 


* Random: Weare told that the data come from a random sample of 20 screens produced 
that day. 
° 10%: Because we are sampling without replacement, we must assume that at least 10(20) = 
200 video terminals were produced this day. 
* Normal/Large Sample: Because the sample size is small (n = 20), we must check whether it's 
reasonable to believe that the population distribution is Normal. So we examine the sample data. 
Figure 8.15 shows (a) a dotplot, (b) a boxplot, and (c) a Normal probability plot of the tension 
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(b) 
FIGURE 8.15 (a) A dotplot, (b) boxplot, and (c) Normal probability plot of the video screen tension 
readings. 


When the sample size is small 

(n < 30), as in this example, the 
Normal condition is about the shape 
of the population distribution. We look 
at a graph of the sample data to see 
if it’s believable that the data came 
from a Normal population. 


Upper-tail probability p 


Confidence level C 
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,. 


readings in the sample. Neither the dotplot nor the boxplot shows strong skewness or any outliers. 
The Normal probability plot looks roughly linear. These graphs give us no reason to doubt the 
Normality of the population. 


DO: Weused our calculator to find the mean and standard deviation of the tension readings for the 
20 screens inthe sample: x = 306.32 mV and 5, = 36.21 mV. We use the t distribution with 

df = 19 to find the critical value. For a 90% confidence level, the critical value is t* = 1.729. 

So the 90% confidence interval for puis 


- 5 36.21 
Salih = 0002 2s 29 ——— — 50652022 14:00 =1292- S255 20:52 
a arr 20 ( ) 


Using technology: The calculator’s invT (.05,19) gives t =—1.729, which matches the 
t* = 1.729 critical value we got from the table. 


CONCLUDE: Weare 90% confident that the interval from 292.32 to 320.32 mV captures the 
true mean tension in the entire batch of video terminals produced that day. 


For Practice Try Exercise 169 


AP® EXAM TIP Itis not enough just to make a graph of the data on your calculator when 
assessing Normality. You must sketch the graph on your paper to receive credit. You don’t have to 
draw multiple graphs—any appropriate graph will do. 


17. TECHNOLOGY 


As you probably guessed, your calculator will compute a one-sample t interval 
for a population mean from sample data or summary statistics. 


ONE-SAMPLE fINTERVALS FOR su 
ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Confidence intervals for a population mean using ¢ distributions can be constructed on the TI-83/84 and TI-89, thus 
avoiding the use of Table B. Here is a brief summary of the techniques when you have the actual data values and when 
you have only numerical summaries. 

TI-83/84 TI-89 


1. Using summary statistics (see auto pollution example, page 519) 


From the home screen, 


e Press [STAT] arrow over 
ER araiteeiravictell eee 


From inside the Statistics/List Editor, 
to TESTS, and choose ¢ Press ( [F7]) to go into the intervals (Ints) 


menu, then choose TInterval.... 


¢ On the TInterval screen, adjust your settings asshown ¢ Choose “Stats” as the Data Input Method. 


and choose Calculate. 


IHORHAL FLOAT AUTO REAL RADIAN CL fi 


Tinterval | 
Inet :Data 


C-Level: .95 
Calculate 


¢ On the TInterval screen, adjust your settings as shown 


and press [ENTER |. 


sy 
CEnter=0K _) CESC=CANCEL > 


11st1=(0,1,2,5,4,5,6,7,8,. 
TYPE + TENTERI=O AND (ESCI=CANCEL 
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HORHAL FLOAT AUTO REAL RADIAN CL fi 


(1.1609,1.3741) 
x=1. 2675 

Sx=. 3332 

n=48 


1isti={0,1,2,3,4,5,6,7,8,_ 
Main RAD AUTO FUNC ive 


2. Using raw data (see video screen tension example, page 520) 
Enter the 20 video screen tension readings data in L1 /list]. Proceed to the Interval screen as in Step 1, but choose Data 
as the input method. ‘Then adjust your settings as shown and calculate the interval. 


o 


MORMAL FLOAT AUTO REAL RADIAN CL 


Inet :BERE) Stats 
List:La 
(Spt 9 
—~Level:. ‘ 
Calculate clear «ds 


ESC= CANCEL 
a Laas] 


1isti[21)= 
TYPE + (ENTERI=UK AND [ESCI=CRNCEL 


MORMAL FLOAT AUTO REAL RADIAN CL f 


TInterval 
(292. 32.320.32) 
K=306.32 
Sx=36. 20928349 
n=20 


MaIN RAD AUTO FUNE ives 


CHECK YOUR UNDERSTANDING 


Biologists studying the healing of skin wounds measured the rate at which new cells closed 
a cut made in the skin of an anesthetized newt. Here are data from a random sample of 
18 newts, measured in micrometers (millionths of a meter) per hour:!? 


29 27 34 40 22 28 14 35 26 35 12 30 23 18 Il 22 23 33 


Calculate and interpret a 95% confidence interval for the mean healing rate pu. 


Choosing the Sample Size 


A wise user of statistics never plans data collection without planning the inference 
at the same time. You can arrange to have both high confidence and a small mar- 
gin of error by taking enough observations. When the population standard devia- 
tion o is unknown and conditions are met, the C % confidence interval for pu is 


Sx 


ra 


S 


There are other methods of determining 
sample size that do not require us to 
use a known value of the population 
standard deviation o. These methods 
are beyond the scope of this text. Our 
advice: consult with a statistician when 
planning your study! 
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where ¢* is the critical value for confidence level C and degrees of freedom df = n- 1. 
The margin of error (ME) of the confidence interval is 
Sy 


ME =t* 
Vn 


To determine the sample size for a desired margin of error, it makes sense to 
set the expression for ME less than or equal to the specified value and solve the 
inequality for n. There are two problems with this approach: 


1. We don’t know the sample standard deviation s, because we haven’t produced 
the data yet. 
2. The critical value t* depends on the sample size n that we choose. 


The second problem is more serious. To get the correct value of t*, we need to 
know the sample size. But that’s what we’re trying to find! There is no easy solu- 
tion to this problem. 

One alternative (the one we recommend!) is to come up with a reasonable es- 
timate for the population standard deviation o from a similar study that was done 
in the past or from a small-scale pilot study. By pretending that a is known, we can 
use the one-sample z interval for pu: 


x a 2 


Using the appropriate standard Normal critical value z* for confidence level C, 
we can solve 


z* = = ME 
nN 


for n. Here is a summary of this strategy. 


CHOOSING SAMPLE SIZE FOR A DESIRED MARGIN 
OF ERROR WHEN ESTIMATING jz 


The procedure is best illustrated with an example. 
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How Many Monkeys? 


Determining sample size from margin of error 


Researchers would like to estimate the mean cholesterol level jz of a 
particular variety of monkey that is often used in laboratory experi- 
ments. They would like their estimate to be within | milligram per 
deciliter (mg/dl) of the true value of ys at a 95% confidence level. 
A previous study involving this variety of monkey suggests that the 
standard deviation of cholesterol level is about 5 mg/dl. 


PROBLEM: Obtaining monkeys for research is time-consuming, expensive, and 
controversial. What is the minimum number of monkeys the researchers will need 
to get a satisfactory estimate? 

SOLUTION: For 95% confidence, z* = 1.96. We will use o = 5 as our best 


guess for the standard deviation of the monkeys’ cholesterol level. Set the expres- 
sion for the margin of error to be at most 1 and solve for n: 


5 
1.96 —— = 1 
Vn 


19018) 5 
96.04 =n 


Remember: always round up to the 


ae Because 96 monkeys would give a slightly larger margin of error than desired, the researchers would 
next whole number when finding 7. 


need 97 monkeys to estimate the cholesterol levels to their satisfaction. (On learning the cost of 
getting this many monkeys, the researchers might want to consider studying rats instead!) 


For Practice Try Exercise 


‘Taking observations costs time and money. The required sample size 
may be impossibly expensive. Notice that it is the size of the sample that 
determines the margin of error. The size of the population does not influ- 
ence the sample size we need. This is true as long as the population is much larger 
than the sample. 


CHECK YOUR UNDERSTANDING 


Administrators at your school want to estimate how much time students spend on home- 
work, on average, during a typical week. They want to estimate ju at the 90% confidence 
level with a margin of error of at most 30 minutes. A pilot study indicated that the standard 
deviation of time spent on homework per week is about 154 minutes. 

How many students need to be surveyed to meet the administrators’ goal? Show your 
work. 
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Refer to the chapter-opening Case Study on page 475. The bank 
manager wants to know whether or not the bank’s customer ser- 
vice agents generally met the goal of answering incoming calls 
in less than 30 seconds. We can approach this question in two 
ways: by estimating the proportion p of all calls that were an- 
swered within 30 seconds or by estimating the mean response 
time [U. 
Some graphs and numerical summaries of the data are provided below. 


Frequency 


T T T T 
718 15.0 22.5 300 375 45.0 10 20 30 40 


Call response time (seconds) Call response time (seconds) 


Descriptive Statistics: Call response time (sec) 
Variable N Mean SE Mean StDev Minimum Ql Median Q3 Maximum 
Call response time (sec) 241 18.353 0.758 11.761 1.000 9.000 16.000 25.000 49.000 


1. Describe the distribution of call response times for the random 
sample of 241 calls. 

2. About what proportion of the call response times in the sample 
were less than 30 seconds? Explain how you got your answer. 
The bank’s manager would like to estimate the true proportion p 
of calls to the bank’s customer service center that are answered in 
less than 30 seconds. 

(a) What conditions must be met to calculate a 95% confidence 
interval for p? Show that the conditions are met in this case. 

(b) Explain the meaning of 95% confidence in this setting. 

(c) A 95% confidence interval for p is (0.783, 0.877). Give the 
margin of error and show how it was calculated. 

(d) Interpret the interval from part (c) in context. 

Construct and interpret a 95% confidence interval for the true 

mean response time of calls to the bank’s customer service center. 

Is the customer service center meeting its goal of answering calls 

in less than 30 seconds? Give appropriate evidence to support 

your answer. 
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Section 8.3 Bs viiluelay 


TECHNOLOGY 
CORNERS 


Confidence intervals for the mean jz of a Normal population are based on the 
sample mean X ofan SRS. If we somehow know g, we use the z critical value and 
the standard Normal distribution to help calculate confidence intervals. 


In practice, we usually don’t know a. Replace the standard deviation ¢/Vn 
of the sampling distribution of x by the standard error SE; = s,/\/n and use 
the t distribution with n — | degrees of freedom (df). 


There is a t distribution for every positive degrees of freedom. All t distributions 
are unimodal, symmetric, and centered at 0. The ¢ distributions approach the 
standard Normal distribution as the number of degrees of freedom increases. 


The conditions for constructing a confidence interval about a population 
mean are 


e Random: The data were produced by a well-designed random sample 
or randomized experiment. 
° 10%: When sampling without replacement, check that the population 
is at least 10 times as large as the sample. 


e¢ Normal/Large Sample: The population distribution is Normal or the 
sample size is large (n = 30). When the sample size is small (n < 30), 
examine a graph of the sample data for any possible departures from 
Normality in the population. You should be safe using a ¢ distribution 
as long as there is no strong skewness and no outliers are present. 
When conditions are met, a C% confidence interval for the mean J is given 
by the one-sample t interval 


rte 


S 


The critical value t* is chosen so that the t curve with n — 1| degrees of free- 
dom has C% of the area between —t* and t*. 


Follow the four-step process—State, Plan, Do, Conclude—whenever you are 
asked to construct and interpret a confidence interval for a population mean 
Remember: inference for proportions uses z; inference for means uses t. 
The sample size needed to obtain a confidence interval with approximate 
margin of error ME for a population mean involves solving 


oO 
*— = ME 
s Vn 


for n, where the standard deviation o is a reasonable value from a previous or 
pilot study, and z* is the critical value for the level of confidence we want. 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


16. Inverse t on the calculator page 513 


17. One-sample t intervals for y on the calculator page 521 


Critical values What critical value t* from Table B 


pgfk) would you use for a confidence interval for the popu- 


58. 


lation mean in each of the following situations? 
A95% confidence interval based on n = 10 
randomly selected observations 

A 99% confidence interval from an SRS of 
20 observations 


A 90% confidence interval based on a random 
sample of 77 individuals 

Critical values What critical value t* from ‘Table B 
should be used for a confidence interval for the 
population mean in each of the following situations? 
A 90% confidence interval based on n = 12 ran- 
domly selected observations 


A 95% confidence interval from an SRS of 

30 observations 

A 99% confidence interval based on a random 
sample of size 58 

Pulling wood apart How heavy a load (pounds) is 
needed to pull apart pieces of Douglas fir + inches 
long and 1.5 inches square? A random sample 

of 20 similar pieces of Douglas fir from a large 
batch was selected for a science class. The Fathom 
boxplot below shows the class’s data. Explain why 
it would not be wise to use a t critical value to 
construct a confidence interval for the population 
mean [. 


26 28 30 32 34 
Load (thousands) 


22 24 


Weeds among the corn Velvetleaf is a particularly 
annoying weed in cornfields. It produces lots of 
seeds, and the seeds wait in the soil for years until 
conditions are right for sprouting. How many seeds 
do velvetleaf plants produce? The Fathom histogram 
below shows the counts from a random sample of 28 
plants that came up in a cornfield when no herbicide 
was used.”” Explain why it would not be wise to use a 
t critical value to construct a confidence interval for 
the mean number of seeds jz produced by velvetleaf 
plants. 


Frequency of 
Seeds 
N tt 3] 


4 he Wik a OF te 
Seeds (thousands) 
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Should we use t? Determine whether we can safely 
use a ¢* critical value to calculate a confidence inter- 
val for the population mean in each of the following 
settings. 

We collect data from a random sample of adult resi- 
dents in a state. Our goal is to estimate the overall per- 
cent of adults in the state who are college graduates. 
The coach of a college men’s basketball team records 
the resting heart rates of the 15 team members. We 
use these data to construct a confidence interval for 
the mean resting heart rate of all male students at this 
college. 

Do teens text more than they call? ‘To find out, an 
AP® Statistics class at a large high school collected 
data on the number of text messages and calls sent or 
received by each of 25 randomly selected students. 
The Fathom boxplot below displays the difference 
(texts — calls) for each student. 


20 0 20 40 60 80 
diff 


100 120 


Should we use t? Determine whether we can safely use 
a t* critical value to calculate a confidence interval for 
the population mean in each of the following settings. 
We want to estimate the average age at which U.S. 
presidents have died. So we obtain a list of all U.S. 
presidents who have died and their ages at death. 
How much time do students spend on the Internet? 
We collect data from the 32 members of our AP® 
Statistics class and calculate the mean amount of time 
that each student spent on the Internet yesterday. 

Judy is interested in the reading level of a medical jour- 
nal. She records the length of a random sample of 100 
words. ‘The Minitab histogram below displays the data. 


10 12 


6 8 
Word length 


Blood pressure A medical study finds that x = 114.9 
and s, = 9.3 for the seated systolic blood pressure of the 
27 members of one treatment group. What is the stan- 
dard error of the mean? Interpret this value in context. 
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Travel time to work A study of commuting times 
reports the travel times to work of a random sample 
of 20 employed adults in New York State. ‘The mean 
is X = 31.25 minutes, and the standard deviation is 
s, = 21.88 minutes. What is the standard error of the 
mean? Interpret this value in context. 


Willows in Yellowstone Writers in some fields sum- 
marize data by giving x and its standard error rather 
than x and s,. Biologists studying willow plants in 
Yellowstone National Park reported their results in a 
table with columns labeled x + SE. The table entry 
for the heights of willow plants (in centimeters) in 
one region of the park was 61.55 + 19.03.7! The 
researchers measured a total of 23 plants. 


Find the sample standard deviation s, for these mea- 
surements. Show your work. 


A hasty reader believes that the interval given in the 
table is a 95% confidence interval for the mean height 
of willow plants in this region of the park. Find the 
actual confidence level for the given interval. 


Blink When two lights close together blink alternately, 
we “see” one light moving back and forth if the time 
between blinks is short. What is the longest interval of 
time between blinks that preserves the illusion of mo- 
tion? Ask subjects to turn a knob that slows the blinking 
until they “see” two lights rather than one light moving. 
A report gives the results in the form “mean plus or 
minus the standard error of the mean.””” Data for 12 
subjects are summarized as 25] + 45 (in milliseconds). 


Find the sample standard deviation s, for these mea- 
surements. Show your work. 


A hasty reader believes that the interval given in the 
report is a 95% confidence interval for the popula- 
tion mean. Find the actual confidence level for the 
given interval. 


Bone loss by nursing mothers Breast-feeding moth- 
ers secrete calcium into their milk. Some of the 
calcium may come from their bones, so mothers may 
lose bone mineral. Researchers measured the percent 
change in bone mineral content (BMC) of the spines 
of +7 randomly selected mothers during three months 
of breast-feeding.”’ The mean change in BMC was 

— 3.587% and the standard deviation was 2.506%. 


Construct and interpret a 99% confidence interval 
to estimate the mean percent change in BMC in the 
population. 

Based on your interval from part (a), do these data 
give good evidence that on the average nursing 
mothers lose bone mineral? Explain. 


Reading scores in Atlanta The Trial Urban District 
Assessment (‘T'UDA) is a government-sponsored study 
of student achievement in large urban school districts. 
TUDA gives a reading test scored from 0 to 500. A 
score of 243 is a “basic” reading level and a score of 
281 is “proficient.” Scores for a random sample of 
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1470 eighth-graders in Atlanta had ¥ = 240 with 
standard deviation 42.17.7* 


Calculate and interpret a 99% confidence interval 
for the mean score of all Atlanta eighth-graders. 
Based on your interval from part (a), is there good 
evidence that the mean for all Atlanta eighth-graders 
is less than the basic level? Explain. 

Men and muscle Ask young men to estimate their 
own degree of body muscle by choosing from a set of 
100 photos. Then ask them to choose what they be- 
lieve women prefer. The researchers know the actual 
degree of muscle, measured as kilograms per square 
meter of fat-free mass, for each of the photos. ‘They 
can therefore measure the difference between what 
a subject thinks women prefer and the subject’s own 
self-image. Call this difference the “muscle gap.” 
Here are summary statistics for the muscle gap from 
a random sample of 200 American and European 
young men: X = 2.35 and s, =2.5.” 

Calculate and interpret a 95% confidence interval 
for the mean size of the muscle gap for the popula- 
tion of American and European young men. 

A graph of the sample data is strongly skewed to the 
right. Explain why this information does not invali- 
date the interval you calculated in part (a). 

A big-toe problem A bunion on the big toe is fairly un- 
common in youth and often requires surgery. Doctors 
used X-rays to measure the angle (in degrees) of defor- 
mity on the big toe in a random sample of 37 patients 
under the age of 21 who came to a medical center for 
surgery to correct a bunion. ‘The angle is a measure of 
the seriousness of the deformity. For these 37 patients, 
the mean angle of deformity was 24.76 degrees and the 
standard deviation was 6.34 degrees. A dotplot of the 
data revealed no outliers or strong skewness.”° 
Construct and interpret a 90% confidence interval 
for the mean angle of deformity in the population of 
all such patients. 

Researchers omitted one patient with a deformity 
angle of 50 degrees from the analysis due to a mea- 
surement issue. What effect would including this 
outlier have on the confidence interval in part (a)? 
Justify your answer without doing any calculations. 
Give it some gas! Computers in some vehicles 
calculate various quantities related to performance. 
One of these is fuel efficiency, or gas mileage, usu- 
ally expressed as miles per gallon (mpg). For one 
vehicle equipped in this way, the miles per gallon 
were recorded each time the gas tank was filled 

and the computer was then reset.’’ Here are the mpg 
values for a random sample of 20 of these records: 


15.8 
19.4 


13.6 15.6 19.1 224 15.6 225 17.2 19.4 22.6 
18.0 146 18.7 21.0 148 226 21.5 14.3 20.9 


Construct and interpret a 95% confidence interval for 
the mean fuel efficiency ju for this vehicle. 


STEP 


L 


70. Vitamin C content Several years ago, the U.S. 
Agency for International Development provided 
238,300 metric tons of corn-soy blend (CSB) for 
emergency relief in countries throughout the world. 
CSB is a highly nutritious, low-cost fortified food. 
As part of a study to evaluate appropriate vitamin 

C levels in this food, measurements were taken on 
samples of CSB produced in a factory.’® The follow- 
ing data are the amounts of vitamin C, measured in 
milligrams per 100 grams (mg/100 g) of blend, for a 


random sample of size 8 from one production run: 
2 Sil Bs) 22 jill 22 14 = 31 


Construct and interpret a 95% confidence interval 
for the mean amount of vitamin C yz in the CSB 
from this production run. 


71. Paired tires Researchers were interested in comparing 
two methods for estimating tire wear. ‘The first method 
used the amount of weight lost by a tire. ‘The second 
method used the amount of wear in the grooves of 
the tire. A random sample of 16 tires was obtained. 
Both methods were used to estimate the total distance 
traveled by each tire. The table below provides the two 
estimates (in thousands of miles) for each tire.”? 

(a) Construct and interpret a 95% confidence interval 
for the mean difference jz in the estimates from these 
two methods in the population of tires. 

(b) Does your interval in part (a) give convincing 
evidence of a difference in the two methods of 
estimating tire wear? Justify your answer. 


Section 8.3 Estimating a Population Mean 
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Water ‘Trace metals found in wells affect the taste of 
drinking water, and high concentrations can pose a 
health risk. Researchers measured the concentration 
of zinc (in milligrams/liter) near the top and the bot- 
tom of 10 randomly selected wells in a large region. 
The data are provided in the table below.” 

(a) Construct and interpret a 95% confidence interval 
for the mean difference jz in the zinc concentrations 
from these two locations in the wells. 

(b) Does your interval in part (a) give convincing evidence 

of a difference in zinc concentrations at the top and 

bottom of wells in the region? Justify your answer. 


Well: 1 2 3 4 5) 6 7 8 9 10 
Bottom: 0.430 0.266 0.567 0.531 0.707 0.716 0.651 0.589 0.469 0.723 
Top: 0.415 0.238 0.390 0.410 0.605 0.609 0.632 0.523 0.411 0.612 


Difference: 0.015 0.028 0.177 0.121 0.102 0.107 0.019 0.066 0.058 0.111 
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Estimating BMI The body mass index (BMI) of all 
American young women is believed to follow a Nor- 


7.5. How large a sample would be needed to estimate 
the mean BMI yp in this population to within +1 
with 99% confidence? Show your work. 

74. The SAT again High school students who take 
the SAT Math exam a second time generally score 
higher than on their first try. Past data suggest that 
the score increase has a standard deviation of 
about 50 points. How large a sample of high school 
students would be needed to estimate the mean 
change in SAT score to within 2 points with 95% 
confidence? Show your work. 


Multiple choice: Select the best answer for Exercises 75 
to 78. 


75. One reason for using a ¢ distribution instead of the 
standard Normal curve to find critical values when 
calculating a level C confidence interval for a popu- 
lation mean is that 

(a) zcan be used only for large samples. 

(b) z requires that you know the population standard 
deviation o. 

(c) zrequires that you can regard your data as an SRS 
from the population. 

(d) z requires that the sample size is at most 10% of the 
population size. 

(e) azcritical value will lead to a wider interval than a 
t critical value. 

76. You have an SRS of 23 observations from a large pop- 
ulation. The distribution of sample values is roughly 
symmetric with no outliers. What critical value would 
you use to obtain a 98% confidence interval for the 
mean of the population? 


(a) 2.177. (b) 2.183 (c) 2.326 (d) 2.500 (e) 2.508 

77. A quality control inspector will measure the salt 
content (in milligrams) in a random sample of bags 
of potato chips from an hour of production. Which of 
the following would result in the smallest margin of 
error in estimating the mean salt content 1? 

(a) 90% confidence; n = 25 

(b) 90% confidence; n = 50 

(c) 95% confidence; n = 25 

(d) 95% confidence; n = 50 

(e) n= 100 atany confidence level 

78. Scientists collect data on the blood cholesterol levels 
(milligrams per deciliter of blood) of a random 
sample of 24 laboratory rats. A 95% confidence inter- 
val for the mean blood cholesterol level ju is 80.2 to 
89.8. Which of the following would cause the most 
worry about the validity of this interval? 

(a) ‘There is a clear outlier in the data. 

(b) Astemplot of the data shows a mild right skew. 
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You do not know the population standard deviation o. 


(c) 
(d) 
(e) None of these are a problem when using a ¢ interval. 
79. Watching T'V (6.1, 7.3) Choose a young person (aged 

®> 19 to 25) at random and ask, “In the past seven days, how 


C4 many days did you watch television?” Call the response X 
for short. Here is the probability distribution for X:°! 


The population distribution is not exactly Normal. 


Days: 0 1 2 3 4 5 6 7 
Probability: 0.04 0.03 0.06 0.08 0.09 0.08 0.05 ??? 


(a) What is the probability that X = 7? Justify your 
answer. 

(b) Calculate the mean of the random variable X. Inter- 
pret this value in context. 

(c) Suppose that you asked 100 randomly selected young 
people (aged 19 to 25) to respond to the question 
and found that the mean x of their responses was 
4.96. Would this result surprise you? Justify your 
answer. 

80. Price cuts (4.2) Stores advertise price reductions 
> to attract customers. What type of price cut is most 
© attractive? Experiments with more than one factor 

allow insight into interactions between the factors. 

A study of the attractiveness of advertised price 
discounts had two factors: percent of all foods on sale 
(25%, 50%, 75%, or 100%) and whether the discount 
was stated precisely (as in, for example, “60% off’) 

or as a range (as in “40% to 70% off’). Subjects rated 
the attractiveness of the sale on a scale of | to 7. 


ESTIMATING WITH CONFIDENCE 


(a) Describe a completely randomized design using 
200 student subjects. 

(b) Explain how you would use the partial table of 
random digits below to assign subjects to treatment 
groups. Then use your method to select the first 3 
subjects for one of the treatment groups. Show your 
work clearly on your paper. 


45740 41807 65561 33302 07051 93623 18132 09547 
12975 13258 13048 45144 72321 81940 00360 02428 


(c) ‘The figure below shows the mean ratings for the 
eight treatments formed from the two factors.” Based 
on these results, write a careful description of how 
percent on sale and precise discount versus range of 
discounts influence the attractiveness of a sale. 
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Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


Members at a popular fitness club currently pay a $40 
per month membership fee. ‘The owner of the club wants to 
raise the fee to $50 but is concerned that some members will 
leave the gym if the fee increases. To investigate, the owner 
plans to survey a random sample of the club members and 
construct a 95% confidence interval for the proportion of all 
members who would quit if the fee was raised to $50. 


(a) Explain the meaning of “95% confidence” in the 
context of the study. 

(b) After the owner conducted the survey, he calcu- 
lated the confidence interval to be 0.18 + 0.075. 


Interpret this interval in the context of the 
study. 

(c) According to the club’s accountant, the fee in- 

crease will be worthwhile if fewer than 20% of the 
members quit. According to the interval from part 
(b), can the owner be confident that the fee in- 
crease will be worthwhile? Explain. 
One of the conditions for calculating the con- 
fidence interval in part (b) is that nf = 10 and 
n(1 — f) = 10. Explain why it is necessary to 
check this condition. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you think 
each solution is “complete,” “substantial,” “developing,” or “minimal.” 
If the solution is not complete, what improvements would you suggest 
to the student who wrote it? Finally, your teacher will provide you with 
a scoring rubric. Score your response and note what, if anything, you 
would do differently to improve your own score. 


Chapter Review 


Section 8.1: Confidence Intervals: The Basics 


In this section, you learned that a point estimate is the single 
best guess for the value of a population parameter. You also 
learned that a confidence interval provides an interval of 
plausible values for a parameter. ‘lo interpret a confidence 
interval, say, “We are C% confident that the interval from 
___ to ____ captures the [parameter in context],” where C is 
the confidence level of the interval. 

The confidence level C describes the percentage of con- 
fidence intervals that we expect to capture the value of the 
parameter. To interpret a C% confidence level, we say, “If 
we took many samples of the same size and used them to 
construct C% confidence intervals, about C% of those in- 
tervals would capture the [parameter in context].” 

Confidence intervals are formed by including a margin 
of error on either side of the point estimate. The size of 
the margin of error is determined by several factors, includ- 
ing the confidence level C and the sample size n. Increas- 
ing the sample size n makes the standard deviation of our 
estimate smaller, decreasing the margin of error. Increasing 
the confidence level C makes the margin of error larger, 
to ensure that the capture rate of the interval increases to 
C%. Remember that the margin of error only accounts for 
sampling variability —it does not account for any bias in the 
data collection process. 


Section 8.2: Estimating a Population Proportion 


In this section, you learned how to construct and interpret 
confidence intervals for a population proportion. Several im- 
portant conditions must be met for this type of confidence 
interval to be valid. First, the data used to calculate the in- 
terval must come from a well-designed random sample or 
randomized experiment (the Random condition). When the 
sample is taken without replacement from the population, 
the sample size should be no more than 10% of the popula- 
tion size (the 10% condition). Finally, the observed number 
of successes np and observed number of failures n(1 — f) 
must both be at least 10 (the Large Counts condition). 

The formula for calculating a confidence interval for a 
population proportion is 

p22 PUD 
n 
where f is the sample proportion, z* is the critical value, 
and n is the sample size. The value of z* is based on the 
confidence level C. To find z*, use Table A or technology to 
determine the values of z* and —z* that capture the middle 
C% of the standard Normal distribution. 

The four-step process (State, Plan, Do, Conclude) is per- 
fectly suited for problems that ask you to construct and inter- 
pret a confidence interval. You should state the parameter you 
are estimating and at what confidence level, plan your work 


by naming the type of interval you will use and checking the 
appropriate conditions, do the calculations, and make a con- 
clusion in the context of the problem. You can use technology 
for the Do step, but make sure that you identify the procedure 
you are using and type in the values correctly. 

Finally, an important part of planning a study is deter- 
mining the size of the sample to be selected. The necessary 
sample size is based on the confidence level, the proportion 
of successes, and the desired margin of error. To calculate 
the minimum sample size, solve the following inequality 
for n, where f is a guessed value for the sample proportion: 

> [PED = ue 


n 


If you do not have an approximate value of f from a previous 
study or a pilot study, use f = 0.5 to determine the sample 
size that will yield a value less than or equal to the desired 
margin of error. 


Section 8.3: Estimating a Population Mean 


In this section, you learned how to construct and interpret con- 
fidence intervals for a population mean. Remember that you 
have to check conditions before doing calculations. ‘The Ran- 
dom and 10% conditions are the same as those for proportions. 
‘There’s one new condition for means: the population must be 
Normally distributed or the sample size must be at least 30 (the 
Normal/Large Sample condition). If the population shape is 
unknown and the sample size is less than 30, graph the sample 
data and check for strong skewness or outliers. If there is no 
strong skewness or outliers, it is reasonable to assume that the 
population distribution is approximately Normal. 

The formula for calculating a confidence interval for a 
population mean is 

Sy 

Vn 

where x is the sample mean, t* is the critical value, s, is the 
sample standard deviation, and n is the sample size. We use 
at critical value instead of a z critical value when the popula- 
tion standard deviation is unknown—which is almost always 
the case. The value of t* is based on the confidence level C 
and the degrees of freedom (df = n — 1). To find t*, use Table 
B or technology to determine the values of t* and —¢* that 
capture the middle C% of the appropriate t distribution. ‘The 
t distributions are bell-shaped, symmetric, and centered at 0. 
However, they are more variable and have a shape slightly dif 
ferent from that of the standard Normal distribution. 

You also learned how to estimate the sample size when 
planning a study, as in Section 8.2. To calculate the mini- 
mum sample size, solve the following inequality for n, where 
a is a guessed value for the population standard deviation: 


mae 


oO 
z*— = ME 
Vn 
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What Did You Learn? 


Learning Objective 


Section Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Determine the point estimate and margin of error from a confidence 
interval. 


Interpret a confidence interval in context. 


481 
481, 484 


R8.2 
R8.3, R8.4, R8.6, R8.7 


Interpret a confidence level in context. 


484 


R8.2 


Describe how the sample size and confidence level affect the length 
of a confidence interval. 


Discussion on 487 


R8.9 


Explain how practical issues like nonresponse, undercoverage, and 
response bias can affect the interpretation of a confidence interval. 


Discussion on 488 


R8.6 


State and check the Random, 10%, and Large Counts conditions 
for constructing a confidence interval for a population proportion. 


494 


R8.3 


Determine critical values for calculating a C% confidence interval 
for a population proportion using a table or technology. 


497 


R8.1 


Construct and interpret a confidence interval for a population 
proportion. 


498, 500 


R8.3, R8.6 


Determine the sample size required to obtain a C% confidence 
interval for a population proportion with a specified margin of error. 


502 


R8.5 


State and check the Random, 10%, and Normal/Large Sample con- 
ditions for constructing a confidence interval for a population mean. 


516 


R8.4 


Explain how the f distributions are different from the standard Nor- 
mal distribution and why it is necessary to use a f distribution when 
calculating a confidence interval for a population mean. 


Discussion on 
511-512 


R8.10 


Determine critical values for calculating a C% confidence interval 
for a population mean using a table or technology. 


513 


R8.1 


Construct and interpret a confidence interval for a population mean. 


519-520 


R8.4, R8.7 


Determine the sample size required to obtain a C% confidence 
interval for a population mean with a specified margin of error. 
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Chapter 8 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R8.1 It’s critical Find the appropriate critical value for 
constructing a confidence interval in each of the 
following settings. 

(a) Estimating a population proportion p at 
a 94% confidence level based on an SRS of size 
25. 
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R8.8 


(b) Estimating a population mean jp at a 99% confi- 
dence level based on an SRS of size 58. 

R8.2 Batteries A company that produces AA batteries 
tests the lifetime of a random sample of 30 batteries 
using a special device designed to imitate real-world 
use. Based on the testing, the company makes the 
following statement: “Our AA batteries last an aver- 
age of 430 to +70 minutes, and our confidence in 


that interval is 95%.”? 


Chapter Review Exercises de, SE 


(a) Determine the point estimate, margin of error, (a) Construct and interpret a 95% confidence interval 
standard error, and sample standard deviation. for the population proportion. 
(b) A reporter translates the statistical announcement (b) Nonresponse is a practical problem for this 
into “plain English” as follows: “If you buy one of survey—only 21.6% of calls that reached a live 
this company’s AA batteries, there is a 95% chance person were completed. Another practical 
that it will last between 430 and 470 minutes.” problem is that people may not give truthful an- 
Comment on this interpretation. swers. What is the likely direction of the bias: 
(c) Your friend, who has just started studying statistics, Do you think more or fewer than 171 of the 880 
claims that if you select 40 more AA batteries at ran- respondents really ran a red light? Why? Are 
dom from those manufactured by this company, there these sources of bias included in the margin of 
is a 95% probability that the mean lifetime will fall error? 
between 430 and 470 minutes. Do you agree? Explain. R8.7 Engine parts Here are measurements (in millime- 
(d) Give a statistically correct interpretation of the con- ters) of a critical dimension on an SRS of 16 of the 
fidence level that could be published in a newspa- more than 200 auto engine crankshafts produced 
per report. in one day: 


R8.3 We love football! A recent Gallup Poll conducted 
telephone interviews with a random sample of 
adults aged 18 and older. Data were obtained for 


224.120 224.001 224.017 223.982 223.989 223.961 223.960 224.089 
223.987 223.976 223.902 223.980 224.098 224.057 223.913 223.999 


1000 people. Of these, 37% said that football is their (a) Construct and interpret a 95% confidence interval 
favorite sport to watch on television. for the process mean at the time these crankshafts 
(a) Define the parameter p in this setting. Explain to were produced. 


someone who knows nothing about statistics why we 
can’t just say that 37% of all adults would say that 
football is their favorite sport to watch on television. 


(b) ‘The process mean is supposed to be w = 224 mm 
but can drift away from this target during produc- 


tion. Does your interval from part (a) suggest that 
(b) Check the conditions for constructing a confidence the process mean has drifted? Explain. 


interval for p. : 
R8.8 Good wood? A lab supply company sells pieces 


of Douglas fir 4 inches long and 1.5 inches 


square for force experiments in science classes. 


(c) Construct a 95% confidence interval for p. 
(d) Interpret the interval in context. 


R8.4 Smart kids A school counselor wants to know how From experience, the strength of these pieces of 
smart the students in her school are. She gets fund- wood follows a Normal distribution with standard 
ing from the principal to give an IQ test to an SRS deviation 3000 pounds. You want to estimate 
of 60 of the over 1000 students in the school. The the mean load needed to pull apart these pieces 
mean IQ score was 114.98 and the standard devia- of wood to within 1000 pounds with 95% confi- 
tion was 14.80.** dence. How large a sample is needed? Show your 

(a) Define the parameter y in this setting. work. 
(b) Check the conditions for constructing a confidence R8.9_ It’s about ME. Explain how each of the follow- 
interval for j.. ing would affect the margin of error of a confi- 
(c) Construct a 90% confidence interval for the mean dence interval, if all other things remained the 
1O score of students at the school. same. 
(d) Interpret your result from part (c) in context. (a) Increasing the confidence level 
R8.5 Do you go to church? The Gallup Poll plans to ask (b) Quadrupling the sample size 


a random sample of adults whether they attended 
a religious service in the last 7 days. How large a 
sample would be required to obtain a margin of er- 
ror of at most 0.01 in a 99% confidence interval for 
the population proportion who would say that they Ai ir 
attended a religious service? Show your work. (a) When is it necessary to use a ¢ critical value rather 
than a < critical value when constructing a confi- 


dence interval for a population mean? 


(b) Describe two ways that the t distributions are dif 
ferent from the standard Normal distribution. 


R8.10 ttime When constructing confidence intervals for 
a population mean, we almost always use critical 
values from a ¢ distribution rather than the stan- 
dard Normal distribution. 


R8.6 Running red lights A random digit dialing tele- 
phone survey of 880 drivers asked, “Recalling the 
last ten traffic lights you drove through, how many 
of them were red when you entered the intersec- 
tions?” Of the 880 respondents, 171 admitted that at (c) Explain what happens to the ¢ distributions as the 
least one light had been red.” degrees of freedom increase. 
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CHAPTER 8 


ESTIMATING WITH CONFIDENCE 


Chapter 8 AP® Statistics Practice Test 


Section |: Multiple Choice Select the best answer for each question. 


T8.1 


(a) 


The Gallup Poll interviews 1600 people. Of these, 
18% say that they jog regularly. The news report 
adds: “The poll had a margin of error of plus or 
minus three percentage points ata 95% confidence 
level.” You can safely conclude that 


95% of all Gallup Poll samples like this one give 
answers within +3% of the true population value. 


(b) the percent of the population who jog is certain to be 


between 15% and 21%. 


(c) 95% of the population jog between 15% and 21% of 


the time. 


(d) we can be 95% confident that the sample proportion 


is captured by the confidence interval. 


(e) if Gallup took many samples, 95% of them would 


T8.2 


find that 18% of the people in the sample jog. 


The weights (in pounds) of three adult males are 
160, 215, and 195. The standard error of the mean of 
these three weights is 


190. (b) 27.84. (c) 22.73. (d) 16.07. (e) 13.13. 


In preparing to construct a one-sample t interval 

for a population mean, suppose we are not sure if 
the population distribution is Normal. In which of 
the following circumstances would we not be safe 
constructing the interval based on an SRS of size 24 
from the population? 


a) Astemplot of the data is roughly bell-shaped. 


(b) A histogram of the data shows slight skewness. 


A boxplot of the data has a large outlier. 


(d) The sample standard deviation is large. 


(a) 
) 
(c) 
) 
(e) 


asec 


e) A Normal probability plot of the data is fairly linear. 


Many television viewers express doubts about the va- 
lidity of certain commercials. In an attempt to answer 
their critics, Timex Group USA wishes to estimate the 
proportion of consumers who believe what is shown 
in Timex television commercials. Let p represent the 
true proportion of consumers who believe what is 
shown in ‘Timex television commercials. What is the 


(a 


a 


smallest number of consumers that Timex can survey 
to guarantee a margin of error of 0.05 or less at a 99% 
confidence level? 


550  (b) 600 (c) 650 (d) 700 (e) 750 


T8.5 You want to compute a 90% confidence interval for 


(a 


= 


the mean of a population with unknown population 
standard deviation. ‘The sample size is 30. The value 
of t* you would use for this interval is 


1.645. (b) 1.699. (c) 1.697. (d) 1.96. (e) 2.045. 


T8.6 A radio talk show host with a large audience is in- 


terested in the proportion p of adults in his listening 
area who think the drinking age should be lowered 
to eighteen. To find this out, he poses the follow- 
ing question to his listeners: “Do you think that the 
drinking age should be reduced to eighteen in light 
of the fact that eighteen-year-olds are eligible for 
military service?” He asks listeners to phone in and 
vote “Yes” if they agree the drinking age should be 
lowered and “No” if not. Of the 100 people who 
phoned in, 70 answered “Yes.” Which of the follow- 
ing conditions for inference about a proportion using 
a confidence interval are violated? 


. The data are a random sample from the population 


of interest. 


. The population is at least 10 times as large as the 


sample. 


. nis so large that both nf and n(1 — f) are at least 10. 


(c) II only (e) I, Il, and Il 


(d) land II only 


I only 
II only 


18.7 A 90% confidence interval for the mean p of a popu- 


(a) 9+2 


lation is computed from a random sample and is 
found to be 9 = 3. Which of the following could be 


the 95% confidence interval based on the same data? 


(bt) 9+3  (c) 944 (d)9=8 


(e) Without knowing the sample size, any of the above 


answers could be the 95% confidence interval. 


T8.8 Suppose we want a 90% confidence interval for the 
average amount spent on books by freshmen in their 
first year at a major university. The interval is to have 
a margin of error of $2. Based on last year’s book 
sales, we estimate that the standard deviation of the 
amount spent will be close to $30. The number of 
observations required is closest to 


(ay2>. (by 302 “(e) G08. dy G09.” “Wey 865. 
T8.9 A telephone poll of an SRS of 1234 adults found 
that 62% are generally satisfied with their lives. The 
announced margin of error for the poll was 3%. Does 
the margin of error account for the fact that some 
adults do not have telephones? 


= 


(a) Yes. The margin of error includes all sources of error 
in the poll. 

(b) Yes. ‘Taking an SRS eliminates any possible bias in 
estimating the population proportion. 

(c) Yes. The margin of error includes undercoverage but 
not nonresponse. 

(d) No. The margin of error includes nonresponse but 
not undercoverage. 
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(e) No. The margin of error only includes sampling 
variability. 


T$.10 A Census Bureau report on the income of Americans 
says that with 90% confidence the median income of 
all U.S. households in a recent year was $57,005 with 
a margin of error of +$742. This means that 


(a) 90% of all households had incomes in the range 
$57,005 + $742. 


(b) we can be sure that the median income for all 
households in the country lies in the interval 


$57,005 + $742. 

(c) 90% of the households in the sample interviewed 
by the Census Bureau had incomes in the interval 
$57,005 + $742. 

(d) the Census Bureau got the result $57,005 + $742 
using a method that will capture the true median 
income 90% of the time when used repeatedly. 

(e) 90% of all possible samples of this same size would 
result in a sample median that falls within $742 of 
$57,005. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T8.11 The U.S. Forest Service is considering additional re- 
strictions on the number of vehicles allowed to enter 
Yellowstone National Park. To assess public reaction, 
the service asks a random sample of 150 visitors if 
they favor the proposal. Of these, 89 say “Yes.” 

(a) Construct and interpret a 99% confidence interval 
for the proportion of all visitors to Yellowstone who 
favor the restrictions. 


(b) Based on your work in part (a), can the U.S. Forest 
Service conclude that more than half of visitors 
to Yellowstone National Park favor the proposal? 
Justify your answer. 

18.12 How many people live in South African house- 
holds? ‘To find out, we collected data from an SRS 
of 48 out of the over 700,000 South African students 
who took part in the CensusAtSchool survey proj- 
ect. The mean number of people living in a house- 
hold was 6.208; the standard deviation was 2.576. 


(a) Is the Normal/Large Sample condition met in this 
case? Justify your answer. 
(b) Maurice claims that a 95% confidence interval for 


De 
the population mean is 6.208+ 1.96 es Explain 


why this interval is wrong. Then give the correct 
interval. 


T8.13_ A milk processor monitors the number of bacteria per 
milliliter in raw milk received at the factory. A ran- 
dom sample of 10 one-milliliter specimens of milk 
supplied by one producer gives the following data: 


5370 4890 5100 4500 5260 5150 4900 4760 4700 4870 


Construct and interpret a 90% confidence interval 
for the population mean ju. 


Chapter 


Introduction 


Section 9.1 


Section 9.2 


Section 9.3 


Free Response AP® 
Problem, YAY! 


Chapter 9 Review 
Chapter 9 Review Exercises 


Chapter 9 AP® Statistics 
Practice Test 


Do You Have a Fever? 


Sometimes when you're sick, your forehead feels really warm. You might have a fever. How can you find 
out whether you do? By taking your temperature, of course. But what temperature should the thermometer 
show if you're healthy? Is this temperature the same for everyone? 

Several years ago, researchers conducted a study to determine whether the “accepted” value for normal 
body temperature, 98.6°F, is accurate. They used an oral thermometer to measure the temperatures of 
a random sample of healthy men and women aged 18 to 40. As is often the case, the researchers did not 
provide their original data. 

Allen Shoemaker, from Calvin College, produced a data set with the same properties as the original 
temperature readings. His data set consists of one oral temperature reading for each of 130 randomly 
chosen, healthy 18- to 40-year-olds.' A dotplot of Shoemaker’s temperature data is shown below. We have 
added a vertical line at 98.6°F for reference. 
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Temperature (°F) 


Exploratory data analysis revealed several interesting facts about this data set: 
e The mean temperature was X = 98.25°F. 

e The standard deviation of the temperature readings was s, = 0.73°F. 

¢ 62.3% of the temperature readings were less than 98.6°F. 
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fF Introduction 


Confidence intervals are one of the two most common types of statistical inference. 
Use a confidence interval when your goal is to estimate a population parameter. 
The second common type of inference, called significance tests, has a different 
goal: to assess the evidence provided by data about some claim concerning a 
parameter. Here is an Activity that illustrates the reasoning of statistical tests. 


ACTIVITY | I'ma Great Free-Throw Shooter! 


MATERIALS: A basketball player claims to make 80% of the free throws that he attempts. We 
Computer with Internet think he might be exaggerating. To test this claim, we’ll ask him to shoot some free 
access and projection throws—virtually—using The Reasoning of a Statistical Test applet at the book’s 
capability Web site. 


pePLe,, 1. Go to www.whfreeman.com/tps>e and launch the applet. 


Shots: 25 


| Show true probability 


Hits = 18/25 = 72% 


Misses = 7/25 = 28% 


2. Set the applet to take 25 shots. Click “Shoot.” How many of the 25 shots did 
the player make? Do you have enough data to decide whether the player’s claim 
is valid? 

3. Click “Shoot” again for 25 more shots. Keep doing this until you are 
convinced either that the player makes less than 80% of his shots or that the 
player’s claim is true. How large a sample of shots did you need to make your 
decision? 

4. Click “Show true probability” to reveal the truth. Was your conclusion 
correct? 

5. If time permits, choose a new shooter and repeat Steps 2 through 4. Is it 
easier to tell that the player is exaggerating when his actual proportion of free 
throws made is closer to 0.8 or farther from 0.8? 
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In the free-throw shooter Activity, the parameter of interest is the proportion p 
of free throws that the player will make if he shoots forever. Our player claims that 
p = 0.80. To test his claim, we let the applet simulate 25 shots. If the player makes 
only 40% of his free throws (10 of 25 shots made), we have fairly strong evidence 
that he doesn’t shoot 80%. But what if he makes 76% of his free throws (19 of 
25 shots made)? This provides some evidence that his true long-term percent may 
be less than 80%, but it’s not nearly as convincing as fp = 0.40. Statistical tests 
weigh the evidence against a claim like p = 0.8 and in favor of a counter-claim 
like p < 0.80. 

Section 9.1 focuses on the underlying logic of statistical tests. Once the founda- 
tion is laid, we consider the implications of using these tests to make decisions — 
about everything from free-throw shooting to the effectiveness of a new drug. In 
Section 9.2, we present the details of performing a test about a population propor- 
tion. Section 9.3 shows how to test a claim about a population mean. Along the 
way, we examine the connection between confidence intervals and tests. 


POT Significance Tests: 


The Basics 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e State the null and alternative hypotheses for a e Determine whether the results of a study are 
significance test about a population parameter. statistically significant and make an appropriate 


e Interpret a P-value in context. conclusion using a significance level. 


Interpret a Type | and a Type Il error in context and give 
a consequence of each. 


A significance test is a formal procedure for using observed data to decide between 
two competing claims (also called hypotheses). The claims are often statements 
about a parameter, like the population proportion p or the population mean wu. 
Let’s start by taking a closer look at how to state hypotheses. 


Stating Hypotheses 


A significance test starts with a careful statement of the claims we want to com- 

pare. In our free-throw shooter example, the virtual player claims that his long-run 

proportion of made free throws is p = 0.80. This is the claim we seek evidence 

against. We call it the null hypothesis, abbreviated Hp. Usually, the null hypoth- 

Remember: the null hypothesis is the esis is a statement of “no difference.” For the free-throw shooter, no difference 
dull hypothesis! from what he claimed gives Ho: p = 0.80. 

The claim we hope or suspect to be true instead of the null hypothesis is called 

the alternative hypothesis. We abbreviate the alternative hypothesis as H,. In this 

case, we believe the player might be exaggerating, so our alternative hypothesis is 


Hap < 0,80: 
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ES 


DEFINITION: Null hypothesis Ap, alternative hypothesis H, 

The claim we weigh evidence against in a statistical test is called the null 
hypothesis (Hp). Often the null hypothesis is a statement of “no difference.” 
The claim about the population that we are trying to find evidence for is the 
alternative hypothesis (H,). 


Some people insist that all three 
possibilities—greater than, less than, 


and equal to—be accounted for in the In the free-throw shooter example, our hypotheses are 

hypotheses. For the free-throw shooter 

example, since the alternative hypothesis Ho: p= 0.80 

is Hj: < 0.80, they would write the null H,:p < 0.80 

hypothesis as Ay: p = 0.80. In spite of 

the mathematical appeal of covering all © where p is the long-run proportion of made free throws. The alternative hypoth- 
three cases, we use only the value esis is one-sided because we are interested only in whether the player is overstat- 


p = 0.80 in our calculations. So we'll 


ce ing his free-throw shooting ability. Because H, expresses the effect that we hope to 
stick with Hy: p = 0.80. 


find evidence for, it is sometimes easier to begin by stating H, and then set up Ho 
as the statement that the hoped-for effect is not present. 
Here is an example in which the alternative hypothesis is two-sided. 


Juicy Pineapples 
Stating hypotheses 


At the Hawaii Pineapple Company, managers are interested in the size 
of the pineapples grown in the company’s fields. Last year, the mean 
weight of the pineapples harvested from one large field was 31 ounces. 
A different irrigation system was installed in this field after the growing 
season. Managers wonder how this change will affect the mean weight 
of pineapples grown in the field this year. 


PROBLEM: State appropriate hypotheses for performing a significance test. Be 
sure to define the parameter of interest. 


SOLUTION: The parameter of interest is the mean weight /u of all pineapples grown 
in the field this year. Because managers wonder whether the mean weight of this year’s 
pineapples will differ from last year’s mean weight of 311 ounces, the alternative hypoth- 
esis is two-sided; that is, either 1 < 31 or fs > 31. For simplicity, we write this as 
jt # 31. The null hypothesis says that there is no difference in the mean weight of the pineapples 
after the irrigation system was changed. That is, 


Ho: = 51 
H,: 6 #31 


For Practice Try Exercise 


The hypotheses should express the hopes or suspicions we have before we 
see the data. It is cheating to look at the data first and then frame hypothe- 
ses to fit what the data show. For example, the data for the pineapple study 
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showed that ¥ = 31.935 ounces for a random sample of 50 pineapples grown in 
the field this year. You should not change the alternative hypothesis to H,: > 31 
after looking at the data. If you do not have a specific direction firmly in mind in 
advance, use a two-sided alternative hypothesis. 


DEFINITION: One-sided alternative hypothesis and 
two-sided alternative hypothesis 


The alternative hypothesis is one-sided if it states that a parameter is /arger 
than the null hypothesis value or if it states that the parameter is smaller than the 
null value. It is two-sided if it states that the parameter is different from the null 
hypothesis value (it could be either larger or smaller). 


It is common to refer to a significance The null hypothesis has the form Ho:parameter = value. The alternative 
test with a one-sided alternative hypothesis has one of the forms H,: parameter < value, H,: parameter > value, or 
hypothesis a8 a one-sided test or H,: parameter # value. To determine the correct form of H,, read the problem 
one-tailed test and to a test with a carefully 


two-sided alternative hypothesis as a : 
two-sided test or two-tailed test Hypotheses always refer to a population, not to a sample. Be sure to 4 
state Ho and H, in terms of population parameters. It is never correct to 


write a hypothesis about a sample statistic, such as p = 0.64 or x > 85. 


CHECK YOUR UNDERSTANDING 


For each of the following settings, (a) describe the parameter of interest, and (b) state 
appropriate hypotheses for a significance test. 


1. According to the Web site sleepdeprivation.com, 85% of teens are getting less than 
eight hours of sleep a night. Jannie wonders whether this result holds in her large high 
school. She asks an SRS of 100 students at the school how much sleep they get on a 
typical night. In all, 75 of the responders said less than 8 hours. 

2. As part of its 2010 census marketing campaign, the U.S. Census Bureau advertised 
“10 questions, 10 minutes—that’s all it takes.” On the census form itself, we read, “The 
US. Census Bureau estimates that, for the average household, this form will take about 
10 minutes to complete, including the time for reviewing the instructions and answers.” 
We suspect that the actual time it takes to complete the form may be longer than 
advertised. 


The Reasoning of Significance Tests 


Significance tests ask if sample data give convincing evidence against the null 
hypothesis and in favor of the alternative hypothesis. A test answers the question, 
ieaaence eae annals “How likely is it to geta result like this just by chance when the null hypothesis is 
referred to as a test of significance,a_ true?” ‘The answer comes in the form of a probability. 
hypothesis test, or a test of hypotheses. Here is an activity that introduces the underlying logic of significance tests. 
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ACTIVITY | ’'ma Great Free-Throw Shooter! 


MATERIALS: Our virtual basketball player in the previous Activity claimed to be an 80% 
Copy of pie chart with 80% free-throw shooter. Suppose that he shoots 50 free throws and makes 32 of them. 
shaded and paper clip for 
each student 


x 32 
His sample proportion of made shots is p = x07 0.64. This result suggests that he 


may really make less than 80% of his free throws in the long run. But do we have 
convincing evidence that p < 0.80? In this activity, you and your classmates will 
perform a simulation to find out. 


1. Make a spinner that gives the shooter an 80% chance of making a free throw. 
Using the pie chart provided by your teacher, label the 80% region “made shot” 
and the 20% region “missed shot.” Straighten out one of the ends of a paper clip 
so that there is a loop on one side and a pointer on the other. On a flat surface, 
place a pencil through the loop, and put the tip of the pencil on the center of 
the pie chart. Then flick the paper clip and see where the pointed end lands. 

2. Simulate a random sample of 50 shots. Flick the paper clip 50 times, and 
count the number of times that the pointed end lands in the “made shot” region. 
3. Compute the sample proportion f of made shots in your simulation from 
Step 2. Plot this value on the class dotplot drawn by your teacher. 

4. Repeat Steps 2 and 3 as needed to get at least 40 trials of the simulation for 
your class. 


5. Based on the results of your simulation, how likely is it for an 80% shooter to 
make 64% or less when he shoots 50 free throws? Does the observed fp = 0.64 
result give convincing evidence that the player is exaggerating? 


Figure 9.1 shows what sample Our reasoning in the Activity is based on asking what would happen if the play- 
proportions are likely to occur by er’s claim (p = 0.80) were true and we observed many samples of 50 free throws. 
chance alone, assuming that p= 0.80. | We used Fathom software to simulate 400 sets of 50 shots assuming that the player is 
really an 80% shooter. Figure 9.1 shows a dotplot of the results. 
Each dot on the graph represents the proportion of made shots 
in one set of 50 attempts. For example, if the player makes 43/50 
shots in one trial, the dot would be placed at f = 0.86. 

You can say how strong the evidence against the player’s 
claim is by giving the probability that he would make as few 
as 32 out of 50 free throws if he really makes 80% in the long 
run. Based on the simulation, our estimate of this probabil- 
ity is 3/400 = 0.0075. The observed statistic, f = 0.64, is so 
unlikely if the actual parameter value is p = 0.80 that it gives 
convincing evidence that the player’s claim is not true. 

Be sure that you understand why this evidence is convinc- 
; ing. There are two possible explanations of the fact that our 
virtual player made only 6 = 32/50 = 0.64 of his free throws: 


Poeccccccccocecccocccs 


In 400 sets of 50 shots, 
there were only 3 sets 
when our shooter made 
as few as or fewer than 
the observed p = 0.64 


— eeccccccccccoccccccccccccccccccooocccooooeCe 


| eeccccccsccccccccocecceeccossoocccoos 
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oO 
ye 
se 
— eeeecceece 
= — eee: 
eee 
— eee: 
— eee 
— eee 
o 
ba e00: 
— eecccce 
=] 
eoccccccce 
ze 
ze 


1. The null hypothesis is correct (p = 0.8), and just by 
4 = 0064 chance, a very unlikely outcome occurred. 


FIGURE 9.1 Fathom dotplot of the sample proportion 6 2. The alternative hypothesis is correct—the population 


of free throws made by an 80% shooter in 400 sets of proportion is less than 0.8, so the sample result is not an 
50 shots. unlikely outcome. 
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Explanation | might be correct—the result of our random sample of 50 shots 
could be due to chance alone. But the probability that such a result would occur 
by chance is so small (less than 1 in a 100) that we are quite confident that 
Explanation 2 is right. 

Statistical tests use an elaborate vocabulary, but the basic idea is: an outcome 
that would rarely happen if the null hypothesis were true is good evidence that the 
null hypothesis is not true. 


Interpreting P-Values 


The idea of stating a null hypothesis that we want to find evidence against seems 
odd at first. It may help to think of a criminal trial. The defendant is “innocent 
until proven guilty.” That is, the null hypothesis is innocence and the prosecution 
must try to provide convincing evidence against this hypothesis and in favor of the 
alternative hypothesis: guilt. That’s exactly how statistical tests work, although in 
statistics we deal with evidence provided by data and use a probability to say how 
strong the evidence is. 

The null hypothesis Hp states the claim that we are seeking evidence against. 
The probability that measures the strength of the evidence against Hy and in favor 
of H, is called a P-value. 


a 


DEFINITION: P-value 


The probability, computed assuming Hh is true, that the statistic (such as p or x ) 
would take a value as extreme as or more extreme than the one actually observed, in 
the direction specified by H,, is called the P-value of the test. 


Small P-values are evidence against Hp because they say that the observed re- 
sult is unlikely to occur when Hp is true. Large P-values fail to give convincing 
evidence against Ho and in favor of H, because they say that the observed result is 
likely to occur by chance alone when Hp is true. 

We'll show you how to calculate P-values later. For now, let’s focus on interpret- 
ing them. 


I’m a Great Free-Throw Shooter! 
Interpreting a P-value 


The P-value is the probability of getting a sample result at least as extreme as the 
one we did if Ho were true. Because the alternative hypothesis is H,:p < 0.80, the 
sample results that count as “at least as extreme” are those with p S 0.64. In other 
words, the P-value is the conditional probability P(f S 0.64 | p = 0.80). Earlier, we 
used a simulation to estimate this probability as 3/400 = 0.0075. So if Ho is true 
and the player makes 80% of his free throws in the long run, there’s about a 0.0075 
probability that the player would make 32 or fewer of 50 shots by chance alone. The 
small probability gives strong evidence against Ho and in favor of the alternative 
H,:p < 0.80 because it would be so unlikely for this result to occur just by chance 
if Ho were true. 
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The alternative hypothesis sets the direction that counts as evidence against Hp. 
In the previous example, only values of f that are much less than 0.80 count as 
evidence against the null hypothesis because the alternative is one-sided on the 
low side. If the alternative is two-sided, both directions count. 


Healthy Bones 


Interpreting a P-value 


Calcium is a vital nutrient for healthy bones and teeth. The National Institutes 
of Health (NIH) recommends a calcium intake of 1300 mg per day for teenagers. 
The NIH is concerned that teenagers aren’t getting enough calcium. Is this true? 


Researchers want to perform a test of 

Ho: = 1300 

[AL eS BIO, 
where ju is the true mean daily calcium intake in the population of teenagers. 
They ask a random sample of 20 teens to record their food and drink consumption 
for | day. The researchers then compute the calcium intake for each student. Data 
analysis reveals that x = 1198 mg and s, = 411 mg. After checking that condi- 
tions were met, researchers performed a significance test and obtained a P-value 


of 0.1404. 


PROBLEM: 
(a) Explain what it would mean for the null hypothesis to be true in this setting. 
(b) Interpret the P-value in context. 


SOLUTION: 

(a) Inthis setting, Ho: j4 = 1300 says that the mean daily calcium intake in the population of 
teenagers is 1300 mg. If Ho is true, then teenagers are getting enough calcium, on average. 

(b) Assuming that the mean daily calcium intake in the teen population is 1300 mg, there is a 


0.1404 probability of getting a sample mean of 1196 mg or less just by chance ina random sample 
of 20 teens. 


For Practice Try Exercise 


Statistical Significance 


The final step in performing a significance test is to draw a conclusion about the 
competing claims you were testing. We will make one of two decisions based on 
the strength of the evidence against the null hypothesis (and in favor of the alter- 
native hypothesis) —reject Ho or fail to reject Ho. If our sample result is too un- 
likely to have happened by chance assuming Hp is true, then we'll reject Hp and 
say that there is convincing evidence for H,. Otherwise, we will fail to reject Ho 
and say that there is not convincing evidence for H,. 

This wording may seem unusual at first, but it’s consistent with what hap- 
pens in a criminal trial. Once the jury has weighed the evidence against the null 
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hypothesis of innocence, they return one of two verdicts: “guilty” (reject Ho) or 
“not guilty” (fail to reject Ho). A not-guilty verdict doesn’t guarantee that the de- 
fendant is innocent, just that there’s not convincing evidence of guilt. Likewise, 
a fail-to-reject Hp decision in a significance test doesn’t mean that H is 

true. For that reason, you should never “accept Hy” or use language imply- @ 
ing that you believe Ho is true. 


Free Throws and Healthy Bones 
Drawing conclusions 


In the free-throw shooter example, because the estimated P-value of 0.0075 is so 
small, there is strong evidence against the null hypothesis Hy:p = 0.80. For that 
reason, we would reject Ho in favor of the alternative H,:p < 0.80. We have con- 
vincing evidence that the virtual player makes fewer than 80% of his free throws. 


For the teen calcium study, however, the large P-value of 0.1404 gives weak evi- 
dence against Ho: 4 = 1300 and in favor of H,:u < 1300. We therefore fail to 
reject Hp. Researchers do not have convincing evidence that teens are getting 
less than 1300 mg of calcium per day, on average. 


In a nutshell, our conclusion in a significance test comes down to 


P-value small — reject H) > convincing evidence for H, (in context) 


P-value large — fail to reject Hy — not convincing evidence for H, (in context) 


There is no rule for how small a P-value is required to reject Hp—it’s a matter of 
judgment and depends on the specific circumstances. But we can compare the 
P-value with a fixed value that we regard as decisive, called the significance level. 
We write it as a, the Greek letter alpha. 

If we choose a = 0.05, we are requiring that the data give evidence against Hp 
so strong that it would happen less than 5% of the time just by chance when Ho 
is true. If we choose a = 0.01, we are insisting on stronger evidence against the 
null hypothesis, a result that would occur less often than | in every 100 times by 
chance alone when Hp is true. 

In Chapter 4, we said that an observed result is “statistically significant” if it 
would rarely occur by chance alone. When our P-value is less than the chosen a 
in a significance test, we say that the result is statistically significant at level a. 


DEFINITION: Statistically significant at level « 


If the P-value is smaller than alpha, we say that the results of a study are statistically 
significant at level a. In that case, we reject the null hypothesis Hy and conclude 
that there is convincing evidence in favor of the alternative hypothesis H;. 


tant.” It means simply “not likely to happen just by chance.” The signifi- 
cance level a makes “not likely” more exact. 


“Significant” in the statistical sense does not necessarily mean “impor- T 
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Significance at level 0.01 is often expressed by the statement “The results were 
significant (P < 0.01).” Here, P stands for the P-value. The actual P-value is more 
informative than a statement of significance because it allows us to assess signifi- 
cance at any level we choose. For example, a result with P = 0.03 is significant at 
the a = 0.05 level but is not significant at the a = 0.01 level. When we use a fixed 
significance level to draw a conclusion in a statistical test, 


P-value < a — reject Hp — convincing evidence for H, (in context) 


P-value = a — fail to reject Hp — not convincing evidence for H, (in context) 


Better Batteries 


Statistical significance 


A company has developed a new deluxe AAA battery that is supposed to last longer 
than its regular AAA battery.” However, these new batteries are more expensive 
to produce, so the company would like to be convinced that they really do last lon- 
ger. Based on years of experience, the company knows that its regular AAA batteries 
last for 30 hours of continuous use, on average. The company selects an SRS of 15 
new batteries and uses them continuously until they are completely drained. The 
sample mean lifetime is ¥ = 33.9 hours. A significance test is performed using the 
hypotheses 


Ho: « = 30 hours 
H,: 1 > 30 hours 


AP® EXAM TIP The where ju is the true mean lifetime of the new deluxe AAA batteries. The resulting 
conclusion to a significance test P-value is 0.0729. 

should always include three 

components: (1) an explicit PROBLEM: What conclusion would you make for each of the following significance levels? Justify 
comparison of the P-value your answer. 

to a stated significance level, (a) a=0.10 (b) a=0.05 

(2) a decision about the null 

hypothesis: reject or fail to SOLUTION: 

reject Mp, and (3) a statement (a) Because the P-value, 0.0729, is less than c = 0.10, we reject Hp. We have convincing evidence 
in the context of the problem that the company’s deluxe AAA batteries last longer than 30 hours, on average. 


ae neh eee (b) Because the P-value, 0.0729, is greater than c. = 0.05, we fail to reject Ho. We do not have 
convincing evidence for A, convincing evidence that the company’s deluxe AAA batteries last longer than 30 hours, on average. 


For Practice Try Exercise 


In practice, the most commonly used significance level is a = 0.05. This is 
mainly due to Sir Ronald A. Fisher, a famous statistician who worked on agri- 
cultural experiments in England during the early twentieth century. Fisher was 
the first to suggest deliberately using random assignment in an experiment. In 
a paper published in 1926, Fisher wrote that it is convenient to draw the line at 


THINK 
ABOUT IT 
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about the level at which we can say: “Either there is something in the treatment, 
or a coincidence has occurred such as does not occur more than once in twenty 
trials.” * 

Sometimes it may be preferable to choose a = 0.01 or a = 0.10, for reasons 
we will discuss shortly. Warning: if you are going to draw a conclusion based 
on a significance level a, then a should be stated before the data are produced. 
Otherwise, a deceptive user of statistics might set an a level after the data have 
been analyzed in an attempt to manipulate the conclusion. This is just as inap- 
propriate as choosing an alternative hypothesis to be one-sided in a particular 
direction after looking at the data. 


How do you choose a significance level? The purpose of a signifi- 
cance test is to give a clear statement of the strength of evidence provided by the 
data against the null hypothesis and in favor of the alternative hypothesis. The 
P-value does this. But how small a P-value is convincing evidence against the null 
hypothesis? This depends mainly on two circumstances: 


¢ How plausible is Ho? If Hp represents an assumption that the people you must 
convince have believed for years, strong evidence (a very small P-value) will 
be needed to persuade them. 


¢ What are the consequences of rejecting Ho? If rejecting Ho in favor of H, means 
making an expensive change of some kind, you need strong evidence that the 
change will be beneficial. 


These criteria are a bit subjective. Different people will insist on different levels 
of significance. Giving the P-value allows each of us to decide individually if the 
evidence is sufficiently strong. 


o_O 


Users of statistics have often emphasized standard significance levels such as 
10%, 5%, and 1%. The 5% level, a = 0.05, is very common. For example, courts 
have tended to accept 5% as a standard in discrimination cases.* 

Beginning users of statistical tests generally find it easier to compare a P-value 
to a significance level than to interpret the P-value correctly in context. For that 
reason, we will include stating a significance level as a required part of every sig- 
nificance test. We'll also ask you to explain what a P-value means in a variety of 
settings. 


Type | and Type Il Errors 


When we draw a conclusion from a significance test, we hope our conclusion 
will be correct. But sometimes it will be wrong. There are two types of mistakes 
we can make. We can reject the null hypothesis when it’s actually true, known as 
a Type I error, or we can fail to reject Hp when the alternative hypothesis is true, 
which is a Type II error. 


DEFINITION: Type | error and Type II error 
If we reject Hy when Ab is true, we have committed a Type | error. 
If we fail to reject Hp when H, is true, we have committed a Type Il error. 
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Truth about The possibilities are summarized in Figure 9.2. If Ho is true, 
the population lias : . : ; is 
our conclusion is correct if we fail to reject Ho, but it is a Type I 
Hp true H, true error if we reject Hp. If H, is true, our conclusion is correct if we 
= reject Ho, but is a Type II error if we fail to reject Hp. Only one 
: : orrec ; : : 
Conclusion Reject Hy) TypeTerror| icon | error is possible at a time. 
based on we ‘ 
cane eae It is important to be able to describe Type I and Type II errors 
Fail to reject Ho) oonctusion | ype Herror) in the context of a problem. Considering the consequences of 


each of these types of error is also important, as the following 


FIGURE 9.2 The two types of errors in significance tests. exampl ashows. 


Perfect Potatoes 
Type | and Type II errors 


A potato chip producer and its main supplier agree that each shipment of potatoes 
must meet certain quality standards. If the producer determines that more than 
8% of the potatoes in the shipment have “blemishes,” the truck will be sent away 
to get another load of potatoes from the supplier. Otherwise, the entire truckload 
will be used to make potato chips. To make the decision, a supervisor will inspect 
a random sample of potatoes from the shipment. The producer will then perform 
a significance test using the hypotheses 


Ao: p = 0.08 
H,: p > 0.08 


where fp is the actual proportion of potatoes with blemishes in a given truckload. 


PROBLEM: Describe a Type | and a Type ll error in this setting, and explain the consequences of 
each. 


SOLUTION: A Type error occurs if we reject Ho when Hy is true. That would happen if the 
producer finds convincing evidence that the proportion of potatoes with blemishes is greater than 
0.08 when the actual proportion is 0.08 (or less). Consequence: The potato-chip producer sends the 
truckload of acceptable potatoes away, which may result in lost revenue for the supplier. Further- 
more, the producer will have to wait for another shipment of potatoes before producing the next 
batch of potato chips. 


A Type ll error occurs if we fail to reject Ho when H, is true. That would happen if the producer does 
Here’s a helpful reminder to keep the —_— not find convincing evidence that more than 5% of the potatoes in the shipment have blemishes when 
two types of errors straight. “Failto” that is actually the case. Consequence: The producer uses the truckload of potatoes to make potato 
SPS oi chips. More chips will be made with blemished potatoes, which may upset customers. 


For Practice Try Exercise 


Which is more serious: a Type I error or a Type II error? That depends on the 
situation. For the potato-chip producer, a Type II error could result in upset cus- 
tomers, leading to decreased sales. A Type I error, turning away a shipment even 
though 8% or less of the potatoes have blemishes, may not have much impact if 
additional shipments of potatoes can be obtained fairly easily. However, the sup- 
plier won't be too happy with a Type I error. 
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ay/ cutec YOUR UNDERSTANDING 


Refer to the “Better Batteries” example on page 546. 
1. Describe a Type I error in this setting. 
2. Describe a Type II error in this setting. 


3. Which type of error is more serious in this case? Justify your answer. 


Error Probabilities We can assess the performance of a significance test by 
looking at the probabilities of the two types of error. That’s because statistical in- 
ference is based on asking, “What would happen if I repeated the data-production 
process many times?” We cannot (without inspecting the whole truckload) guar- 
antee that good shipments of potatoes will never be sent away and bad shipments 
will never be used to make chips. But we can think about our chances of making 
each of these mistakes. 


Perfect Potatoes 
Type | error probability 


For the truckload of potatoes in the previous example, we were testing 


Ho: p = 0.08 
Heep = 0.08 
where pf is the proportion of all potatoes with blemishes in the shipment. 


Suppose that the potato-chip producer decides to carry out this test based on 
a random sample of 500 potatoes using a 5% significance level (a = 0.05). 


A Type I error is to reject Hp when Hp is actually true. If our sample results in 
a value of f that is much larger than 0.08, we will reject Ho. How large would 
p need to be? The 5% significance level tells us to count results that could happen 
less than 5% of the time by chance if Hp is true as evidence that Hp is false. 


Assuming Ho: = 0.08 is true, the sampling distribution of f will have 


Shape: Approximately Normal because np = 500(0.08) = 40 and n(l — p)= 
500(0.92) = 460 are both at least 10. 


Center 1; =p = 0,05 


lh 0.08(0.92 
Spread: of = ie = p) - uv = 0.01213, assuming that there are at 


least 10(500) = 5000 potatoes in the shipment. 


Figure 9.3 on the next page shows the Normal curve that approximates this sam- 
pling distribution. 


The shaded area in the right tail of Figure 9.3 is 5%. We used the calculator com- 
mand invNorm(area: .95, pl: .08, 0: .01213) to get the boundary value 
p = 0.10. Values of f to the right of the green line at 6 = 0.10 will cause us to reject 
Hp even though Hp is true. This will happen in 5% of all possible samples. That is, 
the probability of making a Type I error is 0.05. 
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Sampling distribution of 
p if Hy: p = 0.08 is true 


Fail to reject\'7) | Reject Hp 


If Ho is true, a decision to reject Hy 
based on the data is a Type I error. 


P(Type I error) = a 


T T T T T 
0.0437. 0.0558 0.0679 ~—-:0.0800~——:0.0921 


Values of p | 
p=0.10 


T T 
0.1042 0.1163 


FIGURE 9.3 The probability of a Type | error (shaded area) is the probability of rejecting 
Ho: p = 0.08 when Ah is actually true. 


The probability of a Type I error is the probability of rejecting Hp when it 
is true. As the previous example showed, this is exactly the significance level of 
the test. 


SIGNIFICANCE AND TYPE | ERROR 


The significance level a of any fixed-level test is the probability of a Type I 
error. That is, a is the probability that the test will reject the null hypothesis 
Ho when H) is actually true. Consider the consequences of a Type I error 
before choosing a significance level. 


What about Type II errors? We'll discuss them at the end of Section 9.2, after 
you have learned how to carry out a significance test. 


Summary 


e A significance test assesses the evidence provided by data against a null 
hypothesis Hp and in favor of an alternative hypothesis H,. 


e The hypotheses are usually stated in terms of population parameters. Often, 
Hp is a statement of no change or no difference. The alternative hypothesis 
states what we hope or suspect is true. 


e A one-sided alternative H, says that a parameter differs from the null hy- 
pothesis value in a specific direction. A two-sided alternative H, says that a 
parameter differs from the null value in either direction. 
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e The reasoning of a significance test is as follows. Suppose that the null hy- 
pothesis is true. If we repeated our data production many times, would we 
often get data as inconsistent with Hp, in the direction specified by H,, as the 
data we actually have? If the data are unlikely when Hp is true, they provide 
evidence against Hy and in favor of Hy. 


e The P-value of a test is the probability, computed supposing Hp to be true, 
that the statistic will take a value at least as extreme as the observed result in 
the direction specified by Hy. 


e Small P-values indicate strong evidence against Hp. To calculate a P-value, 
we must know the sampling distribution of the test statistic when Ho is true. 


e If the P-value is smaller than a specified value a (called the significance 
level), the data are statistically significant at level a. In that case, we can 
reject Hy and say that we have convincing evidence for H,. If the P-value is 
greater than or equal to a, we fail to reject Hy and say that we do not have 
convincing evidence for Hy. 


e¢ A Type I error occurs if we reject Hp when it is in fact true. In other words, 
the data give convincing evidence for H, when the null hypothesis is correct. 
A Type II error occurs if we fail to reject Hp when H, is true. In other words, 
the data don’t give convincing evidence for H,, even though the alternative 
hypothesis is correct. 


e Ina fixed level a significance test, the probability of a Type I error is the 
significance level a. 


Exercises 


In Exercises | to 6, each situation calls for a significance 
test. State the appropriate null hypothesis Ho and 
alternative hypothesis H, in each case. Be sure to define 
your parameter each time. 


handed. He wonders if the proportion of lefties at 
his large community college is really 12%. Simon 
chooses an SRS of 100 students and records whether 
each student is right- or left-handed. 


1 
1540 
© 


Attitudes ‘The Survey of Study Habits and Attitudes 
(SSHA) is a psychological test that measures students’ 
attitudes toward school and study habits. Scores 

range from 0 to 200. The mean score for U.S. college 
students is about 115. A teacher suspects that older 
students have better attitudes toward school. She gives 
the SSHA to an SRS of 45 of the over 1000 students 
at her college who are at least 30 years of age. 


Anemia Hemoglobin is a protein in red blood cells 
that carries oxygen from the lungs to body tissues. 
People with less than 12 grams of hemoglobin per 
deciliter of blood (g/dl) are anemic. A public health 
official in Jordan suspects that Jordanian children are 
at risk of anemia. He measures a random sample of 


50 children. 


Lefties Simon reads a newspaper report claiming 
that 12% of all adults in the United States are left- 


Don’t argue! A Gallup Poll report revealed that 
72% of teens said they seldom or never argue with 
their friends.” Yvonne wonders whether this result 
holds true in her large high school. So she surveys a 
random sample of 150 students at her school. 


Cold cabin? During the winter months, the temper- 
atures at the Colorado cabin owned by the Starnes 
family can stay well below freezing (32°F or 0°C) for 
weeks at a time. To prevent the pipes from freezing, 
Mrs. Starnes sets the thermostat at 50°F. The manu- 
facturer claims that the thermostat allows variation in 
home temperature of o = 3°F. Mrs. Starnes suspects 
that the manufacturer is overstating how well the 
thermostat works. 


Ski jump When ski jumpers take off, the distance 
they fly varies considerably depending on their 
speed, skill, and wind conditions. Event organizers 
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must position the landing area to allow for differ- 
ences in the distances that the athletes fly. For a 
particular competition, the organizers estimate that 
the variation in distance flown by the athletes will be 
o = 10 meters. An experienced jumper thinks that 
the organizers are underestimating the variation. 


In Exercises 7 to 10, explain what’s wrong with the stated 
hypotheses. Then give correct hypotheses. 


7. Better parking A change is made that should 
improve student satisfaction with the parking 
situation at a local high school. Right now, 37% of 
students approve of the parking that’s provided. The 
null hypothesis Ho: p > 0.37 is tested against the 
alternative H,:p = 0.37. 


8. Better parking A change is made that should im- 
prove student satisfaction with the parking situation 
at your school. Right now, 37% of students approve 
of the parking that’s provided. The null hypothesis 
Ho: p = 0.37 is tested against the alternative 
Hy: p # 0.37. 


9. Birth weights In planning a study of the birth 


weights of babies whose mothers did not see a doctor 
before delivery, a researcher states the hypotheses as 


Ho:x = 1000 grams 
H,:x < 1000 grams 


10. Birth weights In planning a study of the birth 
weights of babies whose mothers did not see a doctor 


before delivery, a researcher states the hypotheses as 
Ho: 46 < 1000 grams 


H,: 6 = 900 grams 
. Attitudes In the study of older students’ attitudes 
from Exercise 1, the sample mean SSHA score was 
125.7 and the sample standard deviation was 29.8. 
A significance test yields a P-value of 0.0101. 


(a) Explain what it would mean for the null hypothesis 
to be true in this setting. 

(b) Interpret the P-value in context. 

12. Anemia For the study of Jordanian children in 


Exercise 2, the sample mean hemoglobin level was 
11.3 g/dl and the sample standard deviation was 


1.6 g/dl. A significance test yields a P-value of 0.0016. 


(a) Explain what it would mean for the null hypothesis 
to be true in this setting. 

(b) Interpret the P-value in context. 

13. Lefties Refer to Exercise 3. In Simon’s SRS, 16 of the 


students were left-handed. A significance test yields a 
P-value of 0.2184. What conclusion would you make 
if a = 0.10? If a = 0.05? Justify your answers. 
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14. Don’t argue! Refer to Exercise +. For Yvonne’s 
survey, 96 students in the sample said they rarely or 
never argue with friends. A significance test yields a 
P-value of 0.0291. What conclusion would you make 


if a = 0.05? If a = 0.01? Justify your answers. 


15. Attitudes Refer to Exercise 11. What conclusion 
would you make if a = 0.05? If a = 0.01? Justify 


your answers. 


16. Anemia Refer to Exercise 12. What conclusion 
would you make if a = 0.05? If a = 0.01? Justify 


your answers. 


17. Interpreting a P-value When asked to explain the 
meaning of the P-value in Exercise 13, a student says, 
“This means there is about a 22% chance that the 
null hypothesis is true.” Explain why the student's 


explanation is wrong. 


18. Interpreting a P-value When asked to explain the 
meaning of the P-value in Exercise 14, a student says, 
“There is a 0.0291 probability of getting a sample 
proportion of 6 = 96/150 = 0.64 by chance alone.” 


Explain why the student’s explanation is wrong. 


19. Drawing conclusions A student performs a test of 
Ho: p = 0.75 versus H,: p > 0.75 and gets a P-value 
of 0.99. The student writes: “Because the P-value is 
greater than 0.75, we reject Ho. The data prove that H, 


is true.” Explain what is wrong with this conclusion. 


20. Drawing conclusions A student performs a test of 
Ho: p = 0.5 versus H,: p # 0.5 and gets a P-value 

of 0.63. The student writes: “Because the P-value 

is greater than a = 0.05, we accept Hp. The data 
provide convincing evidence that the null hypothesis 


is true.” Explain what is wrong with this conclusion. 


Exercises 21 and 22 refer to the following setting. Slow 
response times by paramedics, firefighters, and policemen 
can have serious consequences for accident victims. In 
the case of life-threatening injuries, victims generally 
need medical attention within 8 minutes of the accident. 
Several cities have begun to monitor emergency response 
times. In one such city, the mean response time to all 
accidents involving life-threatening injuries last year was 

jt = 6.7 minutes. Emergency personnel arrived within 

8 minutes on 78% of all calls involving life-threatening 
injuries last year. The city manager shares this information 
and encourages these first responders to “do better.” At the 
end of the year, the city manager selects an SRS of 400 
calls involving life-threatening injuries and examines the 
response times. 


21. 
(a) 


Awful accidents 


State hypotheses for a significance test to determine 
whether the average response time has decreased. Be 
sure to define the parameter of interest. 


2s 


Diy 


Describe a ‘Type I error and a Type II error in this 
setting, and explain the consequences of each. 


Which is more serious in this setting: a Type I error 
or a ‘lype II error? Justify your answer. 


Awful accidents 


State hypotheses for a significance test to determine 
whether first responders are arriving within 8 minutes of 
the call more often. Be sure to define the parameter 
of interest. 


Describe a ‘Type I error and a Type II error in this 
setting and explain the consequences of each. 


Which is more serious in this setting: a Type I error 
or a ‘lype II error? Justify your answer. 


Opening a restaurant You are thinking about opening 
a restaurant and are searching for a good location. 
From research you have done, you know that the mean 
income of those living near the restaurant must be 

over $85,000 to support the type of upscale restaurant 
you wish to open. You decide to take a simple random 
sample of 50 people living near one potential location. 
Based on the mean income of this sample, you will 
decide whether to open a restaurant there.° 


State appropriate null and alternative hypotheses. Be 
sure to define your parameter. 


Describe a Type I and a ‘Type II error, and explain 
the consequences of each. 


Blood pressure screening Your company markets 
a computerized device for detecting high blood 
pressure. The device measures an individual’s blood 
pressure once per hour at a randomly selected time 
throughout a 12-hour period. Then it calculates the 
mean systolic (top number) pressure for the sample 
of measurements. Based on the sample results, the 
device determines whether there is convincing 
evidence that the individual’s actual mean systolic 
pressure is greater than 130. If so, it recommends 
that the person seek medical attention. 


State appropriate null and alternative hypotheses in 
this setting. Be sure to define your parameter. 


Describe a Type and a ‘Type II error, and explain 
the consequences of each. 


Multiple choice: Select the best answer for Exercises 25 to 28. 


25% 


Experiments on learning in animals sometimes 
measure how long it takes mice to find their way 
through a maze. The mean time is 18 seconds for 
one particular maze. A researcher thinks that a loud 
noise will cause the mice to complete the maze 
faster. She measures how long each of 10 mice takes 
with a noise as stimulus. The appropriate hypotheses 
for the significance test are 


Section 9.1 Significance Tests: The Basics 


553 


Ho: = 18; Hy:p # 18. 
Ho: = 18; Hy: > 18. 
Ho: u< 18; Hy: = 18. 
Ho: = 18; Hy: < 18. 
Ho:x = 18; Hy: x < 18. 


Exercises 26-28 refer to the following setting. Members of 
the city council want to know if a majority of city residents 
supports a 1% increase in the sales tax to fund road repairs. 
To investigate, they survey a random sample of 300 city resi- 
dents and use the results to test the following hypotheses: 


Ho:p = 0.50 
H,:p > 0.50 


where p is the proportion of all city residents who support 
a 1% increase in the sales tax to fund road repairs. 


26. 


(a) 


Palle 


A Type I error in the context of this study occurs if 
the city council 


finds convincing evidence that a majority of residents 
supports the tax increase, when in reality there isn’t con- 
vincing evidence that a majority supports the increase. 


finds convincing evidence that a majority of residents 
supports the tax increase, when in reality at most 
50% of city residents support the increase. 


finds convincing evidence that a majority of residents 
supports the tax increase, when in reality more than 
50% of city residents do support the increase. 


does not find convincing evidence that a majority of 
residents supports the tax increase, when in reality more 
than 50% of city residents do support the increase. 


does not find convincing evidence that a majority of 
residents supports the tax increase, when in reality at 
most 50% of city residents do support the increase. 


In the sample, f = 158/300 = 0.527. The resulting 
P-value is 0.18. What is the correct interpretation of 
this P-value? 


Only 18% of the city residents support the tax increase. 


There is an 18% chance that the majority of residents 
supports the tax increase. 


Assuming that 50% of residents support the tax 
increase, there is an 18% probability that the sample 
proportion would be 0.527 or higher by chance alone. 


Assuming that more than 50% of residents support the 
tax increase, there is an 18% probability that the sample 
proportion would be 0.527 or higher by chance alone. 


Assuming that 50% of residents support the tax 
increase, there is an 18% chance that the null 
hypothesis is true by chance alone. 
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28. 
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Based on the P-value in Exercise 27, which of the 
following would be the most appropriate conclusion? 


Because the P-value is large, we reject Hp. We have 
convincing evidence that more than 50% of city 
residents support the tax increase. 


Because the P-value is large, we fail to reject Hp. We 
have convincing evidence that more than 50% of city 
residents support the tax increase. 


Because the P-value is large, we reject Hp. We have 
convincing evidence that at most 50% of city resi- 
dents support the tax increase. 


Because the P-value is large, we fail to reject Hp. We 
have convincing evidence that at most 50% of city 
residents support the tax increase. 


Because the P-value is large, we fail to reject Hp. We 
do not have convincing evidence that more than 
50% of city residents support the tax increase. 


Women in math (5.3) Of the 24,611] degrees in math- 


 ematics given by U.S. colleges and universities in a re- 


cent year, 70% were bachelor’s degrees, 24% were mas- 
ter’s degrees, and the rest were doctorates. Moreover, 


women earned 43% of the bachelor’s degrees, +1% of 
the master’s degrees, and 29% of the doctorates.’ 


How many of the mathematics degrees given in this 
year were earned by women? Justify your answer. 


Are the events “degree earned by a woman” and 
“degree was a bachelor’s degree” independent? 
Justify your answer using appropriate probabilities. 


If you choose 2 of the 24,611 mathematics degrees at 
random, what is the probability that at least | of the 
2 degrees was earned by a woman? Show your work. 


Explaining confidence (8.2) Here is an explanation 
from a newspaper concerning one of its opinion 
polls. Explain what is wrong with the following 
statement. 


For a poll of 1,600 adults, the variation due to sam- 
pling error is no more than three percentage points 
either way. The error margin is said to be valid at the 
95 percent confidence level. This means that, if the 
same questions were repeated in 20 polls, the results 
of at least 19 surveys would be within three percentage 
points of the results of this survey. 


Tests about a Population 


Proportion 


By the end of the section, you should be able to: 


Interpret the power of a test and describe what factors 
affect the power of a test. 

Describe the relationship among the probability of a 
Type | error (significance level), the probability of a 
Type Il error, and the power of a test. 


WHAT YOU WILL LEARN 


e State and check the Random, 10%, and Large Counts e 
conditions for performing a significance test about a 


population proportion. e 


Perform a significance test about a population 
proportion. 


Confidence intervals and significance tests are based on the sampling distribu- 
tions of statistics. That is, both use probability to say what would happen if we 
used the inference method many times. Section 9.1 presented the reasoning of 
significance tests, including the idea of a P-value. In this section, we focus on the 
details of testing a claim about a population proportion. 


Carrying Out a Significance Test 


In Section 9.1, we met a virtual basketball player who claimed to make 80% of 
his free throws. We thought that he might be exaggerating. In an SRS of 50 shots, 
the player made only 32. His sample proportion of made free throws was therefore 


32 
p= — = 0.64 
p 50 0.6 
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This result is much lower than what he claimed. Does it provide convincing evi- 
dence against the player’s claim? To find out, we need to perform a significance 
test of 


Ho: p = 0.80 
Hy: p < 0.80 


where p = the actual proportion of free throws that the shooter makes in the long run. 


Conditions In Chapter 8, we introduced conditions that should be met be- 
fore we construct a confidence interval for an unknown population proportion: 
Random, 10% when sampling without replacement, and Large Counts. These 
same conditions must be verified before carrying out a significance test. 

The Large Counts condition for proportions requires that both np and n(1 — p) 
be at least 10. Because we assume Hy is true when performing a significance test, 
we use the parameter value specified by the null hypothesis (sometimes called 
po) when checking the Large Counts condition. In this case, the Large Counts 
condition says that the expected count of successes np and failures n(1 — po) are 
both at least 10. 


CONDITIONS FOR PERFORMING 

A SIGNIFICANCE TEST ABOUT A PROPORTION 

e Random: The data come from a well-designed random sample or ran- 
domized experiment. 


1 
° 10%: When sampling without replacement, check thatn = 70 N. 
e Large Counts: Both npp and n(1 — po) are at least 10. 


Here’s an example that shows how to check the conditions. 


I’m a Great Free-Throw Shooter! 


Checking conditions 


PROBLEM: Check the conditions for performing a significance test of the virtual basketball 
player's claim. 


SOLUTION: The required conditions are 


* Random: We can view this set of 50 computer-generated shots as a simple random sample from 
the population of all possible shots that the virtual shooter takes. 


0 10%: We're not sampling without replacement from a finite population (because the applet can 
keep on shooting), so we don’t need to check the 10% condition. (Note that the outcomes of 
individual shots are independent because they are determined by the computer's random number 
generator.) 

Large Counts: Assuming Hois true, p = 0.80. Then npo = (50)(0.80) = 40 and n(1 — po) = 
(50)(0.20) = 10 are both at least 10, So this condition is met. 


For Practice Try Exercise 


556 CHAPTER 9 TESTING A CLAIM 


If the null hypothesis Hp:p = 0.80 is true, then the player’s 
sample proportion p of made free throws in an SRS of 50 shots 
would vary according to an approximately Normal sampling 
distribution with mean jug = p = 0.80 and standard deviation 


7 - (a? = [0801020 —pusee 


Figure 9.4 displays this distribution. We have added the play- 


N(0.80,0.0566) 


er’s sample result, 6 = ir = 0.64. 
T T T T T T T 


0.6302 0.6868 0.7434 0.8000 0.8566 0.9132 09698 Calculations: Test Statistic and P-Value A sig- 
nificance test uses sample data to measure the strength of evi- 


p = 0.64 dence against Ho and in favor of H,. Here are some principles 
FIGURE 9.4 Normal approximation to the sampling that apply to most tests: 
distribution of the proportion 6 of made shots in random é 


The test compares a statistic calculated from sample data 


Salnbles al alias MiOWs Ry an OOo shoolet: with the value of the parameter stated by the null hypothesis. 


e Values of the statistic far from the null parameter value in the direction speci- 
fied by the alternative hypothesis give strong evidence against Hp. 


e To assess how far the statistic is from the parameter, standardize the statistic. 
This standardized value is called the test statistic: 
On the AP® exam formula sheet, 


this value is referred to as the test statistic = 
“standardized test statistic.” standard deviation of statistic 


statistic — parameter 


CT 


DEFINITION: Test statistic 
A test statistic measures how far a sample statistic diverges from what we would 
expect if the null hypothesis Hp were true, in standardized units. That is, 


statistic — parameter 
standard deviation of statistic 


test statistic = 


The test statistic says how far the sample result is from the null parameter value, 
and in what direction, on a standardized scale. You can use the test statistic to find 
the P-value of the test, as the following example shows. 


I’m a Great Free-Throw Shooter! 


Computing the test statistic 
PROBLEM: Inan SRS of 50 free throws, the virtual player made 32. 
(a) Calculate the test statistic. 


(b) Find the P-value using Table A or technology. Show this result as an area under a standard Normal 
curve. 
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SOLUTION: 
(a) His sample proportion of made shots is p = 0.64. Standardizing, we get 


statistic — parameter 


test statistic = — — 
standard deviation of statistic 


064-080 064-080 _ 


(0.80)(0.20) 0.0566 
50 


(b) The shaded area under the curve in Figure 9.5(a) shows the P-value. Figure 9.5(b) shows the 
corresponding area on the standard Normal curve, which displays the distribution of the ztest 
statistic. From Table A, we find that the P-value is P(Z = —2.83) = 0.0023. 

Using technology: The command normalcdf (lower: —10000, upper: —2.83, 
[4: 0, 0:1) also givesa P-value of 0.0023. 


2.83 


) 


P-value = 0.0023 P-value = 0.0023 


0.6302 0.6868 9.7434 0.8000 0.8566 0.9132 0.9698 -3 -42 a A 0 1 Za 3 


A 
p= 0.64 


FIGURE 9.5 The shaded area shows the P-value for the player’s sample proportion of made 
shots (a) on the Normal approximation to the sampling distribution of 6 from Figure 9.4 and 
(b) on the standard Normal curve. 


If Ho is true and the player makes 80% of his free throws in the long run, there’s only about a 0.0023 
probability that he would make 32 or fewer of 50 shots by chance alone. 


For Practice Try Exercise 


Earlier, we used simulation to estimate the P-value as 0.0075. As the example 
shows, the P-value is even smaller, 0.0023. So if Ho is true, and the player makes 
80% of his free throws in the long run, there’s only about a 0.0023 probability 
that the player would make 32 or fewer of 50 shots by chance alone. This result 
confirms our earlier decision to reject Ho and gives convincing evidence that the 
player is exaggerating. 


The One-Sample z Test for a Proportion 


To perform a significance test, we state hypotheses, check conditions, calculate 
a test statistic and P-value, and draw a conclusion in the context of the problem. 
The four-step process is ideal for organizing our work. 
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STEP A 


The AP® Statistics course outline 
Calls this test a Jarge-sample test for 
a proportion because it is based on a 
Normal approximation to the sampling 
distribution of 6 that becomes more 


accurate as the sample size increases. 


TESTING A CLAIM 


SIGNIFICANCE TESTS: A FOUR-STEP PROCESS 


State: What hypotheses do you want to test, and at what significance level? 
Define any parameters you use. 


Plan: Choose the appropriate inference method. Check conditions. 

Do: If the conditions are met, perform calculations. 

¢ Compute the test statistic. 

e Find the P-value. 

Conclude: Make a decision about the hypotheses in the context of the problem. 


When the conditions are met—Random, 10%, and Large Counts—the sam- 
pling distribution of f is approximately Normal with 


pl — p) 


mean fj =p and _ standard deviation oj = - 


For confidence intervals, we substitute f for p in the standard deviation formula to 
obtain the standard error. When performing a significance test, however, the null 
hypothesis specifies a value for p, which we call fo. We assume that this value is 
correct when performing our calculations. 

If we standardize the statistic / by subtracting its mean and dividing by its stan- 
dard deviation, we get the test statistic: 


statistic — parameter 


test statistic = aoe at 
standard deviation of statistic 


p — po 
po(l — po) 


n 


Z= 


This z statistic has approximately the standard Normal distribution when Ho 
is true and the conditions are met. P-values therefore come from the standard 
Normal distribution. 

Here is a summary of the details for a one-sample z test for a proportion. 


ONE-SAMPLE z TEST FOR A PROPORTION 


Suppose the conditions are met. To test the hypothesis Ho: p = po, compute 
the z statistic 
Pp — Po 
po(l — po) 
n 


Find the P-value by calculating the probability of getting a z statistic this large 
or larger in the direction specified by the alternative hypothesis H,: 


H,:P > Po Hy: P<Po H,:P # Do 


z z —|z| Iz| 


Section 9.2 Tests about a Population Proportion 4,559 


Here is an example of the test in action. 


One Potato, Two Potato 
Performing a significance test about p 


The potato-chip producer of Section 9.1 has just received a truckload of potatoes 
from its main supplier. Recall that if the producer finds convincing evidence that 
more than 8% of the potatoes in the shipment have blemishes, the truck will be 
sent away to get another load from the supplier. A supervisor selects a random sam- 
ple of 500 potatoes from the truck. An inspection reveals that 47 of the potatoes 
have blemishes. Carry out a significance test at the a = 0.05 significance level. 
What should the producer conclude? 


STATE: We want to perform a test of 
Ho i 0.08 
H,:p > 0.08 


where pis the actual proportion of potatoes in this shipment with blemishes. We'll use an vw = 0.05 
significance level. 


PLAN: If conditions are met, we should do a one-sample ztest for 
the population proportion p. 


* Random: The supervisor took a random sample of 500 potatoes 
from the shipment. 
© 102: \t seems reasonable to assume that there are at least 
10(500) = 5000 potatoes in the shipment. 
* Large Counts: Assuming Ho: p = 0.08 is true, the expected counts 
of blemished and unblemished potatoes are npp = 500(0.08) = 40 
and n(1 — po) = 500(0.92) = 460, respectively. Because both of 
these values are at least 10, we should be safe doing Normal calculations. 


\ | Area = 0.1254 


Z=115 


FIGURE 9.6 The P-value for the one-sided test. 


AP® EXAM TIP Whena 
significance test leads to 

a fail to reject Hy decision, 
as in this example, be sure 
to interpret the results as 
“we don’t have convincing 
evidence to conclude H,.” 
Saying anything that sounds 
like you believe H, is (or 
might be) true will lead to 

a loss of credit. And don’t 
write text-message-type 
responses, like “FTR the Ho.” 


DO: The sample proportion of blemished potatoes is p = 47/500 = 
0.094 


P—-Po _ 0.094—0.08 _ 


= = 
(a — po) " 0.08(0.92) 
n 500 


* P-value Figure 9.6 displays the P-value as an area under the standard Normal curve for this 
one-sided test. Table A gives the P-value as F(Z = 1.15) = 1 — 0.6749 = 0.1251. 

° Using technology: The command normalcdf (lower:1.15, upper:10000, w:0, 
a: 1) also gives a P-value of 0.1251. 


° Test statistic z= 


CONCLUDE: Because our P-value, 0.1251, is greater than a = 0.05, we fail to reject Ho. There 
is not convincing evidence that the shipment contains more than 5% blemished potatoes. Asa result, 
the producer will use this truckload of potatoes to make potato chips. 


For Practice Try Exercise 
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The preceding example reminds us why significance tests are important. The 
sample proportion of blemished potatoes was f = 47/500 = 0.094. This result 
gave evidence against Ho in favor of H,. To see whether such an outcome is un- 
likely to occur just by chance when Hp is true, we had to carry out a significance 
test. The P-value told us that a sample proportion this large or larger would occur 
in about 12.5% of all random samples of 500 potatoes when Hp is true. So we 
can’t rule out sampling variability as a plausible explanation for getting a sample 
proportion of p = 0.094. 


What happens when the data don’t support H,? Suppose the 
THINK ace act her ee : 
pervisor had inspected a random sample of 500 potatoes from the shipment and 
ABOUT IT found 33 with blemishes. This yields a sample proportion of / = 33/500 = 0.066. 
Such a sample doesn’t give any evidence to support the alternative hypothesis 
H,:p > 0.08! There’s no need to continue with the significance test. The conclu- 
sion is clear: we should fail to reject Ho: p = 0.08. This truckload of potatoes will 
be used by the potato-chip producer. 
If you weren’t paying attention, you might end up carrying out the test. Let’s see 
what would happen. The corresponding test statistic is 


b-po 0.066 - 0.08 | 


— pe = a ao 
n 500 


What's the P-value? It’s the probability of getting a z statistic this 
large or larger in the direction specified by H,, P(Z = —1.15). 
Figure 9.7 shows this P-value as an area under the standard 
Normal curve. Using Table A or technology, the P-value is 
1 — 0.1251 = 0.8749. There’s about an 87.5% chance of get- 
ting a sample proportion as large as or larger than p = 0.066 if 
p = 0.08. As a result, we would fail to reject Hp. Same conclu- 
sion, but with lots of unnecessary work! 


Area = 0.8749 


3 2 " 0 1 2 3 e7z— oH 

ge-115 Always check to see whether the data give evidence 

FIGURE 9.7 The P-value for the one-sided test. against Hp in the direction specified by H, before you do 
calculations. 


CHECK YOUR UNDERSTANDING 


According to the National Campaign to Prevent Teen and Unplanned Pregnancy, 20% 
of teens aged 13 to 19 say that they have electronically sent or posted sexually suggestive 
images of themselves.* The counselor at a large high school worries that the actual figure 
might be higher at her school. To find out, she administers an anonymous survey to a ran- 
dom sample of 250 of the school’s 2800 students. All 250 respond, and 63 admit to sending 
or posting sexual images. Carry out a significance test at the a = 0.05 significance level. 
What conclusion should the counselor draw? 


Your calculator will handle the “Do” part of the four-step process, as the follow- 
ing Technology Corner illustrates. However, be sure to read the AP® Exam Tip 
on the next page. 
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8 CORNER’ ONE-PROPORTION z TEST ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


The ‘T1-83/84 and TI-89 can be used to test a claim about a population proportion. We'll demonstrate using the 


previous example. In a random sample of size n = 500, the supervisor found X = 47 potatoes with blemishes. ‘To 
perform a significance test: 


TI-83/84 TI-89 


e Press[stat], then choose TESTS and e In the Statistics/List Editor, press [2nda][F1] ([F'6]) 

1-PropZTest. and choose 1-PropZTest. 
On the 1-PropZTest screen, enter the values shown: pp = 0.08, x = 47, and n = 500. Specify the alternative hypoth- 
esis as “prop > po.” Note: x is the number of successes and n is the number of trials. Both must be whole numbers! 


pep scrrvorion cre 
Pa: ea ee 


successes: x: [7 ____] 
; Real n 
Colon ENS Alternate Hye: Fror > FO+ 
Calculate Draw Results: pedicu lobe ba 
<Enters0k > 


lis [ ; 
USE € AND > 10 DFEN CHOICE 


e Ifyou select “Calculate” and press [ENTER] you will see that the test statistic is z = 1.15 and the P-value is 0.1243. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


Prop>.@8 

z=1. 153915828 
p=. 1242673934 
b=. 094 

n=S580 


11st1=(0,1,2.5,4,5,6,7,5, _ 
Main RAD AUTO FUNC 1/6 


e If you select the “Draw” option, you will see the screen shown here. Compare these results with those in the example 


on page 559. 


1-Prop2test 
221.1539 p=.1243 


AP® EXAM TIP You can use your calculator to carry out the mechanics of a significance 
test on the AP® exam. But there’s a risk involved. If you just give the calculator answer with 


no work, and one or more of your values is incorrect, you will probably get no credit for the 
“Do” step. If you opt for the calculator-only method, be sure to name the procedure (one- 
proportion z test) and to report the test statistic (z= 1.15) and P-value (0.1243). 
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Two-Sided Tests 


Both the free-throw shooter and blemished-potato examples involved one-sided 
tests. The P-value in a one-sided test is the area in one tail of a standard Normal 
distribution —the tail specified by H,. In a two-sided test, the alternative hypothesis 
has the form H,:p # po. The P-value in such a test is the probability of getting a 
sample proportion as far as or farther from fo in either direction than the observed 
value of f. As a result, you have to find the area in both tails of a standard Normal 


distribution to get the P-value. The following example shows how this process works. 
r 
STEP 
Nonsmokers 4 
A two-sided test L. 


According to the Centers for Disease Control and Prevention (CDC) Web site, 50% 
of high school students have never smoked a cigarette. Taeyeon wonders whether 
this national result holds true in his large, urban high school. For his AP® Statistics 
class project, ‘Taeyeon surveys an SRS of 150 students from his school. He gets re- 
sponses from all 150 students, and 90 say that they have never smoked a cigarette. 
What should Taeyeon conclude? Give appropriate evidence to support your answer. 


STATE: We want to perform a significance test using the hypotheses 
Ho: p= 0.50 
H,: p # 0.50 
where p = the proportion of all students in Taeyeon’s school who would say they have never smoked 
cigarettes. Because no significance level was stated, we will use « = 0.05. 
PLAN: If conditions are met, we'll do a one-sample z test for the population proportion p. 
° Random: Taeyeon surveyed an SRS of 150 students from his school. 


° 10%: It seems reasonable to assume that there are at least 10(150) = 1500 studentsina 
large high school. 


* Large Counts: Assuming Ho : p = 0.50 is true, the expected 
counts of smokers and nonsmokers in the sample are npo = 
150(0.50) = 75 and n(1 — po) = 150(0.50) = 75. Because 


both of these values are at least 10, we should be safe doing 


Normal calculations. 
DO: The sample proportion is p = 90/150 = 0.60. 
* Test statistic 


P-value = 2(0.0071) 
= 0.0142 


P—Po 060-050 _ 


= 2.45 
(a — po) ae 
n 150 


* P-value Figure 9.6 displays the P-value as an area under the 
3 a “1 0 7 - 3 standard Normal curve for this two-sided test. To compute this 

P-value, we find the area in one tail and double it. Table A gives 
P(z= 2.45) = 0.0071 (the right-tail area). So the desired 
FIGURE 9.8 The P-value for the two-sided test. P-value is 2(0.0071) = 0.0142. 


Z=-245 Za245 
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Using technology: The calculator's 1-PropZTest gives z= 2.449 and P-value = 0.0143. 


CONCLUDE: Because our P-value, 0.0143, is less than a = 0.05, we reject Ho. We have 
convincing evidence that the proportion of all students at Taeyeon’s school who would say they have 
never smoked differs from the national result of 0.50. 


For Practice Try Exercise 


CHECK YOUR UNDERSTANDING 


According to the National Institute for Occupational Safety and Health, job stress poses 
a major threat to the health of workers. A news report claims that 75% of restaurant em- 
ployees feel that work stress has a negative impact on their personal lives.” Managers of a 
large restaurant chain wonder whether this claim is valid for their employees. A random 
sample of 100 employees finds that 68 answer “Yes” when asked, “Does work stress have a 
negative impact on your personal life?” Is this good reason to think that the proportion of 
all employees in this chain who would say “Yes” differs from 0.75? Support your answer 
with a significance test. 


Why Confidence Intervals Give 
More Information 


The result of a significance test begins with a decision to reject Hp or fail to reject 
Hp. In Taeyeon’s smoking study, for instance, the data led us to reject Ho: p = 0.50 
because we found convincing evidence that the proportion of students at his school 
who would say they have never smoked cigarettes differs from the national value. 
We're left wondering what the actual proportion p might be. A confidence interval 
might shed some light on this issue. 


Nonsmokers 


A confidence interval gives more info 


Taeyeon found that 90 of an SRS of 150 students said that they had never smoked 
a cigarette. We checked the conditions for performing the significance test earlier. 
Before we construct a confidence interval for the population proportion p, we 
should check that both nf and n(1 — f) are at least 10. Because the number of 
successes and the number of failures in the sample are 90 and 60, respectively, we 
can proceed with calculations. 


Our 95% confidence interval is 


pulp 


p) 


0.60(0.40) 
= + fe scicemraalsa aS 
zs 0.60 + 1.96 150 


We are 95% confident that the interval from 0.522 to 0.678 captures the true pro- 
portion of students at Taeyeon’s high school who would say that they have never 
smoked a cigarette. 


H+ 2° 


= 0.60 + 0.078 = (0.522, 0.678) 
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The confidence interval in this example is much more informative than the 
significance test we performed earlier. The interval gives the values of p that are 
plausible based on the sample data. We would not be surprised if the true propor- 
tion of students at Taeyeon’s school who would say they have never smoked ciga- 
rettes was as low as 0.522 or as high as 0.678. However, we would be surprised if 
the true proportion was 0.50 because this value is not contained in the confidence 
interval. Figure 9.9 gives computer output from Minitab software that includes 
both the results of the significance test and the confidence interval. 


Test and Cl for One Proportion f| 
Test of p = 0.5 vs p not = 0.5 2, 
Sample <X N Sample p 95% CI Z-Value P-Value 
1 90 150 0.600000 (0.521601, 0.678399) 2.45 0.014 

hI 
(6) | id) 12) os: 


FIGURE 9.9 Minitab output for the two-sided significance test at a = 0.05 and a 95% confidence 
interval for Taeyeon’s smoking study. 


There is a link between confidence intervals and two-sided tests. The 95% con- 
fidence interval (0.522, 0.678) gives an approximate set of o's that would not be 
rejected by a two-sided test at the a = 0.05 significance level. With proportions, 
the link isn’t perfect because the standard error used for the confidence interval 
is based on the sample proportion f, while the denominator of the test statistic is 
based on the value fp from the null hypothesis. 


ee Ts 
Test statistic: z = a2 Confidence interval: p + z* Pil — Pp) 


po(l = po) 7 


n 


The big idea is still worth considering: a two-sided test at significance level a and 
a 100(1 — a)% confidence interval (a 95% confidence interval if a = 0.05) give 
similar information about the population parameter. 


CHECK YOUR UNDERSTANDING 


The figure below shows Minitab output from a significance test and confidence interval 
for the restaurant worker data in the previous Check Your Understanding (page 563). 
Explain how the confidence interval is consistent with, but gives more information than, 
the test. 


Test and Confidence Interval for One Proportion 
Test of p = 0.75 vs p not = 0.75 
Sample X N Sample p 95.0 % CI Z-Value P-Value 


L 68 100 0.680000 (0.588572, 0.771428) -1.62 0.106 
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Type Il Error and the Power of a Test 


A significance test makes a Type II error when it fails to reject a null hypothesis Ho 
that really is false. There are many values of the parameter that make the alterna- 
tive hypothesis H, true, so we concentrate on one value. Consider the potato-chip 
example on page 559 that involved a test of Ho: p = 0.08 versus H,: p > 0.08. If 
the true proportion of blemished potatoes in the shipment was p = 0.09, we made 
a Type I error by failing to reject Hp based on the sample data. Of course, we also 
made a Type II error if p = 0.11 orp = 0.15. 

The probability of making a Type II error depends on several factors, includ- 
ing the actual value of the parameter. In the potato-chip example, our test will be 
more likely to reject Hp: p = 0.08 in favor of H,: p > 0.08 if the true proportion of 
blemished potatoes in the shipment is p = 0.11 than if it is p = 0.09. Why? because 
p = 0.11 is farther away from the null value than is p = 0.09. So we will be less likely 
to make a Type II error if 11% of potatoes in the shipment are blemished than if only 
9% are blemished. A high probability of Type II error for a specific alternative param- 
eter value means that the test is not sensitive enough to usually detect that alternative. 

It is more common to report the probability that a significance test does re- 
ject Hp when an alternative parameter value is true. This probability is called the 
power of the test against that specific alternative. 


DEFINITION: Power 


The power of a test against a specific alternative is the probability that the test will 
reject Hp at a chosen significance level a when the specified alternative value of the 
parameter is true. 


As the following example illustrates, ‘Type II error and power are closely linked. 


Perfect Potatoes 
Type II error and power 


The potato-chip producer wonders whether the significance test of Hp: p = 0.08 ver- 
sus H,:p > 0.08 based on a random sample of 500 potatoes has enough power to 
detect a shipment with, say, 11% blemished potatoes. In this case, a particular Type II 
error is to fail to reject Hp: p = 0.08 when p = 0.11. Figure 9.10 on the next page shows 
two sampling distributions of f, one when p = 0.08 and the other when p = 0.11. 


Earlier, we decided to reject Ho if our sample yielded a value of f to the right of 
the green line at = 0.10. That decision was based on using a significance level 
(Type I error probability) of a = 0.05. 


Now look at the sampling distribution for p = 0.11. The shaded area to the right of 
the green line represents the probability of correctly rejecting Hp when p = 0.11. 
That is, the power of this test to detect p = 0.11 is about 0.76. In other words, the 
potato-chip producer has roughly a 3-in-4 chance of rejecting a truckload with 11% 
blemished potatoes based on a random sample of 500 potatoes from the shipment. 


We would fail to reject Ho if the sample proportion f falls to the left of the green 
line. The white area under the bottom Normal distribution shows the probability 
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of failing to reject Hp when Hh is false. This is the probability of a Type II error. 
The potato-chip producer has about a 1-in-4 chance of failing to send away a 
shipment with 11% blemished potatoes. 


sampling distribution of p if 
Hy: p = 0.08 is true 


Normal approximation to the | 


= 


Reject Hp 


If Hy is true, a decision to reject Hy 
based on the data is a Type I error. 


P(Type I error) = @ 


T T T T T T T 
0.0437 0.0558 0.0679 0.0800 0.0921 0.1042 0.1163 


Values of p | 
p=0.10 


Normal approximation to the 
sampling distribution of p if 
His false and p = 0.11 is true 


If Ho is false, a decision to fail to reject 
H based on the data is a Type II error. 


P(Type I error) 
= 1-0.7626 
= 0.2374 


The power of the test to 
detect that p = 0.11 


0.0680 0.0820 0.0960 0.1100 0.1240 0.1380 0.1520 
Values of p 


p=0.10 


FIGURE 9.10 In the bottom graph, the power of the test (shaded area) is the probability that it 
correctly rejects Hy: p = 0.08 when the truth is p = 0.11. In this case, power = 0.7626. The 
probability of making a Type II error (white area) is 1 — 0.7626 = 0.2374. 


After reading the example, you might be wondering whether 0.76 is a high 
power or a low power. That depends on how certain the potato-chip producer 
wants to be to detect a shipment with 11% blemished potatoes. The power of a 
test against a specific alternative value of the parameter (like p = 0.11) is a num- 
ber between 0 and 1. A power close to 0 means the test has almost no chance of 
correctly detecting that Hp is false. A power near | means the test is very likely to 
reject Hp in favor of H, when Hp is false. 

The significance level of a test is the probability of reaching the wrong conclu- 
sion when the null hypothesis is true. The power of a test to detect a specific alter- 
native is the probability of reaching the right conclusion when that alternative is 
true. We can just as easily describe the test by giving the probability of making a 
Type II error (sometimes called {). 
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POWER AND TYPE II ERROR | 


Calculating a Type II error probability or power by hand is possible but 
unpleasant. It’s better to let technology do the work for you. 


ACTIVITY | What Affects the Power of a Test? 


MATERIALS: A virtual basketball player claims to make 80% of his free throws. Suppose that the 
Computer with Internet player is exaggerating—he really makes less than 80% in the long run. We have 
access and projection the computer player shoot 50 shots and record the sample proportion f of made 
capability free throws. We then use the sample result to perform a test of 

Ho:p = 0.80 

Hoe = 0:50 


at the a = 0.05 significance level. How can we increase the power of the test to detect 
that the player is exaggerating? In this Activity, we will use an applet to investigate. 


1. Go to www.amstat.org/publications/jse/v1 1n3/java/Power and select the 
Proportions applet at the bottom of the page. 


2. Adjust the applet settings as follows: choose “One Sample” for the Test, “Less 
Than” for the Alternative, enter 0.8 for pl, 50 for nl, and 0.05 for a. The Null 
distribution should appear in the applet window. 


Power = 0.8009 


Alternative 


Delta = -0.1500 


A 
L 
T 
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3. Let’s assume for now that the virtual player really makes 65% of his free throws 
(p = 0.65). Drag your mouse to the left in the applet screen and watch as an Alter- 
native distribution appears. Keep dragging until the Delta value in the top panel 
shows —0.1500. This sets the alternative parameter value to be 0.15 less than the 
null parameter value of 0.80. Click your mouse to set the Alternative distribution. 
The Power of the test to detect p = 0.65 is shown in the top panel: 0.8009. 


4. Sample size Change the sample size from n = 50 shots to n = 100 shots. 
What happens in the bottom panel of the applet? Does the power increase or 
decrease? Explain why this makes sense. 

5. Significance level Reset the sample size to n = 50. 

(a) Change the significance level to a = 0.01. What happens in the bottom 
panel of the applet? Does the power increase or decrease? 

(b) Make a guess about what will happen if you change the significance level to 
a = 0.10. Use the applet to test your conjecture. 

(c) Explain what the results in parts (a) and (b) tell you about the relationship 
between Type I error probability and Type II error probability. 


6. Difference between null parameter value and alternative parameter value 
Reset the sample size ton = 50 and the significance level to a = 0.05. Will 
we be mote likely to detect that the player is really a 65% shooter or that he is 
really a 70% shooter? Use your mouse to adjust the location of the Alternative 
distribution. How does the power change? Explain why this makes sense. 


Step 5 of the Activity reveals an important link between Type I and Type II 
error probabilities. Because P(Type I error) = a, increasing the significance level 
increases the chance of making a Type I error. As the applet shows, this change 
also increases the power of the test. Because P(‘Type II error) = 1 — Power, higher 
power means a smaller chance of making a Type II error. So increasing the Type I 
error probability a decreases the Type II error probability 3. By the same logic, de- 
creasing the chance of a Type I error results in a higher chance of a Type II error. 

Let’s summarize what the Activity reveals about how to increase the power of a 
significance test to detect when Hp is false and H, is true. 


e Increase the sample size. As Step 4 of the Activity confirms, we get better in- 
formation about the virtual player's free-throw shooting from a random sample 
of 100 shots than from a random sample of 50 shots. Increasing the sample size 
decreases the spread of both the Null and Alternative distributions. This change 
decreases the amount of overlap between the two distributions, making it easier 
to detect a difference between the null and alternative parameter values. 


¢ Increase the significance level a. Using a larger value of a increases the area of 
the green and blue “reject Ho” regions in both the Null and Alternative distribu- 
tions. This change makes it more likely to get a sample proportion that leads us 
to correctly reject the null hypothesis when the shooter is exaggerating. 


e Increase the difference between the null and alternative parameter values 
that is important to detect. Step 6 of the Activity shows that it is easier to 
detect large differences between the null and alternative parameter values 

Manvnanhes whem than smaller differences. The size of difference that is important to detect is 
studies refer to the difference that’s usually determined by experts in the field, so the statistician usually gets little 
important to detect as the effect size. or no input on this factor. 
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In addition to these three factors, we can also gain power by making wise choices 
when collecting data. For example, using blocking in an experiment or stratified 
random sampling can greatly increase the power of a test in some circumstances. 
Our best advice for maximizing the power of a test is to choose as high an a level 
(Type I error probability) as you are willing to risk and as large a sample size as 
you can afford. 


CHECK YOUR UNDERSTANDING 


Refer to the Perfect Potatoes example on page 565. 


1. Which is more serious for the potato-chip producer in this setting: a ‘Type I error or a 
‘Type II error? Based on your answer, would you choose a significance level of a = 0.01, 


0.05, or 0.10? 

2. Tell if each of the following would increase or decrease the power of the test. Justify 
your answers. 

(a) Change the significance level to a = 0.10. 

(b) ‘Take a random sample of 250 potatoes instead of 500 potatoes. 

(c) Insist on being able to detect that p = 0.10 instead of p = 0.11. 


Summary 


e The conditions for performing a significance test of Hp: p = fo are: 


¢ Random: The data were produced by a well-designed random sample or 
randomized experiment. 
© 10%: When sampling without replacement, check that the popula- 
tion is at least 10 times as large as the sample. 


e Large Counts: The sample is large enough to satisfy npy) = 10 and 
n(1— po) = 10 (that is, the expected counts of successes and failures are 
both at least 10). 


e The one-sample z test for a population proportion is based on the test statistic 
P — Po 
po(l — po) 
n 
with P-values calculated from the standard Normal distribution. 


e — Follow the four-step process when you are asked to carry out a significance test: 


STEP A STATE: What hypotheses do you want to test, and at what significance level? 
2 Define any parameters you use. 
PLAN: Choose the appropriate inference method. Check conditions. 


DO: If the conditions are met, perform calculations. 
e¢ Compute the test statistic. 
e = Find the P-value. 


CONCLUDE: Make a decision about the hypotheses in the context of the 
problem. 
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¢ Confidence intervals provide additional information that significance tests 
do not—namely, a set of plausible values for the true population parameter 
p. Atwo-sided test of Ho: p = po at significance level a gives roughly the same 
conclusion as a 100(1 — a)% confidence interval. 


e ‘The power of a significance test against a specific alternative is the probabil- 
ity that the test will reject Hp when the alternative is true. Power measures the 
ability of the test to detect an alternative value of the parameter. For a specific 
alternative, P(Type II error) = 1 — power. 


e There is an important link between the probabilities of Type I and Type II 
error in a significance test: as one increases, the other decreases. 

e We can increase the power of a significance test by increasing the sample 
size, increasing the significance level, or increasing the difference that is im- 
portant to detect between the null and alternative parameter values. 


TECHNOLOGY 
CORNER 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


18. One-proportion z test on the calculator 


Exercises 


In Exercises 31 and 32, check that the conditions for 
carrying out a one-sample z test for the population 
proportion p are met. 


31. 
M555 
wo 


SRS of 200 of the college’s 15,000 living alumni to 
perform a test of Ho: p = 0.99 versus H,: p < 0.99. 


Home computers Refer to Exercise 31. In Jason’s 
Home computers Jason reads a report that says SRS, 41 of the students had a computer at home. 
80% of U.S. high school students have a computer 


; arn Iculate th istic. 
at home. He believes the proportion is smaller than Caloulate Mie test sttusue 


32) 


0.80 at his large rural high school. Jason chooses an 
SRS of 60 students and records whether they have a 
computer at home. 


Walking to school A recent report claimed that 13% 
of students typically walk to school.” DeAnna thinks 
that the proportion is higher than 0.13 at her large 
elementary school, so she surveys a random sample 
of 100 students to find out. 


In Exercises 33 and 34, explain why we aren’t safe 
carrying out a one-sample z test for the population 
proportion p. 


Find the P-value using Table A or technology. Show 
this result as an area under a standard Normal curve. 


Walking to school Refer to Exercise 32. For 
DeAnna’s survey, 17 students in the sample said they 
typically walk to school. 


Calculate the test statistic. 


Find the P-value using Table A or technology. Show 
this result as an area under a standard Normal curve. 


Significance tests A test of Hy: p = 0.5 versus 
H,:p > 0.5 has test statistic z = 2.19. 


33. No test You toss a coin 10 times to perform a test (a) What conclusion would you draw at the 5% signifi- 
of Ho:p = 0.5 that the coin is balanced against cance level? At the 1% level? 
Te! (b) Ifthe alternative hypothesis were H,:p # 0.5, what 
34. No test A college president says, “99% of the alumni conclusion would you draw at the 5% significance 


support my firing of Coach Boggs.” You contact an 


level? At the 1% level? 


40. 
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Significance tests A test of Hy: p = 0.65 against 
H,:p < 0.65 has test statistic z = —1.78. 


What conclusion would you draw at the 5% signifi- 
cance level? At the 1% level? 


If the alternative hypothesis were H,:p # 0.65, what 
conclusion would you draw at the 5% significance 
level? At the 1% level? 


Better parking A local high school makes a change 
that should improve student satisfaction with the 
parking situation. Before the change, 37% of the 
school’s students approved of the parking that was 
provided. After the change, the principal surveys an 
SRS of 200 of the over 2500 students at the school. 
In all, 83 students say that they approve of the new 
parking arrangement. The principal cites this as 
evidence that the change was effective. Perform a test 
of the principal’s claim at the a = 0.05 significance 
level. 


Side effects A drug manufacturer claims that less 
than 10% of patients who take its new drug for treat- 
ing Alzheimer’s disease will experience nausea. ‘To 
test this claim, researchers conduct an experiment. 
They give the new drug to a random sample of 300 
out of 5000 Alzheimer’s patients whose families have 
given informed consent for the patients to participate 
in the study. In all, 25 of the subjects experience 
nausea. Use these data to perform a test of the drug 
manufacturer's claim at the a = 0.05 significance 
level. 


Are boys more likely? We hear that newborn babies 
are more likely to be boys than girls. Is this tue? A 
random sample of 25,468 firstborn children included 
13,173 boys. 


Do these data give convincing evidence that firstborn 
children are more likely to be boys than girls? 


To what population can the results of this study be 
generalized: all children or all firstborn children? 
Justify your answer. 


Fresh coffee People of taste are supposed to 
prefer fresh-brewed coffee to the instant variety. 
On the other hand, perhaps many coffee drinkers 
just want their caffeine fix. A skeptic claims that 
only half of all coffee drinkers prefer fresh-brewed 
coffee. To test this claim, we ask a random sample 
of 50 coffee drinkers in a small city to take part in 
a study. Each person tastes two unmarked cups— 
one containing instant coffee and one containing 
fresh-brewed coffee —and says which he or she 
prefers. We find that 36 of the 50 choose the fresh 
coffee. 


Do these results give convincing evidence that coftee 
drinkers favor fresh-brewed over instant coffee? 


(b) 


aie 


We presented the two cups to each coffee drinker in 
a random order, so that some people tasted the fresh 
coffee first, while others drank the instant coffee first. 


Why do you think we did this? 


Bullies in middle school A University of Illinois 
study on aggressive behavior surveyed a random 
sample of 558 middle school students. When asked 
to describe their behavior in the last 30 days, 445 
students said their behavior included physical ag- 
gression, social ridicule, teasing, name-calling, and 
issuing threats. This behavior was not defined as 
bullying in the questionnaire.' Is this evidence that 
more than three-quarters of middle school students 
engage in bullying behavior? To find out, Maurice 
decides to perform a significance test. Unfortunately, 
he made a few errors along the way. Your job is to 
spot the mistakes and correct them. 


Ho:p = O75 
H,:p > 0.797 


where p = the true mean proportion of middle school 
students who engaged in bullying. 

A random sample of 558 middle school students was 
surveyed. 


558(0.797) = 444.73 is at least 10. 
O75 = O77 


44, 


a 


0.5-0.507 _ 


——$——— = —2.46; P-value = 2(0.0069) = 0.0138 
V'0.797(0.203) 


445 


The probability that the null hypothesis is true is 
only 0.0138, so we reject Ho. This proves that more 
than three-quarters of the school engaged in bullying 
behavior. 

Is this coin fair? The French naturalist Count 
Buffon (1707-1788) tossed a coin +040 times. He 
got 2048 heads. That’s a bit more than one-half. 

Is this evidence that Count Buffon’s coin was not 
balanced? To find out, Luisa decides to perform 

a significance test. Unfortunately, she made a few 
errors along the way. Your job is to spot the mistakes 
and correct them. 


Ho: > 0.5 

lee = 0.5 
10%: 4040(0.5) = 2020 and 4040(1 — 0.5) = 2020 
are both at least 10. 


Large Counts: There are at least 40,400 coins in the 
world. 


= —0.89; P-value = 1 — 0.1867 = 0.8133 
0.5(0.5) 


4040 
Reject Ho because the P-value is so large and 
conclude that the coin is fair. 
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Teen drivers A state’s Division of Motor Vehicles 
(DMV) claims that 60% of teens pass their driv- 
ing test on the first attempt. An investigative 
reporter examines an SRS of the DMV records 
for 125 teens; 86 of them passed the test on their 
first try. Is there convincing evidence at the a = 
0.05 significance level that the DMV’s claim is 


incorrect? 


We want to be rich In a recent year, 73% of first- 
year college students responding to a national survey 
identified “being very well-off financially” as an 
important personal goal. A state university finds that 
132 of an SRS of 200 of its first-year students say that 
this goal is important. Is there convincing evidence 
at the a = 0.05 significance level that the proportion 
of all first-year students at this university who think 
being very well-off is important differs from the 
national value, 73%? 


Teen drivers Refer to Exercise 45. 


Construct and interpret a 95% confidence 
interval for the proportion of all teens in the state 
who passed their driving test on the first attempt. 


Explain what the interval in part (a) tells you about 
the DMV’s claim. 


We want to be rich Refer to Exercise 46. 


Construct and interpret a 95% confidence interval 
for the true proportion p of all first-year students at 
the university who would identify being well-off as an 
important personal goal. 


Explain what the interval in part (a) tells you about 
whether the national value holds at this university. 


Do you Tweet? In early 2012, the Pew Internet 
and American Life Project asked a random sample 
of U.S. adults, “Do you ever . . . use ‘Twitter or 
another service to share updates about yourself or 
to see updates about others?” According to Pew, 
the resulting 95% confidence interval is (0.123, 
0.177).'* Does this interval provide convincing 
evidence that the actual proportion of U.S. adults 
who would say they use ‘Twitter differs from 0.16? 
Justify your answer. 


Losing weight A Gallup Poll found that 59% of the 
people in its sample said “Yes” when asked, “Would 
you like to lose weight?” Gallup announced: “For 
results based on the total sample of national adults, 
one can say with 95% confidence that the margin of 
(sampling) error is +3 percentage points.”'* Does 
this interval provide convincing evidence that the 
actual proportion of U.S. adults who would say they 
want to lose weight differs from 0.55? Justify your 
answer. 


Sule 


Teens and sex ‘The Gallup Youth Survey asked a 
random sample of U.S. teens aged 13 to 17 whether 
they thought that young people should wait to have 
sex until marriage.’” The Minitab output below 
shows the results of a significance test and a 95% 
confidence interval based on the survey data. 


& Session 


= 
@) 
— 


Test and Cl for One Proportion 
Test of p = 0.5 vs p not = 0.5 


N Sample Z-Value P-Value 


Sample x Pp 95% CI 
1 246 439 0.560364 (0.513935, 0.606794) 2.53 0.012 


< 


Define the parameter of interest. 


Check that the conditions for performing the signifi- 
cance test are met in this case. 


Interpret the P-value in context. 


Do these data give convincing evidence that the 
actual population proportion differs from 0.5? Justify 
your answer with appropriate evidence. 


Reporting cheating What proportion of students 
are willing to report cheating by other students? A 
student project put this question to an SRS of 172 
undergraduates at a large university: “You witness 
two students cheating on a quiz. Do you go to the 
professor?” The Minitab output below shows the 
results of a significance test and a 95% confidence 
interval based on the survey data.!° 


Test and Cl for One Proportion 


Test of p = 0.15 vs p not = 0.15 


Sample x N Sample p 958 CI Z-Value P-Value 
1 19 172 0.110465 (0.063619, 0.157312) -1.45 0.146 


< 


Define the parameter of interest. 


Check that the conditions for performing the signifi- 
cance test are met in this case. 


Interpret the P-value in context. 


Do these data give convincing evidence that the ac- 
tual population proportion differs from 0.15? Justify 
your answer with appropriate evidence. 


Better parking Refer to Exercise 39. 


Describe a ‘Type I error and a ‘Type II error in this 
setting, and explain the consequences of each. 


The test has a power of 0.75 to detect that p = 0.45. 
Explain what this means. 


Identify two ways to increase the power in part (b). 


56. 


bis 


58. 


Section 9.2 Tests about a Population Proportion ve We 


Side effects Refer to Exercise 40. 


Describe a ‘Type I error and a Type II error in this 
setting, and explain the consequences of each. 


The test has a power of 0.54 to detect that p = 0.07. 
Explain what this means. 


Identify two ways to increase the power in part (b). 


Error probabilities You read that a statistical test at 
significance level a = 0.05 has power 0.78. What are 
the probabilities of Type I and Type II errors for this 
test? 


Error probabilities You read that a statistical test 
at the a = 0.01 level has probability 0.14 of making 
a Type II error when a specific alternative is true. 
What is the power of the test against this alternative? 


Power A drug manufacturer claims that fewer than 
10% of patients who take its new drug for treating 
Alzheimer’s disease will experience nausea. ‘To test 
this claim, a significance test is carried out of 


Hy:p = 0.10 
H,:p < 0.10 


You learn that the power of this test at the 5% signifi- 
cance level against the alternative p = 0.08 is 0.29. 


Explain in simple language what “power = 0.29” 
means in this setting. 


You could get higher power against the same 
alternative with the same a by changing the 
number of measurements you make. Should you 
make more measurements or fewer to increase 
power? Explain. 


If you decide to use a = 0.01 in place of a = 0.05, 
with no other changes in the test, will the power 
increase or decrease? Justify your answer. 


If you shift your interest to the alternative p = 0.07 
with no other changes, will the power increase or 
decrease? Justify your answer. 


What is power? You manufacture and sell a liquid 
product whose electrical conductivity is supposed to 
be 5. You plan to make six measurements of the con- 
ductivity of each lot of product. If the product meets 
specifications, the mean of many measurements will 
be 5. You will therefore test 


Ho: = 5 
Hy:w#5 


If the true conductivity is 5.1, the liquid is not suit- 
able for its intended use. You learn that the power 

of your test at the 5% significance level against the 

alternative pp = 5.1 is 0.23. 


(d) 


Explain in simple language what “power = 0.23” 
means in this setting. 


You could get higher power against the same alterna- 
tive with the same a by changing the number of 
measurements you make. Should you make more 
measurements or fewer to increase power? 


If you decide to use a = 0.10 in place of a = 0.05, 
with no other changes in the test, will the power 
increase or decrease? Justify your answer. 


If you shift your interest to the alternative pu = 5.2, 
with no other changes, will the power increase or 
decrease? Justify your answer. 


Multiple choice: Select the best answer for Exercises 59 
to 62. 


5o! 


After once again losing a football game to the 
archrival, a college’s alumni association conducted 
a survey to see if alumni were in favor of firing the 
coach. An SRS of 100 alumni from the population 
of all living alumni was taken, and 64 of the alumni 
in the sample were in favor of firing the coach. Sup- 
pose you wish to see if a majority of living alumni 
are in favor of firing the coach. The appropriate test 
statistic is 


0.64 — 0.5 0.64 — 0.5 
@) = @ = 
0.64(0.36) /0.64(0.36) 
100 64 
.64 — 0. 0.5 — 0.64 
(bee 0.64 = 0.5 _ eee 
0.64(0.36) 0.5(0.5) 
100 100 
0.64 — 0.5 
(jez — 
0.5(0.5) 
100 
60. Which of the following is not a condition for 


performing a significance test about a population 
proportion p? 


The data should come from a random sample or 
randomized experiment. 


Both npp and n(1 — po) should be at least 10. 


If you are sampling without replacement from a 
finite population, then you should sample no more 
than 10% of the population. 


The population distribution should be approximately 
Normal, unless the sample size is large. 


All of the above are conditions for performing a 
significance test about a population proportion. 
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. The z statistic for a test of Hp: p = 0.4 versus 


Hy:p # 0.4 is z = 2.43. This test is 


Describe the shape, center, and spread of the 
distribution of the random variable X — Y. What is 
the importance of this random variable to the CD 


(a) not significant at either a = 0.05 or a = 0.01. see ietreg? 

(b) significant at a = 0.05 but not at a = 0.01. (b) Compute the probability that a randomly selected 

(c) significant at a = 0.01 but not at a = 0.05. CD will fit inside a randomly selected case. 

(d) significant at both a = 0.05 and a = 0.01. (c) The production process actually runs in batches of 
, Fi 100 CDs. If each of these CDs is paired with a ran- 

(e) inconclusive because we don’t know the value of p. domly che cn plete cre Gadltne probability dict 

62. Which of the following 95% confidence intervals all the CDs fit in their cases. 


(a) (0.19, 0.27) 
(b) (0.24, 0.30) 


would lead us to reject Hp: p = 0.30 in favor of 
Hy: p # 0.30 at the 5% significance level? 


(c) (0.27, 0.31) 
(d) (0.29, 0.38) 


Packaging CDs (6.2, 5.3) A manufacturer of 
compact discs (CDs) wants to be sure that their CDs 
will fit inside the plastic cases they have bought 

for packaging. Both the CDs and the cases are 
circular. According to the supplier, the plastic cases 
vary Normally with mean diameter jz = 4.2 inches 
and standard deviation o = 0.05 inches. The CD 
manufacturer decides to produce CDs with mean 
diameter jz = 4 inches. Their diameters follow a 
Normal distribution with a = 0.1 inches. 


(e) None of these 


Let X = the diameter of a randomly selected CD 
and Y = the diameter of a randomly selected case. 


ez 


Cash to find work? (4.2) Will cash bonuses speed 
the return to work of unemployed people? The II- 
linois Department of Employment Security designed 
an experiment to find out. The subjects were 10,065 
people aged 20 to 54 who were filing claims for 
unemployment insurance. Some were offered $500 
if they found a job within 11 weeks and held it for at 
least 4 months. Others could tell potential employers 
that the state would pay the employer $500 for hiring 
them. A control group got neither kind of bonus.'” 


Describe a completely randomized design for this 
experiment. 


How will you label the subjects for random assign- 
ment? Use ‘Table D at line 127 to choose the first 3 
subjects for the first treatment. 


Explain the purpose of a control group in this setting. 


Tests about a Population Mean 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e State and check the Random, 10%, and Normal/Large e Use aconfidence interval to draw a conclusion for a 


two-sided test about a population parameter. 
Perform a significance test about a mean difference 
using paired data. 


Sample conditions for performing a significance test 
about a population mean. 


Perform a significance test about a population mean. 


Confidence intervals and significance tests for a population proportion p are 
based on z-values from the standard Normal distribution. Inference about 
a population mean 4 uses a t distribution with n — 1 degrees of freedom, 
except in the rare case when the population standard deviation o is known. 
We learned how to construct confidence intervals for a population mean in 
Section 8.3. Now we'll examine the details of testing a claim about a popula- 
tion mean J. 
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Carrying Out a Significance Test for pz 


In an earlier example, a company claimed to have developed a new AAA battery 
that lasts longer than its regular AAA batteries. Based on years of experience, the 
company knows that its regular AAA batteries last for 30 hours of continuous use, 
on average. An SRS of 15 new batteries lasted an average of 33.9 hours with a 
standard deviation of 9.8 hours. Do these data give convincing evidence that the 
new batteries last longer on average? To find out, we perform a test of 


Ho: u = 30 hours 
H,: 2 > 30 hours 


where yu is the true mean lifetime of the new deluxe AAA batteries. 


Conditions In Chapter 8, we introduced conditions that should be met be- 
fore we construct a confidence interval for a population mean: Random, 10% 
when sampling without replacement, and Normal/Large Sample. These same 
three conditions must be verified before performing a significance test about a 
population mean. 

As in the previous chapter, the Normal/Large Sample condition for means is 


population distribution is Normal or sample size is large (n = 30) 


We often don’t know whether the population distribution is Normal. But if the 
sample size is large (n = 30), we can safely carry out a significance test. If the 
sample size is small, we should examine the sample data for any obvious depar- 
tures from Normality, such as strong skewness and outliers. 


CONDITIONS FOR PERFORMING A 
SIGNIFICANCE TEST ABOUT A MEAN 


¢ Random: The data come from a well-designed random sample or ran- 

domized experiment. 
1 

° 10%: When sampling without replacement, check that n = 10 

e¢ Normal/Large Sample: The population has a Normal distribution or 
the sample size is large (n = 30). If the population distribution has un- 
known shape and n < 30, use a graph of the sample data to assess the 
Normality of the population. Do not use t procedures if the graph shows 
strong skewness or outliers. 


Here’s an example that shows how to check the conditions. 


Better Batteries 


Checking conditions 


Figure 9.1] on the next page shows a dotplot, boxplot, and Normal probability plot 
of the battery lifetimes for an SRS of 15 batteries. 
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FIGURE 9.11 (a) A dotplot, (b) a boxplot, and (c) a Normal probability plot of the lifetimes of a 
simple random sample of 15 AAA batteries. 


PROBLEM: Check the conditions for carrying out a significance test of the company’s claim 
about its deluxe AAA batteries. 


SOLUTION: 
Random: The company tested a simple random sample of 15 new AAA batteries. 


° 10%: Because the batteries are being sampled without replacement, we need to check that 
there are at least 10(15) = 150 new AAA batteries. This seems reasonable to believe. 


Normal/Large Sample: We don't know if the population distribution of battery lifetimes for the 
company’s new AAA batteries is Normal. With such a small sample size (n = 15), we need to graph the 
data to look for any departures from Normality. The dotplot and boxplot show slight right-skewness but 
no outliers. The Normal probability plot is fairly linear. Because none of the graphs shows any strong 
skewness or outliers, we should be safe performing a test about the population mean lifetime ju. 


For Practice Try Exercise 


There is a small number of real-world 
situations in which we might know the 
population standard deviation o. When 
this is the case, the test statistic 


2 X — Mo 
o/Vn 


will follow a standard Normal 
distribution if the Normal/Large Sample 
condition is met. If so, then we can 
calculate P-values using Table A or 
technology. The TI-83/84 and TI-89’s 
Z-Test option in the TESTS menu is 
designed for this special situation. 


Calculations: Test Statistic and P-Value When performing a signifi- 
cance test, we do calculations assuming that the null hypothesis Hp is true. The 
test statistic measures how far the sample result diverges from the parameter value 
specified by Ho, in standardized units. As before, 


statistic — parameter 


test statistic = — at 
standard deviation of statistic 


For a test of Ho: ju = Uo, our statistic is the sample mean x. Its standard deviation is 
Oz = ae 
In an ideal world, our test statistic would be 
_ x7 Ho 
a/Vn 
Because the population standard deviation o is usually unknown, we use the 


sample standard deviation s, in its place. The resulting test statistic has the stan- 
dard error of x in the denominator 


s,/Vn 
As we saw earlier, when the Normal/Large Sample condition is met, this statistic 
has approximately a ¢ distribution with n — 1 degrees of freedom. 


t 


t distribution 
with df = 14 
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In Section 8.3, we used Table B to find critical values from the t distributions 
when constructing confidence intervals about an unknown population mean p. 
Once we have calculated the test statistic, we can use ‘lable B to find the P-value 
for a significance test about ju. The following example shows how this works. 


Better Batteries 


Computing the test statistic and P-value 


The battery company wants to test Ho: 4 = 30 versus H,: 4 > 30 based on an SRS 
of 15 new AAA batteries with mean lifetime x = 33.9 hours and standard deviation 
s, = 9.8 hours. The test statistic is 


statistic — parameter 


test statistic = 
standard deviation of statistic 


tater 
3/Vn  9.8/V15 


t ere 


The P-value is the probability of getting a result this large 
or larger in the direction indicated by H,, that is, P(t = 
1.54). Figure 9.12 shows this probability as an area under 
the ¢ distribution curve with df = 15 — 1 = 14. We can 
find this P-value using Table B. 


Go to the df = 14 row. The Upper-talli probability p 


P-value = t statistic falls between the | 9 -10 05 — .025 
P(t2 1.54) values 1.345 and 1.761. If | 18 1.350 1.771 2.160 
you look at the top of the | 14 1.345 1.761 2.145 


corresponding columns in | 15 1.341 1.753. 2.131 
Table B, you ll find that the 80% 90% 95% 


“Upper-tail probability p” is 
between 0.10 and 0.05. (See 
the excerpt from Table B at right.) Because we are looking 
for P(t = 1.54), this is the probability we seek. That is, the 


Confidence level C 


FIGURE 9.12 The P-value for a one-sided test with t= 1.54. P-value for this test is between 0.05 and 0.10. 


As you can see, Table B gives an interval of possible P-values for a significance 
test. We can still draw a conclusion from the test in much the same way as if 
we had a single probability. In the case of the new AAA batteries, for instance, we 
would fail to reject Ho: 4 = 30 because the P-value exceeds our default a = 0.05 
significance level. We don’t have convincing evidence that the company’s new 
AAA batteries last longer than 30 hours, on average. 

‘Table B has other limitations for finding P-values. It includes probabilities only for 
t distributions with degrees of freedom from 1 to 30 and then skips to df = 40, 50, 60, 
80, 100, and 1000. (The bottom row gives probabilities for df = %, which corresponds 
to the standard Normal distribution.) Also, ‘Table B shows probabilities only for posi- 
tive values of t. To find a P-value for a negative value of t, we use the symmetry of the 
t distributions. The next example shows how we deal with both of these issues. 
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df 
29 
30 
40 


Upper-tail probability p 
005 .0025 .001 
2.756 3.038 3.396 
2.704 2.971 3.307 
99% 99.5% 99.8% 
Confidence level C 


Two-Sided Tests, Negative f-Values, 
and More 


Using Table B wisely 


What if you were performing a test of Ho: 4 = 5 versus H,: 1 # 5 based on a 
sample size of n = 37 and obtained t = —3.17? Because this is a two-sided test, 
you are interested in the probability of getting a value of t less than or equal to 
—3.17 or greater than or equal to 3.17. Figure 9.13 shows the desired P-value as 
an area under the t¢ distribution curve with 36 degrees of freedom. Notice that 
P(t $3.17) = P(t = 3.17) due to the symmetric shape of the density curve. ‘Table B 
shows only positive t-values, so we will focus on t = 3.17. 


P-value = area 
in both tails 


t=-3.17 


FIGURE 9.13 The P-value for a two-sided test with t = —3.17. 


Because df = 37 — 1 = 36 is not available on the table, use df = 30. You might be 
tempted to use df = 40, but doing so would result in a smaller P-value than you are 
entitled to with df = 36. (In other words, you’d be cheating!) Move across the 
df = 30 row, and notice that t = 3.17 falls between 3.030 and 3.385. The corre- 
sponding “Upper-tail probability p” is between 0.001 and 0.0025. (See the excerpt 
from ‘Table B.) For this two-sided test, the corresponding P-value would be be- 
tween 2(0.001) = 0.002 and 2(0.0025) = 0.005. 


One point from the example deserves repeating: if the df you need isn’t 4 
provided in Table B, use the next lower df that is available. It’s no fair 
“rounding up” to a larger df. This is like pretending that your sample size 
is larger than it really is. Doing so would give you a smaller P-value than is true 
and would make you more likely to incorrectly reject Hp when it’s true (make a 
‘Type I error). 

Given the limitations of Table B, our advice is to use technology to find P-values 
when carrying out a significance test about a population mean. 
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TEcHNoLogy COMPUTING P-VALUES FROM 
CORNER t DISTRIBUTIONS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


You can use the tcdf command on the TI-83/84 and T1-89 to calculate areas under a ¢ distribution curve. ‘The syntax 
is tcdf (lower bound,upper bound,df). 


Let’s use the ted£ command to compute the P-values from the last two examples. 


Better batteries: ‘Vo find P(t = 1.54), 
TI-83/84 TL-89 


e Press[2nd]/VARS] (DISTR) and choose tcdf (. In the Stats/List Editor, press [F5](Distr) and 
OS 2.55 or later: In the dialog box, enter these choose t Cdf.... 


values: lower:1.54, upper: 10000, df:14, In the dialog box, enter these values: Lower val - 
choose Paste, and then press [ENTER |. Older OS: ue:1.54, Upper value:10000, Deg of 
Complete the command tcdf (1.54,10000,14) Freedom, df£:14, and then choose | ENTER]. 
and press [ENTER |. 
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tcedf(1.54,10000,14) 
- 8729268628 


Low Yq1 
Ur "91 
dt 


zscores[(10]=-.87 789629063... 
MAIN FAD AUTO FUNC a9 


Two-sided test: ‘To find the P-value for the two-sided test with df = 36 and t = —3.17, do ted£ (-10000,-3.17, 36) 
and multiply the result by 2. 


CHECK YOUR UNDERSTANDING 


The makers of Aspro brand aspirin want to be sure that their tablets contain the right 
amount of active ingredient (acetylsalicylic acid). So they inspect a random sample of 
36 tablets from a batch in production. When the production process is working prop- 
erly, Aspro tablets have an average of 4 = 320 milligrams (mg) of active ingredient. The 
amount of active ingredient in the 36 selected tablets has mean 319 mg and standard 
deviation 3 mg. 

1. State appropriate hypotheses for a significance test in this setting. 

2. Check that the conditions are met for carrying out the test. 

3. Calculate the test statistic. Show your work. 


4. Use Table B to find the P-value. Then use technology to get a more accurate result. 
What conclusion would you draw? 


The One-Sample ¢ Test 


When the conditions are met, we can test a claim about a population mean ju 
using a one-sample ¢ test for a mean. Here are the details. 
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ONE-SAMPLE ¢t TEST FOR A MEAN 


Suppose the conditions are met. To test the hypothesis Ho: 4 = j49, compute 
the one-sample ¢ statistic 


_ X= bo 


on s,/Vn 


Find the P-value by calculating the probability of getting a ¢ statistic this large 
or larger in the direction specified by the alternative hypothesis H, ina t 
distribution with df = n- 1: 


Ay: > Uo Ay: M< Mo Ay: MA Mo 


Now we are ready to test a claim about an unknown population mean. Once 
again, we follow the four-step process. 


Healthy Streams 


Performing a significance test about ju 


The level of dissolved oxygen (DO) in a stream or river is an important in- 
dicator of the water’s ability to support aquatic life. A researcher measures 
the DO level at 15 randomly chosen locations along a stream. Here are 
the results in milligrams per liter (mg/l): 


4.53 5.04 3.29 5.23 4.13 5.50 4.83 4.40 
5.42 6.38 4.01 4.66 2.87 5.73 5.55 


A dissolved oxygen level below 5 mg/l puts aquatic life at risk. 


PROBLEM: 


(a) Dowe have convincing evidence at the a = 0.05 significance level that aquatic life in this 
stream is at risk? 


(b) Given your conclusion in part (a), which kind of mistake—a Type | error ora Type Il error—could 
you have made? Explain what this mistake would mean in context. 

SOLUTION: 

(a) We will follow the four-step process. 

STATE: We want to test a claim about the true mean dissolved oxygen level ju in this stream at 


the a = 0.05 level. Our hypotheses are 
Ho 2 lg) 
Ee u<2 


AP® EXAM TIP It is not 
enough just to make a graph 
of the data on your calculator 
when assessing Normality. 


You must sketch the graph 
on your paper to receive 
credit. You don’t have to 
draw multiple graphs—any 
appropriate graph will do. 
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PLAN: Ifthe conditions are met, we should do a one-sample ttest for ju. 


° Random: The researcher measured the DO level at 15 randomly chosen locations. 


° 10%: There is an infinite number of possible locations along the stream, so it isn't necessary to 
check the 10% condition. 


* Normal/Large Sample: We don't know whether the population distribution of DO levels at all points 
along the stream is Normal. With such a small sample size (n = 15), we need to graph the data to see 
if it’s safe to use t procedures. Figure 9.14 shows our hand sketches of a calculator histogram, 
boxplot, and Normal probability plot for these data. The histogram looks roughly symmetric; the 
boxplot shows no outliers; and the Normal probability plot is fairly linear. With no outliers or strong 
skewness, the t procedures should be pretty accurate. 


(a) 


ee SiO irene 8 


DO level (mg/l) 


DO level (mg/l) 
(b) (c) 


DO level (mg/l) 


FIGURE 9.14 Sketches of (a) a histogram, (b) a boxplot, and (c) a Normal probability plot for the 
dissolved oxygen (DO) readings in the sample, in mg/l. 


0 


Values of t 


r= -0.94 


FIGURE 9.15 The P-value for a one-sided test when 


t= -0.94. 


DO: We entered the data into our calculator and did 1-Var Stats (see 
screen shot). The sample meanis x = 4.771 and the sample standard 
deviation is 5, = 0.9396. 


NORMAL FLOAT AUTO REAL RADIAN CL f 


x=4. 771333333 
=x=71.57 
=x?=353. 8441 
Sx=. 9395961645 
ox=. 907736134 
n=15 

minxX=2. 87 
401=4.13 


° Test statistic 
X—[p 4771-5 
5,/Vn  0.9396/\/15 


¢ P-value The P-value is the area to the left of t = —0.94 under the 
tdistribution curve with degrees of freedom df = 15-1 = 14. 
Fete 9.15 shows this probability. df 25 20 415 
Using the table: Table B shows only areas in the upper tail of the a ae er 
distribution. Because the t distributions are symmetric, , : a 

P(t <—0.94) = P(t = 0.94). Search the df= 14 rowofTable | 14 
B for entries that bracket t = 0.94 (see the excerpt at right). 15 691 =—.866—- 1.074 
Because the observed tlies between 0.868 and 1.076, the 50% 60% 70% 
P-value lies between 0.15 and 0.20. Confidence level C 


5 = — 0.94 


Upper-tail probability p 
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Using technology: We can find the exact P-value using a calculator: tcdf£ (lower: —100, 
upper:—0.94, d—£: 14) =0.1816. 

CONCLUDE: Because the P-value, 0.1816, is greater than our c = 0.05 significance level, 

we fail to reject: Ho. We don’t have convincing evidence that the mean DO level in the stream is less 
than 5 mg/l. 

(b) Because we decided not to reject Hp in part (a), we could have made a Type ll error (failing to 
reject Ho when Hp is false). If we did, then the mean dissolved oxygen level ju in the stream is actually 
less than 5 mg/l, but we didn’t find convincing evidence of that with our significance test. 


For Practice Try Exercise 


Because the t procedures are so common, all 


@ Session 
statistical software packages will do the calculations for 


“A 
One-Sample T: DO (mg/L) you. Figure 9.16 shows the output from Minitab for the 
» one-sample ¢ test in the previous example. Note that the 
Test of m= 5 va < 5 ~ results match! 
Gites & tes Ree en YS p You can also use your calculator to carry out a one- 
DO (mg/L) 15 4.771 0.940 0.243 -0.94 0.181 sample t test. But be sure to read the AP® exam tip at the 
¥ end of the Technology Corner. 
3 il ual .:: 


FIGURE 9.16 Minitab output for the one-sample ¢ test from the 
dissolved oxygen example. 


ONE-SAMPLE ¢ TEST FOR A MEAN ON 


20. TECHNOLOGY 
CORNER THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


You can perform a one-sample t test using either raw data or summary statistics on the ‘T1-83/84 or T1-89. Let’s use the 
calculator to carry out the test of Ho: pp = 5 versus Hy: pu < 5 from the dissolved oxygen example. Start by entering the 
sample data in LI/listl. Then, to do the test: 


TI-83/84 TI-89 


e Press[STat|, choose TESTS and T-Test. e Press [2nd]/F1] ({F6]) and choose T-Test. 
e Adjust your settings as shown. 


e Adjust your settings as shown. 


(FE ETE 
- 
.F ———— 
Inet :BERE) Stats ————— | 
ue:S : 
Lastibs x — 
Frea:1 
true due Alternate Hyrs we BOD 
Color: MLS S.| Results: CatcuTate + 
Calculate Draw = 
lis b J= 
USE € AND > TO OPEN CHOICES 
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If you select “Calculate,” the following screen appears: 


NORMAL FLOAT AUTO REAL RADIAN CL f ee 
atts T Test al 


iT—-Test| 
Huts 
t=~-. 9425562016 
P=.1809448972 
x=4. 771333333 
Sx=. 9395961645 
n=15 


The test statistic is ¢ = —0.94 and the P-value is 0.1809. 
If you specify “Draw,” you see a t distribution curve (df = 14) with the lower tail shaded. 
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RIN RAD AUTO FUNC 


Note: If you are given summary statistics instead of the original data, you would select the option “Stats” instead of 
“Data” in the first screen. 


AP® EXAM TIP Remember: if you just give calculator results with no work, and one or more values are 
wrong, you probably won’t get any credit for the “Do” step. If you opt for the calculator-only method, name the 
procedure (test) and report the test statistic (t = —0.94), degrees of freedom (df = 14), and P-value (0.1809). 


CHECK YOUR UNDERSTANDING 


A college professor suspects that students at his school are getting less than 8 hours of sleep 
a night, on average. To test his belief, the professor asks a random sample of 28 students, 
“How much sleep did you get last night?” Here are the data (in hours): 


96868 86656794345 61163661078 4597 7 


Do these data provide convincing evidence at the a = 0.05 significance level in support 
of the professor’s suspicion? 


Two-Sided Tests and Confidence Intervals 


Now let’s look at an example involving a two-sided test. 


Juicy Pineapples 
A two-sided test LL 


At the Hawaii Pineapple Company, managers are interested in the sizes of the 
pineapples grown in the company’s fields. Last year, the mean weight of the pine- 
apples harvested from one large field was 31 ounces. A different irrigation system 
was installed in this field after the growing season. Managers wonder how this 
change will affect the mean weight of future pineapples grown in the field. To 
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find out, they select and weigh a random sample of 50 pineapples from this year’s 
crop. The Minitab output below summarizes the data. 


Descriptive Statistics: Weight (0z) 


Variable N Mean SE Mean StDev Minimum Ql Median Q3 Maximum 
Weight (oz) 50 31.935 0.3319 2.394 26.491 29.990 31.739 34.115 35.547 


PROBLEM: 

(a) Do these data give convincing evidence that the mean weight of pineapples produced in the field 
has changed this year? 

(b) Canwe conclude that the different irrigation system caused a change in the mean weight of 
pineapples produced? Explain your answer. 


SOLUTION: 

(a) STATE: We want to perform a test of 
Hp: 6 = 31 
H,: 2 31 


where {1 = the mean weight (in ounces) of all pineapples grown in the field this year. Because no 
significance level is given, we'll use a. = 0.05. 


PLAN: Ifthe conditions are met, we should conduct a one-sample t test for ju. 

* Random: The data came froma random sample of 50 pineapples from this year's crop. 
0 10%: There need to be at least 10(50) = 500 pineapples in the field because managers are 
sampling without replacement. We would expect many more than 500 pineapples in a “large field.” 

* Normal/Large Sample: We don't know whether the population distribution of pineapple weights this year is 

Normally distributed. But n = 50 = 30, 50 the large sample size makes it OK to use t procedures. 

DO: From the Minitab output, x = 31.935 ounces and 

5, = 2.394 ounces. 


t distribution, 
49 degrees of 
freedom 


° Test statistic 


X— po 31.935 — 31 
5,/Vn  2.394/\/50 


* P-value Figure 9.17 displays the P-value for this two-sided test as 
an area under the t distribution curve with 50-1 = 49 degrees of 


C= = 2.762 


P-value = 0.0081 


freedom. 
Using the table: Table B doesn’t have an entry for df = 49, so we have 
<< _ values of + ———/ to use the more conservative df = 40. As the excerpt below shows, 
T= 2762 P= 2762 the upper-tail probability is between 0.0025 and 0.005. So the 


desired P-value for this two-sided test is between 2(0.0025) = 
FIGURE 9.17 The P-value for the two-sided test with t= 2.762. 0.005 and 2(0.005) = 0.01. 


Upper-tail probability p Using technology: The calculator’s T-Test command gives t = 2.762 and P-value 0.0081 using df = 49. 


Se ee CONCLUDE: B he P-value, 0.0081, is less than a = 0.05, we reject Hy.Weh 
30 2.750 3.030 3.385 : Because the P-value, O. , 1s less than 7 = 0.009, we reject Ho. We have con- 


a “2.704 2.971. 4407 vincing evidence that the mean weight of the pineapples grown this year is not 31 ounces. 

, (b) No. This was not a comparative experiment, so we cannot infer causation. It is possible that other 
things besides the irrigation system changed from last year’s growing season. Maybe the weather was 
different this year, and that’s why the pineapples have a different mean weight than last year. 


50 2.678 2.937 3.261 
99% 99.5% 99.8% 
Confidence level C 


For Practice Try Exercise 


THINK 
ABOUT IT 
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The significance test in the previous example gives convincing evidence that 
the mean weight ju of the pineapples grown in the field this year differs from last 
year’s 31 ounces. Unfortunately, the test doesn’t give us an idea of what the actual 
value of yu is. For that, we need a confidence interval. 


Juicy Pineapples 
Confidence intervals give more information 


Minitab output for a significance test and confidence interval based on the pine- 
apple data is shown below. The test statistic and P-value match what we got earlier 
(up to rounding). 


One-Sample T: Weight (0z) 


Test of mu = 31 vs not = 31 
Variable N Mean StDev SE Mean 95% CI T Pp 
Weight (oz) 50 31.935 2.394 0.339 (31.255, 32.616) 2.76 0.008 


The 95% confidence interval for the mean weight of all the pineapples grown in 
the field this year is 31.255 to 32.616 ounces. We are 95% confident that this inter- 
val captures the true mean weight ju of this year’s pineapple crop. 


As with proportions, there is a link between a two-sided test at significance level 
a and a 100(1 — a)% confidence interval for a population mean ju. For the pine- 
apples, the two-sided test at a = 0.05 rejects Ho: ps = 31 in favor of H,: uw # 31. The 
corresponding 95% confidence interval does not include 31 as a plausible value of 
the parameter ju. In other words, the test and interval lead to the same conclusion 
about Ho. But the confidence interval provides much more information: a set of 
plausible values for the population mean. 


‘The connection between two-sided tests and confidence intervals is even stron- 
ger for means than it was for proportions. That’s because both inference methods 
for means use the standard error of x in the calculations. 


x — Ho ’ . 8 
Confidence interval: x + t*— 
s,/Wn Vn 
When the two-sided significance test at level a rejects Ho: pp = uo, the 100(1 — a)% 
confidence interval for jz will not contain the hypothesized value jp. And when 
the test fails to reject the null hypothesis, the confidence interval will contain juo. 


test statistic: t = 


Isthereaconnection between one-sided tests and confidence 
intervals for a population mean? As you might expect, the answer is 
yes. But the link is more complicated. Consider a one-sided test of Ho: 4 = 10 
versus H,: 4 > 10 based on an SRS of 30 observations. With df = 30 — 1 = 29, 
Table B says that the test will reject Hp at a = 0.05 if the test statistic t is greater 
than 1.699. For this to happen, the sample mean ¥ would have to exceed up = 10 
by more than 1.699 standardized units. 

Table B also shows that t* = 1.699 is the critical value for a 90% confidence 
interval. That is, a 90% confidence interval will extend 1.699 standardized units 
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on either side of the sample mean x. If X exceeds 10 by more than 1.699 standard- 
ized units, the resulting interval will not include 10. And the one-sided test will 
reject Ho: 4 = 10. There’s the link: our one-sided test at a = 0.05 gives the same 
conclusion about Ho as a 90% confidence interval for ju. 


OR 


CHECK YOUR UNDERSTANDING 


The health director of a large company is concerned about the effects of stress on the 
company’s middle-aged male employees. According to the National Center for Health 
Statistics, the mean systolic blood pressure for males 35 to 44 years of age is 128. The 
health director examines the medical records of a random sample of 72 male employees 
in this age group. The Minitab output displays the results of 
a significance test and a confidence interval. 


1. Do the results of the significance test give convincing 
. evidence that the mean blood pressure for all the company’s 
Test of mu = 128 vs not = 128 ~ middle-aged male employees differs from the national aver- 
N Mean StDev SE Mean 95% CI t P age? Justify your answer. 

Ie ic STR NEZBRSS) SSSERR) SeaG! 105279 2. Interpret the 95% confidence interval in context. 
¥ — Explain how the confidence interval leads to the same 
~ conclusion as in Question 1. 


One-Sample T 


4) sil 


Iv 


Inference for Means: Paired Data 


Study designs that involve making two observations on the same individual, or one 
observation on each of two similar individuals, yield paired data. When paired 
data result from measuring the same quantitative variable twice, we can make 
comparisons by analyzing the differences in each pair. If the conditions for infer- 
ence are met, we can use one-sample t procedures to perform inference about the 
mean difference jug. (These methods are sometimes called paired t procedures. ) 
An example should help illustrate what we mean. 


Is Caffeine Dependence Real? “"4 
Paired data and one-sample t procedures L. 


Sm _] Researchers designed an experiment to study the effects of caffeine 
withdrawal. They recruited 11 volunteers who were diagnosed as being caf- 
feine dependent to serve as subjects. Each subject was barred from coffee, 
colas, and other substances with caffeine for the duration of the experiment. 
During one 2-day period, subjects took capsules containing their normal caf- 
feine intake. During another 2-day period, they took placebo capsules. The 
order in which subjects took caffeine and the placebo was randomized. At 
the end of each 2-day period, a test for depression was given to all 11] subjects. 
Researchers wanted to know whether being deprived of caffeine would lead 
to an increase in depression.!® 


It is uncommon for the 

subjects in an experiment to 

be randomly selected from 

some larger population. In fact, 
most experiments use recruited 
volunteers as subjects. When there 
is no sampling, we don’t need to 
check the 10% condition. 
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The table below contains data on the subjects’ scores on the depression test. Higher 
scores show more symptoms of depression. For each subject, we calculated the 
difference in test scores following each of the two treatments (placebo — caffeine). 
We chose this order of subtraction to get mostly positive values. 


Depression Depression Difference 
Subject (caffeine) (placebo) (placebo — caffeine) 
1 5 16 11 
2 5 23 18 
3 4 5 1 
4 3 7 4 
5 8 14 6 
6 5 24 19 
7 0 6 6 
8 0 3 3 
9 2 15 13 
10 11 12 1 
11 1 0 -] 
PROBLEM: 


(a) Why did researchers randomly assign the order in which subjects received placebo and caffeine? 


(b) Carry out a test to investigate the researchers’ question. 


SOLUTION: 


(a) Researchers want to be able to conclude that any statistically significant change in depres- 
sion score is due to the treatments themselves and not to some other variable. One obvious 
concernis the order of the treatments. Suppose that caffeine were given to all the subjects during 
the first 2-day period. What if the weather were nicer on these 2 days than during the second 
2-day period when all subjects were given a placebo? Researchers wouldn't be able to tell if a large 
increase in the mean depression score is due to the difference in weather or due to the treat- 
ments. Random assignment of the caffeine and placebo to the two time periods in the experiment 
should help ensure that no other variable (like the weather) is systematically affecting subjects’ 
responses. 


(b) We'll follow the four-step process. 
STATE: If caffeine deprivation has no effect on depression, then we would expect the actual mean 
difference in depression scores to be O. We therefore want to test the hypotheses 
Ho + qe O 
H,: Ltd >0 
where /1is the true mean difference (placebo — caffeine) in depression score for subjects like these. 
Because no significance level is given, we'll use « = 0.05. 
PLAN: Ifthe conditions are met, we should conduct a paired t test for /14 
* Random: Researchers randomly assigned the treatments—placebo then caffeine, caffeine then 
placebo—to the subjects. 
° 10%: We aren't sampling, so it isn’t necessary to check the 10% condition. 
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Change in depression 
(a) (placebo - caffeine) (6) 


TESTING A CLAIM 


* Normal/Large Sample: We don't know whether the actual distribution of difference in depression 
scores (placebo — caffeine) for subjects like these is Normal. With sucha small sample size (n = 11), 
we need to graph the data to see if it’s safe to use t procedures. Figure 9.18 shows hand sketches of 
acalculator histogram, boxplot, and Normal probability plot for these data. The histogram has an 
irregular shape with so few values; the boxplot shows some right skewness but no outliers; and the 
Normal probability plot is slightly curved, indicating mild skewness. With no outliers or strong 
skewness, the t procedures should be fairly accurate. 


Gr th ES ey) ay eee at Zee Tome 20) 


Change in depression 
(placebo - caffeine) (¢) 


Change in depression 
(placebo - caffeine) 


Just by looking at the data, it appears 
that the true mean change in 


depression score jug is greater than 0. 


However, it’s possible that there has 
been no change and we got a result 
this much larger than jug = 0 by the 
luck of the random assignment. The 
significance test tells us whether this 
explanation is plausible. 


FIGURE 9.18 Sketches of (a) a histogram, (b) a boxplot, and (c) a Normal probability plot of the 
change in depression scores (placebo — caffeine) for the 11 subjects in the caffeine experiment. 
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DO: We entered the differences inlist1 and then 
used the calculator’s t test command with the 
“Draw” option. 

° Teststatistic t= 3.53 

¢ P-value 0.0027, which is the area to the right of 


t = 3.53 on the t distribution curve with / 
df = 11-1=10. 
Note: The calculator doesn’t report the degrees of T-Test 
t=3.5304 P=.0027 


freedom, but you should. 


CONCLUDE: Because the P-value of 0.0027 is less than « = 0.05, we reject Ho: [14 = 0. We 
have convincing evidence that the true mean difference (placebo — caffeine) in depression score is 
positive for subjects like these. 


For Practice Try Exercise 


A few follow-up comments about this example are in order. 
1. We could have calculated the test statistic in the example using the formula 
Xd 7364-0 
sa/Vn — 6.918/VI11 


t= Eee. 


and obtained the P-value using Table B or technology. Check with your teacher 
on whether the calculator-only method is acceptable. Be sure to report 
the degrees of freedom with any t procedure, even if technology doesn't. 


2. The subjects in this experiment were not chosen at random from the 
population of caffeine-dependent individuals. As a result, we can’t generalize 
our findings to all caffeine-dependent people—only to people like the ones 
who took part in this experiment. 
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3. Because researchers randomly assigned the treatments, they can make an 
inference about cause and effect. The data from this experiment provide 
convincing evidence that depriving caffeine-dependent subjects like these of 
caffeine causes an average increase in depression scores. 


Until now, we have only used one-sample t procedures in settings involving 
random sampling. The paired data in the caffeine example came from a matched 
pairs experiment, in which each subject received both treatments in a random 
order. A coin toss or some other chance process was used to carry out the random 
assignment. Why is it legitimate to use a ¢ distribution to perform inference about 
the parameter py in a randomized experiment? The answer to that question will 
have to wait until the next chapter. 


_— YOUR UNDERSTANDING 

Refer to the Data Exploration from Chapter 4 
on page 257. Do the data give convincing evi- 
dence at the a = 0.05 significance level that 
filling tires with nitrogen instead of air decreas- 
es pressure loss? 


Using Tests Wisely 


Significance tests are widely used in reporting the results of research in many 
fields. New drugs require significant evidence of effectiveness and safety. Courts 
ask about statistical significance in hearing discrimination cases. Marketers want 
to know whether a new ad campaign significantly outperforms the old one, and 
medical researchers want to know whether a new therapy performs significantly 
better. In all these uses, statistical significance is valued because it points to an ef- 
fect that is unlikely to occur simply by chance. 

Carrying out a significance test is often quite simple, especially if you use tech- 
nology. Using tests wisely is not so simple. Here are some points to keep in mind 
when using or interpreting significance tests. 


Determining Sample Size How large a sample should researchers take 
when they plan to carry out a significance test? The answer depends on three 
factors: significance level, effect size, and the desired power of the test. Here 
are the questions that researchers must answer to decide how many observations 
they need: 


1. Significance level. How much risk of a Type I error—rejecting the null hy- 
pothesis when Hp is actually true—are we willing to accept? If a Type I error 
has serious consequences, we might opt for a = 0.01. Otherwise, we should 
choose a = 0.05 or a = 0.10. Recall that using a higher significance level 
would decrease the Type II error probability and increase the power. 
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2. Effect size. How large a difference between the null parameter value and the 
actual parameter value is important for us to detect? 


3. Power. What chance do we want our study to have to detect a difference of the 
size we think is important? 


Let’s illustrate typical answers to these questions using an example. 


Developing Stronger Bones 
Planning a study 


Can a 6-month exercise program increase the total body bone mineral content 
(TBBMC) of young women? A team of researchers is planning a study to examine 
this question. The researchers would like to perform a test of 


Ao: = 0 
Jel 10) 


where ju is the true mean percent change in TBBMC due to the exercise program. 
To decide how many subjects they should include in their study, researchers begin 
by answering the three questions above. 


1. Significance level. The researchers decide that a = 0.05 gives enough pro- 
tection against declaring that the exercise program increases bone mineral 
content when it really doesn’t (a Type I error). 

. Effect size. Amean increase in TBBMC of 1% would be considered important 
to detect. 

. Power. The researchers want probability at least 0.9 that a test at the chosen signifi- 
cance level will reject the null hypothesis Ho: u = 0 when the truth is ps = 1. 


The following Activity gives you a chance to investigate the sample size needed 
to achieve a power of 0.9 in the bone mineral content study. 


ACTIVITY | Investigating Power 


MATERIALS: In this Activity, you will use the Statistical Power applet at the book’s Web site to 
Computer with ey determine the sample size needed for the exercise study of the previous exam- 
Internet connec- ra) ple. Based on the results of a previous study, researchers are willing to assume 


tion and display 
capability 


that o = 2 for the percent change in TBBMC over the 6-month period. We'll 
start by seeing whether or not 25 subjects are enough. 
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/ 
/ 
/ 
/ 
/ \ 
—_— 
o.00n0 0.427 1.765 


“1.897 -1.265 -0.637 


Power = 0.804 


1. Go to www.whfreeman.com/tps5e and launch the Statistical Power applet. 
Enter the values: Ho: = 0, Hy: > 0, 0 = 2,n = 25, a = 0.05, and alternate 
pu = 1. Then click “Update.” What is the power? As a class, discuss what this 
number means in simple terms. 


2. Change the significance level to 0.01. What effect does this have on the 
power of the test to detect 4. = 1? Why? 


3. The researchers decide that they are willing to risk a 5% chance of making a 
Type I error. Change the significance level back to a = 0.05. Now increase the 
sample size to 30. What happens to the power? Why? 


4. Keep increasing the sample size until the power is at least 0.90. What mini- 
mum sample size should the researchers use for their study? 

5. Would the researchers need a smaller or a larger sample size to detect a mean 
increase of 1.5% in TBBMC? A 0.85% increase? Use the applet to investigate. 


6. Summarize what you have learned about how significance level, effect size, 
and power influence the sample size needed for a significance test. 


Here is a summary of influences on “How large a sample do I need?” from the 
Activity. 
e If you insist on a smaller significance level (such as 1% rather than 5%), you 
have to take a larger sample. A smaller significance level requires stronger 
evidence to reject the null hypothesis. 


e Ifyou insist on higher power (such as 0.99 rather than 0.90), you will need a 
larger sample. Higher power gives a better chance of detecting a difference 
when it really exists. 


e At any significance level and desired power, detecting a small difference be- 
tween the null and alternative parameter values requires a larger sample than 
detecting a large difference. 
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Statistical Significance and Practical Importance When a null 

hypothesis (“no effect” or “no difference”) can be rejected at the usual levels 

(a = 0.05 or a = 0.01), there is convincing evidence of a difference. But that dif- 

ference may be very small. When large samples are available, even tiny deviations 
from the null hypothesis will be significant. 


Wound Healing Time 
Significant doesn’t mean important 


Suppose we're testing a new antibacterial cream, “Formulation NS,” on a small cut 
made on the inner forearm. We know from previous research that with no medica- 
tion, the mean healing time (defined as the time for the scab to fall off) is 7.6 days. 
The claim we want to test here is that Formulation NS speeds healing. We will use 
a 5% significance level. 


Procedure: We cut a random sample of 250 college students and apply Formulation 
NS to the wounds. The mean healing time for these subjects is x = 7.5 days and 
the standard deviation is s, = 0.9 days. 


Discussion: We want to test a claim about the mean healing time yu in the population 
of college students whose cuts are treated with Formulation NS. Our hypotheses are 


Ho: u = 7.6 days 
fy = /-odays 


An examination of the data reveals no outliers or strong skewness, so the condi- 
tions for performing a one-sample ¢ test are met. We carry out the test and find that 
t = —1.76 and P-value = 0.04 with df = 249. Because 0.04 is less than a = 0.05, 
we reject Ho. We have convincing evidence that Formulation NS reduces the av- 
erage healing time. However, this result is not practically important. Having your 
scab fall off one-tenth a day sooner is no big deal. 


Remember the wise saying: Statistical significance is not the same thing 
as practical importance. The remedy for attaching too much importance 
to statistical significance is to pay attention to the actual data as well as to 
the P-value. Plot your data and examine them carefully. Are there outliers or other 
departures from a consistent pattern? A few outlying observations can produce 
highly significant results if you blindly apply common significance tests. Outliers 
can also destroy the significance of otherwise-convincing data. 

The foolish user of statistics who feeds the data to a calculator or computer _¢ 
without exploratory analysis will often be embarrassed. Is the difference you @ 
are seeking visible in your plots? If not, ask yourself whether the difference is 
large enough to be practically important. Give a confidence interval for the param- 
eter in which you are interested. A confidence interval gives a set of plausible values 
for the parameter rather than simply asking if the observed result is too surprising to 
occur by chance alone when Hp is true. Confidence intervals are not used as often 
as they should be, whereas significance tests are perhaps overused. 


For more on the pitfalls of multiple 
analyses, do an Internet search for 
the XKCD comic about jelly beans 
causing acne. 
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Beware of Multiple Analyses ‘Statistical significance ought to mean that 
you have found a difference that you were looking for. The reasoning behind 
statistical significance works well if you decide what difference you are seeking, 
design a study to search for it, and use a significance test to weigh the evidence 
you get. In other settings, significance may have little meaning. 


Cell Phones and Brain Cancer 


Don’t search for significance! 


Might the radiation from cell phones be harmful to users? Many studies have 
found little or no connection between using cell phones and various illnesses. 
Here is part of a news account of one study: 


A hospital study that compared brain cancer patients and a similar group with- 
out brain cancer found no statistically significant difference between brain can- 
cer rates for the two groups. But when 20 distinct types of brain cancer were 


considered separately, a significant difference in brain cancer rates was found 
for one rare type. Puzzlingly, however, this risk appeared to decrease rather than 
increase with greater mobile phone use.” 


Think for a moment. Suppose that the 20 null hypotheses for these 20 significance 
tests are all true. Then each test has a 5% chance of being significant at the 5% level. 
That’s what a@ = 0.05 means: results this extreme occur only 5% of the time just 
by chance when the null hypothesis is true. We expect about | of 20 tests to give a 
significant result just by chance. Running one test and reaching the a = 0.05 level 
is reasonably good evidence that you have found something; running 20 tests and 
reaching that level only once is not. 


Searching data for patterns is certainly legitimate. Exploratory data analysis 
is an important part of statistics. But the reasoning of formal inference does not 
apply when your search for a striking effect in the data is successful. ‘The remedy 
is clear. Once you have a hypothesis, design a study to search specifically for the 
effect you now think is there. If the result of this study is statistically significant, 
you have real evidence. 


At the beginning of the chapter, we described a study investigat- 
ing whether “normal” human body temperature is really 98.6°F- 
Here is a summary of the details we provided in the chapter- 
opening Case Study (page 537). 
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Temperature (°F) 


e The mean temperature was x = 98.25°F. 
e ‘The standard deviation of the temperature readings was s, = 0.73°F. 
© 62.3% of the temperature readings were less than 98.6°F. 


1. If “normal” body temperature really is 98.6°F, we would expect 
that about half of all healthy 18- to 40-year-olds will have a body 
temperature less than 98.6°F. Do the data from this study provide 
convincing evidence at the a = 0.05 level that this is not the case? 

2. The test in Question 1 has power 0.66 to detect that the actual 
population proportion is 0.60. Describe two changes that could 
be made to increase the power of the test. 


Do the data provide convincing evidence that average normal body tem- 
perature is not 98.6°F? The computer output below shows the results of a one- 
sample t test and a 95% confidence interval for the population mean p. 


One-Sample T 
Test of mu = 98.6 vs not = 98.6 
N Mean StDev SE Mean 95% CI ly Pp 


130 98.2500 0.7300 0.0640 (98.1233, 98.3767) —5.47 0.000 


3. What conditions must be satisfied for a one-sample t test to give 
valid results? Show that these conditions are met in this setting. 

4. Explain how the P-value and the confidence interval lead to the 
same conclusion for the significance test. 

5. Based on the conclusion in Question 4, which type of error could 
have been made: a Type I error or a Type II error? Justify your 


answer. 


Summary 


e The conditions for performing a significance test of Ho: ju = [uo are: 
e Random: The data were produced by a well-designed random sample 
or randomized experiment. 
© 10%: When sampling without replacement, check that the popula- 
tion is at least 10 times as large as the sample. 
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¢ Normal/Large Sample: The population distribution is Normal or the 
sample size is large (n = 30). When the sample size is small (n < 30), 
examine a graph of the sample data for any possible departures from 
Normality in the population. You should be safe using a ¢ distribution as 
long as there is no strong skewness and no outliers are present. 


e ‘The one-sample ¢ test for a mean uses the test statistic 


s,/Vn 


with P-values calculated from the ¢ distribution with n — | degrees of freedom. 


t 


e¢ Confidence intervals provide additional information that significance tests 
do not—namely, a set of plausible values for the parameter pu. A two-sided 
test of Ho: 4 = fo at significance level a gives the same conclusion as a 
100(1 — a)% confidence interval for pu. 

e Analyze paired data by first taking the difference within each pair to produce 
a single sample. Then use one-sample t procedures. 

e There are three factors that influence the sample size required for a statistical 
test: significance level, effect size, and the desired power of the test. 

e Very small differences can be highly significant (small P-value) when a test 
is based on a large sample. A statistically significant difference need not be 
practically important. 

e Many tests run at once will probably produce some significant results by 
chance alone, even if all the null hypotheses are true. 


9.3) TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


19. Computing P-values from t distributions on the calculator page 579 


20. One-sample t test for a mean on the calculator page 582 


Exercises 


65. Attitudes The Survey of Study Habits and At- 30 years of age. Check the conditions for carrying 
M575] titudes (SSHA) is a psychological test that measures out a significance test of the teacher’s suspicion. 
& students’ attitudes toward school and study habits. 
Scores range from 0 to 200. Higher scores indicate 66. Anemia Hemoglobin is a protein in red blood cells 
more positive attitudes. The mean score for U.S. that carries oxygen from the lungs to body tissues. 
college students is about 115. A teacher suspects People with fewer than 12 grams of hemoglobin per 
that older students have better attitudes toward deciliter of blood (g/dl) are anemic. A public health 
school. She gives the SSHA to an SRS of 45 of the official in Jordan suspects that Jordanian children are 


over 1000 students at her college who are at least at risk of anemia. He measures a random sample of 


596 


67. 


63.4 65.0 64.4 63.3 


68. 
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50 children. Check the conditions for carrying out a 
significance test of the official’s suspicion. 


Ancient air ‘The composition of the earth’s 
atmosphere may have changed over time. To try 
to discover the nature of the atmosphere long ago, 
we can examine the gas in bubbles inside ancient 
amber. Amber is tree resin that has hardened and 
been trapped in rocks. The gas in bubbles within 
amber should be a sample of the atmosphere at the 
time the amber was formed. Measurements on 9 
specimens of amber from the late Cretaceous era 
(75 to 95 million years ago) give these percents of 
nitrogen:”” 


Sas (Ann C03 49. SIL 


Explain why we should not carry out a one-sample t 
test in this setting. 


Paying high prices? A retailer entered into an 
exclusive agreement with a supplier who guaranteed 
to provide all products at competitive prices. ‘The 
retailer eventually began to purchase supplies from 
other vendors who offered better prices. ‘The original 
supplier filed a lawsuit claiming violation of the 
agreement. In defense, the retailer had an audit 
performed on a random sample of 25 invoices. For 
each audited invoice, all purchases made from other 
suppliers were examined and compared with those 
offered by the original supplier. The percent of 
purchases on each invoice for which an alternative 
supplier offered a lower price than the original sup- 
plier was recorded.! For example, a data value of 38 
means that the price would be lower with a different 
supplier for 38% of the items on the invoice. A his- 
togram and some computer output for these data are 
shown below. Explain why we should not carry out a 
one-sample t test in this setting. 


Frequency 


0 20 40 60 80 100 120 


Percent lower 


Column 


pctlower 25 77.76 32.6768 6.5353603 100 0) 


Summary statistics 
n Mean Std. Dev. Std. Err. Median Min Max Q1 Q3 


100 68 100 
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Attitudes In the study of older students’ attitudes 
from Exercise 65, the sample mean SSHA score was 
125.7 and the sample standard deviation was 29.8. 


Calculate the test statistic. 


Find the P-value using ‘Table B. Then obtain a more 
precise P-value from your calculator. 


Anemia For the study of Jordanian children in 
Exercise 66, the sample mean hemoglobin level was 
11.3 mg/dl and the sample standard deviation was 
1.6 mg/dl. 


Calculate the test statistic. 


Find the P-value using ‘Table B. Then obtain a more 
precise P-value from your calculator. 


One-sided test Suppose you carry out a significance 
test of Ho: uw = 5 versus H,: 4 < 5 based on a sample 
of size n = 20 and obtaint = —1.81. 


Find the P-value for this test using ‘Table B or tech- 
nology. What conclusion would you draw at the 5% 
significance level? At the 1% significance level? 


Redo part (a) using an alternative hypothesis of 

El 3/0 a Ds. 

Two-sided test The one-sample t statistic from a 
sample of n = 25 observations for the two-sided test of 


Ho: = 64 
Hy: # 64 


has the value t = —1.12. 


Find the P-value for this test using ‘Table B or tech- 
nology. What conclusion would you draw at the 5% 
significance level? At the 1% significance level? 


Redo part (a) using an alternative hypothesis of 
Jala /l Storr. 


Construction zones Every road has one at some 
point—construction zones that have much lower speed 
limits. ‘To see if drivers obey these lower speed limits, a 
police officer uses a radar gun to measure the speed (in 
miles per hours, or mph) of a random sample of 10 driv- 
ers ina 25 mph construction zone. Here are the data: 


Be 33 Bye Pil 30) 30) AS) BE 


Is there convincing evidence that the average speed 
of drivers in this construction zone is greater than the 
posted speed limit? 


Given your conclusion in part (a), which kind of 
mistake—a ‘Type I error or a ‘Type II error—could 
you have made? Explain what this mistake would 
mean in context. 


Heat through the glass How well materials conduct 
heat matters when designing houses, for example. 
Conductivity is measured in terms of watts of heat 


power transmitted per square meter of surface per 
degree Celsius of temperature difference on the two 
sides of the material. In these units, glass has conduc- 
tivity about 1. The National Institute of Standards 
and ‘Technology provides exact data on properties 

of materials. Here are measurements of the heat 
conductivity of 11 randomly selected pieces of a 
particular type of glass:”” 


WollM Oy Wel MAO We AOS WAOkes WCIKS) TUS} Its) Mh 


Is there convincing evidence that the mean conduc- 
tivity of this type of glass is greater than 1? 


(b) Given your conclusion in part (a), which kind of 
mistake—a ‘Type I error or a ‘Type II error—could 
you have made? Explain what this mistake would 
mean in context. 

75. Healthy bones The recommended daily allowance 


(RDA) of calcium for women between the ages of 18 
and 24 years is 1200 milligrams (mg). Researchers who 
were involved in a large-scale study of women’s bone 
health suspected that their participants had significantly 
lower calcium intakes than the RDA. To test this suspi- 
cion, the researchers measured the daily calcium intake 
of a random sample of 36 women from the study who 
fell in the desired age range. ‘The Minitab output below 
displays the results of a significance test. 


One-Sample T: Calcium intake (mg) 
Test of mu = 1200 vs < 1200 


StDev SE Mean ae 12) 


Variable N 


Mean 


Calcium 5 85652 SOG 5 7 Slee Sos: Wc{olono) 


(a) 


Do these data give convincing evidence to support 
the researchers’ suspicion? Justify your answer. 


(b) Interpret the P-value in context. 


76. ‘Taking stock An investor with a stock portfolio 
worth several hundred thousand dollars sued his bro- 
ker due to the low returns he got from the portfolio 
at a time when the stock market did well overall. The 
investor’s lawyer wants to compare the broker’s per- 
formance against the market as a whole. He collects 
data on the broker’s returns for a random sample of 
36 weeks. Over the 10-year period that the broker has 
managed portfolios, stocks in the Standard & Poor's 
500 index gained an average of 0.95% per week. The 
Minitab output below displays the results of a signifi- 
cance test. 


One-Sample T: Return (percent) 


Test of mu = 0.95 vs < 0.95 


Variable N Mean StDev SE Mean uy P 


Return 36 —1.441 4.810 0.802 =A. 98) 0) OOS) 


(percent) 
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(a) 


Do these data give convincing evidence to support 
the lawyer’s case? Justify your answer. 


Interpret the P-value in context. 


Pressing pills A drug manufacturer forms tablets by 
compressing a granular material that contains the 
active ingredient and various fillers. The hardness of 
a sample from each batch of tablets produced is mea- 
sured to control the compression process. The target 
value for the hardness is = 11.5. The hardness data 
for a random sample of 20 tablets are 


11.627 11.613 EOS 11.602 11.360 
11.374 Sz 11.458 I 2 11.463 
11.383 11.715 11.485 11.509 E29 
Were 11.570 11.623 il ar72 Il sesitl 


Hiss 


79. 


80. 


81. 


Is there convincing evidence at the 5% level that the 
mean hardness of the tablets differs from the target 
value? 


Filling cola bottles Bottles of a popular cola are 
supposed to contain 300 milliliters (ml) of cola. 
There is some variation from bottle to bottle be- 
cause the filling machinery is not perfectly precise. 
An inspector measures the contents of six randomly 
selected bottles from a single day’s production. The 
results are 


Aer ANG OU) 2) sX02 2977/0 


Do these data provide convincing evidence that the 
mean amount of cola in all the bottles filled that day 
differs from the target value of 300 ml? 


Pressing pills Refer to Exercise 77. Construct and 
interpret a 95% confidence interval for the popula- 
tion mean jz. What additional information does the 
confidence interval provide? 


Filling cola bottles Refer to Exercise 78. Construct 
and interpret a 95% confidence interval for the popu- 
lation mean pu. What additional information does the 
confidence interval provide? 


Fast connection? How long does it take for a 
chunk of information to travel from one server to 
another and back on the Internet? According to the 
site internettrafficreport.com, a typical response 
time is 200 milliseconds (about one-fifth of a sec- 
ond). Researchers collected data on response times 
of a random sample of 14 servers in Europe. A 
graph of the data reveals no strong skewness or out- 
liers. The following figure displays Minitab output 
for a one-sample t interval for the population mean. 
Is there convincing evidence at the 5% significance 
level that the site’s claim is incorrect? Justify your 
answer. 
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One-Sample T: Response times 


Variable N Mean StDev SE Mean 
Response times 14 173.93 27.21 7.27 


95% CI 
(158.22, 189.64) 


< 


Water! A blogger claims that U.S. adults drink an 
average of five 8-ounce glasses of water per day. 
Skeptical researchers ask a random sample of 24 

US. adults about their daily water intake. A graph of 
the data shows a roughly symmetric shape with no 
outliers. The figure below displays Minitab output 
for a one-sample t interval for the population mean. 
Is there convincing evidence at the 10% significance 
level that the blogger’s claim is incorrect? Justify your 
answer. 


One-Sample T: Water intake (0z) 


Variable N 
Water intake (oz) 24 4.204 1.173 


< 


Mean StDev SE Mean 90% CI 


0.240 (3.794, 4.615) 


Tests and CIs The P-value for a two-sided test of the 
null hypothesis Ho: js = 10 is 0.06. 


Does the 95% confidence interval for yz include 10? 
Why or why not? 


Does the 90% confidence interval for yz include 10? 
Why or why not? 


Tests and CIs The P-value for a two-sided test of the 
null hypothesis Ho: ps = 15 is 0.03. 


Does the 99% confidence interval for yu include 15? 
Why or why not? 


Does the 95% confidence interval for yu include 15? 
Why or why not? 


Right versus left The design of controls and instru- 
ments affects how easily people can use them. A 
student project investigated this effect by asking 25 
right-handed students to turn a knob (with their right 
hands) that moved an indicator. ‘There were two 
identical instruments, one with a right-hand thread 
(the knob turns clockwise) and the other with a 
left-hand thread (the knob must be turned counter- 
clockwise). Each of the 25 students used both instru- 
ments in a random order. The following table gives 
the times in seconds each subject took to move the 
indicator a fixed distance.” Note that smaller times 
are better. 


(a) 


(b) 


86. 


Subject Right thread Left thread 
1 11s 137 
2 105 105 
3 130 133 
4 101 108 
5 138 115 
6 118 170 
v 87 103 
8 116 145 
9 15 78 

10 96 107 
11 122 84 
12 103 148 
13 116 147 
14 107 87 
15 118 166 
16 103 146 
17 114 128) 
18 104 135 
19 111 112 
20 89 93 
21 78 76 
22 100 116 
23 89 78 
24 85 101 
25 88 123 


Explain why it was important to randomly assign the 
order in which each subject used the two knobs. 


The project designers hoped to show that right- 
handed people find right-hand threads easier to use, 
on average. Carry out a test at the 5% significance 
level to investigate this claim. 


Floral scents and learning We hear that listening 
to Mozart improves students’ performance on tests. 
Maybe pleasant odors have a similar effect. To 

test this idea, 21 subjects worked two different but 
roughly equivalent paper-and-pencil mazes while 
wearing a mask. ‘The mask was either unscented or 
carried a floral scent. Each subject used both masks, 
in a random order. The table below gives the sub- 
jects’ times (in seconds) with both masks.** Note that 
smaller times are better. 


Subject Unscented Scented 
1 30.60 37.97 
2 48.43 Sy doy 
3 60.77 56.67 
4 36.07 40.47 


(a) 


87. 


88. 


Subject Unscented Scented 
5 68.47 49.00 
6 32.43 43.23 
7 43.70 44.57 
8 37.10 28.40 
9 Sl? 28.23 

10 SileZs) 68.47 
11 65.40 51.10 
12 58.93 83.50 
is 54.47 38.30 
14 43,53 SikS 
15 37.93 29.33 
16 43.50 54.27 
le 87.70 62.73 
18 BSt53 58.00 
19 64.30 52.40 
20 47.37 53.63 
21 53.67 47.00 


Explain why it was important to randomly assign the 
order in which each subject used the two masks. 


Do these data provide convincing evidence that the 
floral scent improved performance, on average? 


Growing tomatoes Researchers suspect that Variety A 
tomato plants have a higher average yield than Variety 
B tomato plants. To find out, researchers randomly 
select 10 Variety A and 10 Variety B tomato plants. 
Then the researchers divide in half each of 10 small 
plots of land in different locations. For each plot, a 
coin toss determines which half of the plot gets a Va- 
riety A plant; a Variety B plant goes in the other half. 
After harvest, they compare the yield in pounds for 

the plants at each location. The 10 differences in yield 
(Variety A — Variety B) are recorded. A graph of the 
differences looks roughly symmetric and single-peaked 
with no outliers. A paired t test on the differences 
yields t = 1.295 and P-value = 0.1138. 


State appropriate hypotheses for the paired t test. Be 
sure to define your parameter. 


What are the degrees of freedom for the paired t test? 


Interpret the P-value in context. What conclusion 
should the researchers draw? 


Describe a ‘Type I error and a Type II error in this 
setting. Which mistake could researchers have made 
based on your answer to part (c)? 


Music and memory Does listening to music while 
studying hinder students’ learning? Two AP® Sta- 
tistics students designed an experiment to find out. 
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They selected a random sample of 30 students from 
their medium-sized high school to participate. Each 
subject was given 10 minutes to memorize two dif 
ferent lists of 20 words, once while listening to music 
and once in silence. The order of the two word lists 
was determined at random; so was the order of the 
treatments. The difference in the number of words 
recalled (music — silence) was recorded for each 
subject. A paired t test on the differences yielded 

t = —3.01 and P-value = 0.0027. 


State appropriate hypotheses for the paired t test. Be 
sure to define your parameter. 


What are the degrees of freedom for the paired t test? 


Interpret the P-value in context. What conclusion 
should the students draw? 


Describe a ‘Type I error and a ‘Type II error in this set- 
ting. Which mistake could students have made based 
on your answer to part (c)? 


The power of tomatoes Refer to Exercise 87. 
Explain two ways that the researchers could have 
increased the power of the test to detect ys = 0.5. 


Music and memory Refer to Exercise 88. Which of the 
following changes would give the test a higher power to 
detect 4p = —1: using a = 0.01 or a = 0.10? Explain. 


Significance and sample size A study with 5000 
subjects reported a result that was statistically signifi- 
cant at the 5% level. Explain why this result might 
not be particularly large or important. 


Sampling shoppers A marketing consultant 
observes 50 consecutive shoppers at a supermarket, 
recording how much each shopper spends in the 
store. Explain why it would not be wise to use these 
data to carry out a significance test about the mean 
amount spent by all shoppers at this supermarket. 


Do you have ESP? A researcher looking for evidence 
of extrasensory perception (ESP) tests 500 subjects. 
Four of these subjects do significantly better (P < 0.01) 
than random guessing. 


Is it proper to conclude that these four people have 
ESP? Explain your answer. 


What should the researcher now do to test whether 
any of these four subjects have ESP? 


Ages of presidents Joe is writing a report on the 
backgrounds of American presidents. He looks up 
the ages of all the presidents when they entered 
office. Because Joe took a statistics course, he uses 
these numbers to perform a significance test about 
the mean age of all U.S. presidents. Explain why this 


makes no sense. 
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Multiple choice: Select the best answer for Exercises 95 
to 102. 


95. ‘The reason we use t procedures instead of z proce- 
dures when carrying out a test about a population 
mean is that 


(a) z requires that the sample size be large. 


(b) z requires that you know the population standard 
deviation o. 


(c) zrequires that the data come from a random sample 
or randomized experiment. 


(d) z requires that the population distribution be per- 
fectly Normal. 


(e) zcan only be used for proportions. 


96. You are testing Ho: 4 = 10 against H,: 4 < 10 based 
on an SRS of 20 observations from a Normal popula- 
tion. The t statistic is t = —2.25. The P-value 


(a) falls between 0.01 and 0.02. 
(b) falls between 0.02 and 0.04. 
(c) falls between 0.04 and 0.05. 
(d) falls between 0.05 and 0.25. 
(e) is greater than 0.25. 


97. You are testing Ho: 4 = 10 against H,: 4 # 10 based 
on an SRS of 15 observations from a Normal popula- 
tion. What values of the t statistic are statistically 
significant at the a = 0.005 level? 


(a) t> 3.326 (d) t< —3.326 ort > 3.326 
(b) > 3.286 (e) t< —3.286 ort > 3.286 
(c) t> 2.977 


98. After checking that conditions are met, you perform 
a significance test of Ho: 4 = | versus Hy: u # 1. You 
obtain a P-value of 0.022. Which of the following 
must be true? 


(a) A95% confidence interval for sz will include the 
value 1. 


(b) A95% confidence interval for jz will include the 
value 0. 


(c) A99% confidence interval for sz will include the 
value |. 


(d) A99% confidence interval for sz will include the 
value 0. 


(e) None of these is necessarily true. 


99. Does Friday the 13th have an effect on people’s 
behavior? Researchers collected data on the number 
of shoppers at a sample of 45 nearby grocery stores 


on Friday the 6th and Friday the 13th in the same 
month. The dotplot and computer output below 
summarize the data on the difference in the number 
of shoppers at each store on these two days (subtract- 
ing in the order 6th minus 13th).” 
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Researchers would like to carry out a test of 

Ho: fg = 0 versus Hy: wg # 0, where ig is the true 
mean difference in the number of grocery shoppers 
on these two days. Which of the following conditions 
for performing a paired ¢ test are clearly satisfied? 


I. Random _ II. 10% 
(c) IH only 
(d) [and II only 


III. Normal/Large Sample 

(a) I only (e) I, Il, and IL 

(b) II only 

100. The most important condition for sound conclu- 
sions from statistical inference is that 

(a) the data come from a well-designed random sample 
or randomized experiment. 

(b) the population distribution be exactly Normal. 


(c) the data contain no outliers. 


(d) the sample size be no more than 10% of the popula- 
tion size. 


(e) the sample size be at least 30. 


101. Vigorous exercise helps people live several years 
longer (on average). Whether mild activities like 
slow walking extend life is not clear. Suppose that the 
added life expectancy from regular slow walking is 
just 2 months. A statistical test is more likely to finda 
significant increase in mean life expectancy if 

(a) itis based ona very large random sample and a 5% 
significance level is used. 


(b) itis based on a very large random sample and a 1% 
significance level is used. 


(c) itis based on a very small random sample and a 5% 
significance level is used. 


(d) itis based ona very small random sample and a 1% 
significance level is used. 


(e) the size of the sample doesn’t have any effect on the 
significance of the test. 


102. A researcher plans to conduct a significance test 
at the a = 0.01 significance level. She designs her 
study to have a power of 0.90 at a particular alterna- 
tive value of the parameter of interest. The probabil- 
ity that the researcher will commit a Type II error for 
the particular alternative value of the parameter at 
which she computed the power is 


(a) 0.01. (b) 0.10. (ce) 0.89. (d) 0.90. (e) 0.99. 


103. Is your food safe? (8.1) “Do you feel confident or 

p>, not confident that the food available at most grocery 

“© stores is safe to eat?” When a Gallup Poll asked this 
question, 87% of the sample said they were confi- 
dent.” Gallup announced the poll’s margin of error 
for 95% confidence as +3 percentage points. Which 
of the following sources of error are included in this 
margin of error? Explain. 

(a) Gallup dialed landline telephone numbers at random 
and so missed all people without landline phones, 
including people whose only phone is a cell phone. 
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(b) Some people whose numbers were chosen never 
answered the phone in several calls or answered but 
refused to participate in the poll. 


(c) ‘There is chance variation in the random selection of 
telephone numbers. 


104. Spinning for apples (6.3 or 7.3) In the “Ask Mari- 

> lyn” column of Parade magazine, a reader posed this 

© question: “Say that a slot machine has five wheels, 
and each wheel has five symbols: an apple, a grape, 
a peach, a pear, and a plum. I pull the lever five 
times. What are the chances that I'll get at least one 
apple?” Suppose that the wheels spin independently 
and that the five symbols are equally likely to appear 
on each wheel in a given spin. 


(a) Find the probability that the slot player gets at least one 
apple in one pull of the lever. Show your method clearly. 


(b) Now answer the reader’s question. Show your 
method clearly. 


Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


Anne reads that the average price of regular gas in her 
state is $4.06 per gallon. To see if the average price of gas 
is different in her city, she selects 10 gas stations at random 
and records the price per gallon for regular gas at each sta- 
tion. The data, along with the sample mean and standard 
deviation, are listed in the table below. 


Station Price 
1 4.13 
2 4.01 
3 4.09 
4 4.05 


Station Price 
5 3.97 
6 3.99 
7 4.05 
8 3.98 
9 4.09 
4.02 
4.038 
0.0533 


Do the data provide convincing evidence that the average 
price of gas in Anne’s city is different from $4.06 per gallon? 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you think 
each solution is “complete,” “substantial,” “developing,” or “minimal.” 
If the solution is not complete, what improvements would you suggest 
to the student who wrote it? Finally, your teacher will provide you with 
a scoring rubric. Score your response and note what, if anything, you 
would do differently to improve your own score. 
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Section 9.1: Significance Tests: The Basics 


In this section, you learned the basic ideas of significance 
testing. Start by stating the hypotheses that you want to test. 
The null hypothesis (Ho) is typically a statement of “no dif- 
ference” and the alternative hypothesis (H,) describes what 
we suspect is true. Remember that hypotheses are always 
about parameters, not statistics. 

When sample data provide support for the alternative 
hypothesis, there are two possible explanations: (1) the null 
hypothesis is true, and data supporting the alternative hy- 
pothesis occurred just by chance, or (2) the alternative 
hypothesis is true, and the data are consistent with an alter- 
native value of the parameter. In a significance test, always 
start with the belief that the null hypothesis is true. If you 
can tule out chance as a plausible explanation for the ob- 
served data, there is convincing evidence that the alterna- 
tive hypothesis is true. 

The P-value in a significance test measures how likely 
it is to get results at least as extreme as the observed re- 
sults by chance alone, assuming the null hypothesis is true. 
To determine if the P-value is small enough to reject Ho, 
compare it to a predetermined significance level such as 
a = 0.05. If P-value < a, reject Hy—there is convincing 
evidence that the alternative hypothesis is true. However, 
if P-value = a, fail to reject Hyp—there is not convincing 
evidence that the alternative hypothesis is true. 

Because conclusions are based on sample data, there is 
a possibility that the conclusion will be incorrect. You can 
make two types of errors in a significance test: a Type | error 
occurs if you find convincing evidence for the alternative 
hypothesis when, in reality, the null hypothesis is true. A 
Type II error occurs when you don’t find convincing evi- 
dence that the alternative hypothesis is true when, in re- 
ality, the alternative hypothesis is true. The probability of 
making a ‘Type I error is equal to the significance level (a) 
of the test. 


Section 9.2: Tests about a Population Proportion 


In this section, you learned the details of conducting a sig- 
nificance test for a population proportion p. Whenever you 
are asked if there is convincing evidence for a claim about 
a population proportion, you are expected to respond using 
the familiar four-step process. 


STATE: Give the hypotheses you are testing in terms of p, 
state the significance level, and define the parameter p. 


PLAN: Name the procedure you plan to use (one-sample z 
test for a population proportion) and check the appropriate 
conditions (Random, 10%, Large Counts) to see if the pro- 
cedure is appropriate. 


e Random: The data come from a well-designed random 
sample or randomized experiment. 


© 10%: The sample size should be no larger than 10% of 
the population when sampling without replacement. 


e Large Counts: Both npp and n(1—po) must be at least 10, 
where fo is the value of p in the null hypothesis. 


DO: Calculate the test statistic and P-value. The test statistic 
z measures how far away the sample statistic is from the hy- 
pothesized parameter value in standardized units: 


p — Po 
po(l — po) 


n 


To calculate the P-value, use ‘Table A or technology. 


CONCLUDE: Use the P-value to make an appropriate conclu- 
sion about the hypotheses in context. 

Perform a two-sided test when looking for convincing ev- 
idence that the true value of the parameter is different from 
the hypothesized value, in either direction. The P-value for 
a two-sided test is calculated by finding the probability of 
getting a sample statistic at least as extreme as the observed 
statistic, in either direction, assuming the null hypothesis 
is true. 

You can also use a confidence interval to make a conclu- 
sion for a two-sided test. If the null parameter value is one 
of the plausible values in the interval, there isn’t convinc- 
ing evidence that the alternative hypothesis is true. How- 
ever, if the null parameter value is not one of the plausible 
values in the interval, there is convincing evidence that the 
alternative hypothesis is true. Besides helping you draw a 
conclusion, the interval tells you which alternative param- 
eter values are plausible. 

The probability that you avoid making a ‘Type II error 
when an alternative value of the parameter is true is called 
the power of the test. Power is good—if the alternative hy- 
pothesis is true, we want to maximize the probability of 
finding convincing evidence that it is true. We can increase 
the power of a significance test by increasing the sample 
size or by increasing the significance level. The power of 
a test will also be greater when the alternative value of the 
parameter is farther away from the null hypothesis value. 


Section 9.3: Tests about a Population Mean 


In this section, you learned the details of conducting a sig- 
nificance test for a population mean. Although some of the 
details are different, the reasoning and structure of the tests 
in this section are the same as in Section 9.2. In fact, the 
“State” and “Conclude” steps are exactly the same, other 
than the switch from proportions to means. 


PLAN: Name the procedure you are using (one-sample 
t test for a population mean), and check the conditions 
(Random, 10%, and Normal/Large Sample). The Random 
and 10% conditions are the same as in Section 9.2. The 
Normal/Large Sample condition states that the population 
distribution must be Normal or the sample size must be 
large (n = 30). If the sample is small and the population 
shape is unknown, graph the sample data to make sure 
there is no strong skewness or outliers. 


DO: Calculate the test statistic and P-value. ‘The test statistic 
t measures how far away the sample statistic is from the hy- 
pothesized parameter value in standardized units: 


jee 
s,/Wn 


What Did You Learn? 


Learning Objective 


Section 


To calculate the P-value, determine the degrees of freedom 
(df = n- 1) and use ‘Table B or technology. 

Use a paired t test to analyze the results of comparative 
experiments and observational studies that produce paired 
data. Start by calculating the difference for each pair and 
use the set of differences to check the Normal/Large Sam- 
ple condition and to calculate the test statistic and P-value. 

Remember to use significance tests wisely. When plan- 
ning a study, use a large enough sample size so the test 
will have adequate power. Also, remember that statistically 
significant results aren’t always “practically” important. Fi- 
nally, be aware that the probability of making at least one 
‘Type I error goes up dramatically when conducting mul- 
tiple tests. 


Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


State the null and alternative hypotheses for a significance test 
about a population parameter. 


540 R9.1 


Interpret a P-value in context. 


943, 544 R9.5 


Determine if the results of a study are statistically significant and 
draw an appropriate conclusion using a significance level. 


546 R9.5 


Interpret a Type | and a Type Il error in context, and give a 
consequence of each. 


548 R9.3, R9.4 


State and check the Random, 10%, and Large Counts conditions 
for performing a significance test about a population proportion. 


555 R9.4 


Perform a significance test about a population proportion. 


599, 562 R9.4 


Interpret the power of a test and describe what factors affect the 
power of a test. 


565, Discussion on 568 R9.3 


Describe the relationship among the probability of a Type | error 
(significance level), the probability of a Type Il error, and the power 
of a test. 


State and check the Random, 10%, and Normal/Large Sample 
conditions for performing a significance test about a population 
mean. 


575 R9.2, R9.6, R9.7 


Perform a significance test about a population mean. 


580, 583 R9.6 


Use a confidence interval to draw a conclusion for a two-sided test 
about a population parameter. 


563, 585 R9.5, R9.6 


Perform a significance test about a mean difference using 
paired data. 


586 R9.7 
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Chapter 9 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R9.1 Stating hypotheses State the appropriate null and 
alternative hypotheses in each of the following set- 
tings. Be sure to define the parameter. 


— 
~ 
~ 


The average height of 18-year-old American women 
is 64.2 inches. You wonder whether the mean 
height of this year’s female graduates from a large 
local high school (over 3000 students) differs from 
the national average. You measure an SRS of 48 
female graduates and find that x = 63.1 inches. 
Mr. Starnes believes that less than 75% of the stu- 
dents at his school completed their math homework 
last night. ‘The math teachers inspect the homework 
assignments from a random sample of students at 
the school to help Mr. Starnes test his claim. 


Ss 


R9.2 Fonts and reading ease Does the use of fancy type 
fonts slow down the reading of text on a computer 
screen? Adults can read four paragraphs of text in 
the common ‘Times New Roman font in an average 
time of 22 seconds. Researchers asked a random 
sample of 24 adults to read this text in the ornate 
font named Gigi. Here are their times, in seconds: 


23.2 21.2 28.9 27.7 29.1 27.3 16.1 22.6 25.6 34.2 23.9 26.8 
20.5 34.3 21.4 32.6 26.2 34.1 31.5 24.6 23.0 28.6 24.4 28.1 


State and check the conditions for performing a 
significance test using these data. 


R9.3 Strong chairs? A company that manufactures class- 
room chairs for high school students claims that the 
mean breaking strength of the chairs that they make 
is 300 pounds. One of the chairs collapsed beneath 
a 220-pound student last week. You wonder whether 
the manufacturer is exaggerating the breaking 
strength of the chairs. 


= 


State appropriate null and alternative hypotheses in 

this setting. Be sure to define your parameter. 

(b) Describe a Type I error and a Type II error in this 
setting, and give the consequences of each. 

(c) Would you recommend a significance level of 0.01, 
0.05, or 0.10 for this test? Justify your choice. 

(d) The power of the test to detect p = 294 using 
a = 0.05 is 0.71. Interpret this value in context. 

(e) Explain two ways that you could increase the power 

of the test from (d). 


R9.4 Flu vaccine A drug company has developed a new 
vaccine for preventing the flu. The company claims 


(a 


that fewer than 5% of adults who use its vaccine 
will get the flu. To test the claim, researchers give 
the vaccine to a random sample of 1000 adults. Of 
these, +3 get the flu. 


(a) Do these data provide convincing evidence to sup- 
port the company’s claim? 

(b) Which kind of mistake—a Type I error or a ‘Type II 
error—could you have made in (a)? Explain. 

(c) From the company’s point of view, would a Type I 
error or ‘lype II error be more serious? Why? 


R9.5 Roulette An American roulette wheel has 18 red 
slots among its 38 slots. ‘To test if a particular rou- 
lette wheel is fair, you spin the wheel 50 times and 
the ball lands in a red slot 31 times. The resulting 
P-value is 0.0384. 


(a) Interpret the P-value in context. 

(b) Are the results statistically significant at the a = 
0.05 level? Explain. What conclusion would you 
make? 

(c) The casino manager uses your data to produce a 
99% confidence interval for p and gets (0.44, 0.80). 
He says that this interval provides convincing evi- 
dence that the wheel is fair. How do you respond? 


R9.6 Radon detectors Radon is a colorless, odorless 
gas that is naturally released by rocks and soils and 
may concentrate in tightly closed houses. Because 
radon is slightly radioactive, there is some concern 
that it may be a health hazard. Radon detectors are 
sold to homeowners worried about this risk, but the 
detectors may be inaccurate. University research- 
ers placed a random sample of 11 detectors in a 
chamber where they were exposed to 105 picocuries 
per liter of radon over 3 days. A graph of the radon 
readings from the 11 detectors shows no strong 
skewness or outliers. The mean reading is 104.82 
and the standard deviation of the readings is 9.54. 


(a) Is there convincing evidence at the 10% level that 
the mean reading differs from the true value 105? 

(b) A90% confidence interval for the true mean read- 
ing is (99.61, 110.03). Is this interval consistent with 
your conclusion from part (a)? Explain. 


R9.7 Better barley Does drying barley seeds in a kiln 
increase the yield of barley? A famous experiment 
by William S. Gosset (who discovered the ¢ distribu- 
tions) investigated this question. Eleven pairs of adja- 
cent plots were marked out ina large field. For each 
pair, regular barley seeds were planted in one plot 
and kiln-dried seeds were planted in the other. The 
following table displays the data on yield (Ib/acre).”” 


Plot Regular Kiln 
1 1903 2009 
2 1935 1915 
3 1910 2011 
4 2496 2463 
5 2108 2180 
6 1961 1925 
7 2060 2122 
8 1444 1482 
9 1612 1542 

10 1316 1443 
11 1511 1535 
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(a) How can the Random condition be satisfied in this 
study? 

(b) Assuming that the Random condition has been met, 
do these data provide convincing evidence that 
drying barley seeds in a kiln increases the yield of 
barley, on average? Justify your answer. 


Chapter 9 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T9.1 An opinion poll asks a random sample of adults 
whether they favor banning ownership of handguns 
by private citizens. A commentator believes that 
more than half of all adults favor such a ban. The 
null and alternative hypotheses you would use to test 
this claim are 


T9.2 You are thinking of conducting a one-sample t test 
about a population mean y using a 0.05 significance 
level. Which of the following statements is correct? 


(a) You should not carry out the test if the sample does 
not have a Normal distribution. 


(b) You can safely carry out the test if there are no outli- 
ers, regardless of the sample size. 

(c) You can carry out the test if a graph of the data shows 
no strong skewness, regardless of the sample size. 

(d) You can carry out the test only if the population 
standard deviation is known. 


(e) You can safely carry out the test if your sample size is 
at least 30. 


T9.3 ‘To determine the reliability of experts who interpret 
lie detector tests in criminal investigations, a random 
sample of 280 such cases was studied. The results were 


Suspect’s True Status 
Examiner’s Decision Innocent Guilty 
“Innocent” 131 15 
“Guilty” 9 125) 


If the hypotheses are Ho: suspect is innocent versus 
H,;: suspect is guilty, then we could estimate the 
probability that experts who interpret lie detector 
tests will make a Type II error as 


15/280. (c) 15/140. 
9/280. (d) 9/140. 


— 
fo 
=e 


(e) 15/146. 


S 


T9.4 A significance test allows you to reject a null hypoth- 
esis Ho in favor of an alternative H, at the 5% signifi- 
cance level. What can you say about significance at 
the 1% level? 


(a) Ho can be rejected at the 1% significance level. 


(b) There is insufficient evidence to reject Hp at the 1% 
significance level. 


(c) There is sufficient evidence to accept Hp at the 1% 
significance level. 
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(d) H, can be rejected at the 1% significance level. 


(e) ‘The answer can’t be determined from the informa- 
tion given. 

T9.5 Arandom sample of 100 likely voters in a small city 
produced 59 voters in favor of Candidate A. The 
observed value of the test statistic for testing the 
null hypothesis Hp: p = 0.5 versus the alternative 
hypothesis H,:p > 0.5 is 


eee 0.59 — 0.5 y= ee 
[0.59(0.41) 0.5(0.5) 
100 100 
0.59 — 0.5 0.59 — 0.5 
— (eg = 
0.5(0.5) 0.5(0.5) 
100 100 
0.5 — 0.59 
(c) z= 
0.59(0.41) 
100 


T9.6 A researcher claims to have found a drug that causes 
people to grow taller. The coach of the basketball 
team at Brandon University has expressed interest 
but demands evidence. Over 1000 Brandon students 
volunteer to participate in an experiment to test this 
new drug. Fifty of the volunteers are randomly se- 
lected, their heights are measured, and they are given 
the drug. Two weeks later, their heights are measured 
again. ‘lhe power of the test to detect an average 
increase in height of 1 inch could be increased by 


(a) using only volunteers from the basketball team in 
the experiment. 

(b) using a = 0.01 instead of a = 0.05. 

(c) using a = 0.05 instead of a = 0.01. 

(d) giving the drug to 25 randomly selected students 
instead of 50. 

(e) using a two-sided test instead of a one-sided test. 

T9.7 A95% confidence interval for a population mean su is 

calculated to be (1.7, 3.5). Assume that the conditions 
for performing inference are met. What conclusion 


can we draw for a test of Ho: 2 = 2 versus H,: 2 # 2 at 
the a = 0.05 level based on the confidence interval? 


(a) None. We cannot carry out the test without the 
original data. 


(b) None. We cannot draw a conclusion at the a = 
0.05 level because this test corresponds to the 97.5% 
confidence interval. 


(c) None. Confidence intervals and significance tests 
are unrelated procedures. 


(d) We would reject Ho at level a = 0.05. 
(e) We would fail to reject Hp at level a = 0.05. 


T9.8 Ina test of Hy:p = 0.4 against H,:p # 0.4, a random 
sample of size 100 yields a test statistic of z = 1.28. 
The P-value of the test is approximately equal to 


(a) 0.90. (ce) 0.05. (e) 0.10. 
(b) 0.40. (d) 0.20. 


T9.9 An SRS of 100 postal employees found that the 
average time these employees had worked at the 
postal service was 7 years with standard deviation 
2 years. Do these data provide convincing evi- 
dence that the mean time of employment yu for 
the population of postal employees has changed 
from the value of 7.5 that was true 20 years 
ago? To determine this, we test the hypotheses 
Ho: = 7.5 versus Hy: 4 # 7.5 using a one- 
sample t test. What conclusion should we draw at 
the 5% significance level? 


(a) There is convincing evidence that the mean time 
working with the postal service has changed. 


(b) There is not convincing evidence that the 
mean time working with the postal service has 
changed. 


(c) There is convincing evidence that the mean time 
working with the postal service is still 7.5 years. 


(d) There is convincing evidence that the mean time 
working with the postal service is now 7 years. 


(e) We cannot draw a conclusion at the 5% signifi- 
cance level. ‘lhe sample size is too small. 


T9.10 Are T'V commercials louder than their surround- 
ing programs? lo find out, researchers collected 
data on 50 randomly selected commercials in a 
given week. With the television’s volume at a fixed 
setting, they measured the maximum loudness of 
each commercial and the maximum loudness in 
the first 30 seconds of regular programming that 
followed. Assuming conditions for inference are 
met, the most appropriate method for answering 
the question of interest is 


(a) a one-proportion z test. 
(b 


) 

) a one-proportion z interval. 
(c) a paired t test. 
) 
) 


(d) a paired t interval. 
(e) None of these. 
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Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T9.11 


(b) 


T9.12 


A software company is trying to decide whether to 
produce an upgrade of one of its programs. Cus- 
tomers would have to pay $100 for the upgrade. 
For the upgrade to be profitable, the company 
needs to sell it to more than 20% of their custom- 
ers. You contact a random sample of 60 customers 
and find that 16 would be willing to pay $100 for 
the upgrade. 


Do the sample data give good evidence that more 
than 20% of the company’s customers are willing to 
purchase the upgrade? Carry out an appropriate test 
at the a = 0.05 significance level. 


Which would be a more serious mistake in this 
setting—a Type I error or a ‘Type II error? Justify 
your answer. 


Describe two ways to increase the power of the test 
in part (a). 

“T can’t get through my day without coffee” is a 
common statement from many students. Assumed 
benefits include keeping students awake during 
lectures and making them more alert for exams 
and tests. Students in a statistics class designed an 
experiment to measure memory retention with and 
without drinking a cup of coffee one hour before 

a test. This experiment took place on two different 
days in the same week (Monday and Wednesday). 
‘Ten students were used. Each student received no 
coffee or one cup of coffee one hour before the test 
ona particular day. The test consisted of a series of 
words flashed on a screen, after which the student 
had to write down as many of the words as possible. 
On the other day, each student received a different 
amount of coffee (none or one cup). 


(a) 


SS 


TOs 


One of the researchers suggested that all the 
subjects in the experiment drink no coffee before 
Monday’s test and one cup of coffee before Wednes- 
day’s test. Explain to the researcher why this is a bad 
idea and suggest a better method of deciding when 
each subject receives the two treatments. 


The data from the experiment are provided in the 
table below. Set up and carry out an appropriate test 
to determine whether there is convincing evidence 
that drinking coffee improves memory. 


Student No cup One cup 
1 24 25 
2 30 31 
3 22 23 
4 24 24 
5 26 27 
6 23 25 
i 26 28 
8 20 20 
9 27 27 

10 28 30 


A government report says that the average amount 
of money spent per U.S. household per week on 
food is about $158. A random sample of 50 house- 
holds in a small city is selected, and their weekly 
spending on food is recorded. ‘The sample data 
have a mean of $165 and a standard deviation of 
$20. Is there convincing evidence that the mean 
weekly spending on food in this city differs from the 
national figure of $158? 
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Comparing Two 
Populations or Groups 


Fast-Food Frenzy! 


More than $70 billion is spent each year in the drive-thru lanes of America’s fast-food restaurants. Having 
quick, accurate, and friendly service at a drive-thru window translates directly into revenue for the restau- 
rant. According to Jack Greenberg, former CEO of McDonald’, sales increase 1% for every six seconds 
saved at the drive-thru. So industry executives, stockholders, and analysts closely follow the ratings of 
fast-food drive-thru lanes that appear annually in QSR, a publication that reports on the quick-service 
restaurant industry. 

The 2012 QSR magazine drive-thru study involved visits to a random sample of restaurants in the 20 
largest fast-food chains in all 50 states. During each visit, the researcher ordered a modified main item (for 
example, a hamburger with no pickles), a side item, and a drink. If any item was not received as ordered, 
or if the restaurant failed to give the correct change or supply a straw and a napkin, then the order was 
considered “inaccurate.” Service time, which is the time from when the car stopped at the speaker to 
when the entire order was received, was measured each visit. Researchers also recorded whether or not 
each restaurant had an order-confirmation board in its drive-thru.! 


Here are some results from the 2012 OSR study: 
e For restaurants with order-confirmation boards, 1169 of 1327 visits (88.1%) resulted in accurate orders. 
For restaurants with no order-confirmation board, 655 of 726 visits (90.2%) resulted in accurate orders. 
¢ McDonald’s average service time for 362 drive-thru visits was 188.83 seconds with a standard 
deviation of 17.38 seconds. Burger King’s service time for 318 drive-thru visits had a mean of 201.33 
seconds and a standard deviation of 18.85 seconds. 
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ACTIVITY 


Introduction 


Which of two popular drugs— Lipitor or Pravachol —helps lower “bad cholesterol” 
more? Researchers designed an experiment, called the PROVE-IT Study, to find 
out. They used about 4000 people with heart disease as subjects. These individuals 
were randomly assigned to one of two treatment groups: Lipitor or Pravachol. At 
the end of the study, researchers compared the proportion of subjects in each group 
who died, had a heart attack, or suffered other serious consequences within two 
years. For those using Pravachol, the proportion was 0.263; for those using Lipitor, it 
was ().224.? Could such a difference have occurred purely by the chance involved in 
the random assignment? This is a question about comparing two proportions. 

Who studies more in college—men or women? Researchers asked separate 
random samples of 30 males and 30 females at a large university how many min- 
utes they studied on a typical weeknight. The females reported studying an aver- 
age of 165.17 minutes; the male average was 117.17 minutes. How large is the 
difference in the corresponding population means? This is a question about com- 
paring two means. 

Comparing two proportions or means based on random sampling or a random- 
ized experiment is one of the most common situations encountered in statisti- 
cal practice. In the PROVE-IT experiment, the goal of inference is to determine 
whether the treatments (Lipitor and Pravachol) caused the observed difference 
in the proportion of subjects who experienced serious consequences in the two 
groups. For the college studying survey, the goal of inference is to draw a conclu- 
sion about the actual mean study times for all women and all men at the university. 

The following Activity gives you a taste of what lies ahead in this chapter. 


Is Yawning Contagious? 


MATERIALS: 


Set of 50 index cards or 
standard deck of playing 
cards for each pair of 
students 


According to the popular TV show Mythbusters, the answer is “Yes.” The Myth- 
busters team conducted an experiment involving 50 subjects. Each subject was 
placed in a booth for an extended period of time and monitored by hidden cam- 
era. Thirty-four subjects were given a “yawn seed” by one of the experimenters; 
that is, the experimenter yawned in the subject’s presence before leaving the 
room. The remaining 16 subjects were given no yawn seed. 

What happened in the experiment? The table below shows 
the results’ 


Yawn Seed? 
Subject Yawned? Yes No Total 
Yes 10 4 14 
No 24 12 36 
Total 34 16 50 


Ten of the 34 subjects (29.4%) in the yawn-seed group yawned, 
compared to 4 of the 16 subjects (25.0%) in the no-yawn-seed 
group. The difference in the proportions who yawned for the 
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two groups is 10/34 — 4/16 = 0.044. Adam Savage and Jamie Hyneman, the co- 
hosts of MythBusters, used this difference as evidence that yawning is contagious. 
But is the evidence convincing? 

In this Activity, your class will investigate whether the results of the experiment 
were really statistically significant. Let’s see what would happen just by chance if 
we randomly reassign the 50 people in this experiment to the two groups (yawn 
seed and no yawn seed) many times, assuming the treatment received doesn’t affect 
whether or not a person yawns. 


1. We need 50 cards to represent the subjects in this study. In the MythBusters 
experiment, 14 people yawned and 36 didn’t. Because we’re assuming that the 
treatment received won’t change whether each subject yawns, we use 14 cards to 
represent people who yawn and 36 cards to represent those who don’t. 


¢ Using index cards: Write “Yes” on 14 cards and “No” on 36 cards. 


¢ Using playing cards: Remove the ace of spades and ace of clubs from the 
deck. All jacks, queens, kings, and aces represent subjects who yawn. All 
remaining cards represent subjects who don’t yawn. 


2. Shuffle and deal two piles of cards—one with 34 cards and one with 16 cards. 
The first pile represents the yawn-seed group and the second pile represents the 
no-yawn-seed group. The shuffling reflects our assumption that the outcome for 
each subject is not affected by the treatment. 

Calculate the difference in the proportions who yawned for the two groups 
(yawn seed — no yawn seed). For example, if you get 9 yawners in the yawn-seed 
group and 5 yawners in the no-yawn-seed group, the resulting difference in pro- 
portions is 


A negative difference would mean that a smaller proportion of people in the 
yawn-seed group yawned during the experiment than in the no-yawn-seed group. 
3. Your teacher will draw and label axes for a class dotplot. Plot the result you 
got in Step 2 on the graph. 

4. Repeat Steps 2 and 3 if needed to get a total of at least 40 repetitions of the 
simulation for your class. 

5. Based on the class’s simulation results, how surprising would it be to get a dif- 
ference in proportions of 0.044 (what the Mythbusters got in their experiment) 
or larger simply due to the chance involved in the random assignment? 


6. What conclusion would you draw about whether yawning is contagious? Explain. 


Here is an example of what the class dotplot in the Activity might look like 
after 100 trials. In this simulation, 50 of the 100 trials (in red) produced 
a difference in proportions of at least 0.044, so the approximate P-value 
is 0.50. It is very likely that a difference this big could occur just due 
to the chance variation in random assignment! This result is not statisti- 
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03 cally significant and does not provide convincing evidence that yawning 
is contagious. 
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| 10.1 Comparing Two Proportions 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 
e Describe the shape, center, and spread of the sampling e Construct and interpret a confidence interval to 


distribution of 6; — po. compare two proportions. 


Determine whether the conditions are met for doing e Perform a significance test to compare two proportions. 
inference about p,; — Po. 


Suppose we want to compare the proportions of individuals with a certain charac- 
teristic in Population | and Population 2. Let’s call these parameters of interest p; 
and 2. The ideal strategy is to take a separate random sample from each popula- 
tion and to compare the sample proportions f; and > with that characteristic. 

What if we want to compare the effectiveness of Treatment | and Treatment 2 
in a completely randomized experiment? This time, the parameters p, and p2 
that we want to compare are the true proportions of successful outcomes for each 
treatment. We use the proportions of successes in the two treatment groups, 
and f, to make the comparison. 

Here’s a table that summarizes these two situations: 


Population or treatment Parameter Statistic Sample size 
1 Dr D, n 
2 Po Po M 


We compare the populations or treatments by doing inference about the differ- 
ence Pp; — p2 between the parameters. The statistic that estimates this difference 
is the difference between the two sample proportions, fp; — f2. To use fp; — p> for 
inference, we must know its sampling distribution. 


The Sampling Distribution of a Difference 
between Two Proportions 


To explore the sampling distribution of /; — fz, let’s start with two populations 
having a known proportion of successes. Suppose that there are two large high 
schools, each with over 2000 students, in a certain town. At School 1, 70% of 
students did their homework last night. Only 50% of the students at School 2 did 
their homework last night. The counselor at School | takes an SRS of 100 stu- 
dents and records the proportion /; that did the homework. School 
2’s counselor takes an SRS of 200 students and records the propor- 
tion f> that did the homework. What can we say about the difference 
pf — f2 in the sample proportions? 

We used Fathom software to take an SRS of n; = 100 students from 
School | and a separate SRS of n2 = 200 students from School 2 and 
to plot the values of 6), 62, and f; — f2 from each sample. Our first 
set of simulated samples gave pf; = 0.68 and f2 = 0.505, so dots 
were placed above each of those values in Figure 10.1(a) and (b). 


(a) Approximate sampling 


oo OO SSS eee 
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(b) Approximate sampling (c) Approximate sampling 
distribution of p. distribution of p, — p, 


Bas o oO 


0.85 
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phat2 diffprop 


FIGURE 10.1 Simulated sampling distributions of (a) the sample proportion p, of successes 
in 1000 SRSs of size n, = 100 from a population with p, = 0.70, (b) the sample proportion 
p> of successes in 1000 SRSs of size np = 200 from a population with p» = 0.50, and (c) the 
difference in sample proportions 6, — p> for each of the 1000 repetitions. 


The difference in the sample proportions for this first set of samples is fp; — pz 
= 0.68 — 0.505 = 0.175. A dot for this value appears in Figure 10.1(c). The 
three dotplots in Figure 10.1 show the results of repeating this process 1000 times. 
These are the approximate sampling distributions of fj, 62, and p; — pz. 

In Chapter 7, we saw that the sampling distribution of a sample proportion f 
has the following properties: 


Shape: Approximately Normal if np = 10 and n(1 — p) = 10 
Center: jug = p 


_ 
Spread: of yee ifn Ss aN 


For the sampling distributions of f; and fh in this case: 


Sampling distribution of p, Sampling distribution of p, 


Shape = Approximately Normal; 7,0, = 100(0.70) = 70 Approximately Normal; np. = 200(0.50) = 100 


= 10 and n,(1 — p;) = 100(0.30) = 30 = 10 = 10 and n,(1 — p.) = 200(0.50) = 100 = 10 


Center = jus, = Pp; = 0.70 Lp, = Po = 0.50 
pi(1 — pi) —— = — pr) see 
= = = 0.04 = 7 = 0.0354 
Spread a5, F j 100 0.0458 =a, s 500 0.035 
because School 1 has a population of over because School 2 has a population of over 
10(100) = 1000 students. 10(200) = 2000 students. 


The approximate sampling distributions in Figures 10.1(a) and (b) give similar 
results. 

What about the sampling distribution of f; — f? Figure 10.1(c) suggests that it 
has an approximately Normal shape, is centered at about 0.198, and has standard 
deviation about 0.0572. The shape makes sense because we are combining two 
independent random variables, f; and f2, that have approximately Normal distri- 
butions. How about the center? The true proportion of students who did last night’s 
homework at School 1 is p; = 0.70 and at School 2 is pz = 0.50. We expect the 
difference f; — f2 to center on the actual difference in the population proportions, 
pi — p2 = 0.70 — 0.50 = 0.20. The spread, however, is a bit more complicated. 
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How can we find formulas for the mean and standard devia- 

tion of the sampling distribution of 6; — 62? Both f; and f; are ran- 

dom variables. That is, their values would vary in repeated independent SRSs of size 

nj; and nz. Independent random samples yield independent random variables fp; and 

pz. The statistic  — fz is the difference of these two independent random variables. 
In Chapter 6, we learned that for any two random variables X and Y, 


Lx-y — Hx ~ by 
For the random variables f; and f2, we have 
Lp —p, = Mp, — Lp, = Pi — P2 
In the school homework survey, 
Mg,—§ = Pi — p2 = 0.70 — 0.50 = 0.20 
We also learned in Chapter 6 that for independent random variables X and Y, 
te <3 = ox + oy 


For the random variables f; and f2, we have 


pill — pi)\* pl —p2)\? pil —- pi) — p2(l — pr) 
Of, o% + on = ( a) + ( tg ) ~ ae 


ny n2 
p 1 - p p 1 - p 
So ory aa i( ) 2( 2) 


NY n2 


In the school homework survey, 


pill — pi) , pol —p2) _ /0.7(0.3) 
ae moO = 100 


This is similar to the result from the Fathom simulation. 


0.5(0.5) 


700. 0.058 


Here are the facts we need. 


THE SAMPLING DISTRIBUTION OF fp, — pp | 
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When conditions are met, the sampling distribution of f;—f will be 
approximately Normal with mean juj,-,, = Pp — pz and standard deviation 


(- = 
Off, - [2 Pd pal! s po). Figure 10.2 displays this distribution. 


The formula for the standard deviation of the sampling distribution involves 
the unknown parameters p, and /2. Just as in Chapters 8 and 9, we must replace 
these by estimates to do inference. And just as before, we do this a bit differently 
for confidence intervals and for tests. We'll get to inference short- 
ly. For now, let’s focus on the sampling distribution of fp; — 2. 


Standard deviation 


\ ae =Py a = 


FIGURE 10.2 Select independent SRSs from two populations having pro- 
portions of successes p, and p>. The proportions of successes in the two 
samples are 6; and #..When the samples are large, the sampling distribution 
~— Values of p, - p) — of the difference 6, — fp. is approximately Normal. 


Yummy Goldfish! 


Describing the sampling distribution of 6, — pe 


Your teacher brings two bags of colored goldfish crackers to class. Bag 1 
has 25% red crackers and Bag 2 has 35% red crackers. Each bag contains 
more than 1000 crackers. Using a paper cup, your teacher takes an SRS 
of 50 crackers from Bag | and a separate SRS of 40 crackers from Bag 2. 
Let fp; — f2 be the difference in the sample proportions of red crackers. 


PROBLEM: 

(a) What is the shape of the sampling distribution of p, — p2 ? Why? 

(b) Find the mean of the sampling distribution. Show your work. 

(c) Find the standard deviation of the sampling distribution. Show your work. 


SOLUTION: 

(a) Because np, = 50(0.25) = 12.5, 2,(1 — p,) = 50(0.75) = 37.5, 

fap. = 40(0.35) = 14, and n2(1 — p2) = 40(0.65) = 26 are all at least 10, the 
sampling distribution of p, — p. is approximately Normal. 

(b) The meanis jp —p, = pi — p2 = 0.25 — 0.55 = —0.10. 

(c) Because there are at least 10(50) = 500 crackers in Bag 1 and 10(40) = 400 crackers 
in Bag 2, the standard deviation is 


_ (a — pr) pol1 = po) _ {eee _ 0.35(0.65) 
mh 50. ~—«AO 


= 0.0971 


For Practice Try Exercise 
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Confidence Intervals for p, — p2 


When data come from two independent random samples or two groups in a ran- 
domized experiment (the Random condition), the statistic p; — fz is our best 
guess for the value of p; — pz. We can use our familiar formula to calculate a 
confidence interval for p; — pa: 


statistic + (critical value) - (standard deviation of statistic) 


When the 10% condition is met, the standard deviation of the statistic ; — fz is 


_ foitl= pi), poll = p2) 
Of-f:— n as 


12 


If the Large Counts condition is met, we find the critical value z* for the given 
confidence level from the standard Normal curve. 


CONDITIONS FOR CONSTRUCTING A CONFIDENCE INTERVAL 
ABOUT A DIFFERENCE IN PROPORTIONS 


Because we don’t know the values of the parameters p; and p2, we replace them 
in the standard deviation formula with the sample proportions. The result is the 
standard error (also called the estimated standard deviation) of the statistic p, — po: 


i ee — — pi) = p21 — fr) 
Pi~p2 ny 


nz 


This value tells us how far the difference in sample proportions will typically be 
from the difference in population proportions if we repeat the random sampling 
or random assignment many times. 

When the conditions are met, our confidence interval for p; — 2 is therefore 


statistic + (critical value) - (standard deviation of statistic) 


seas aaa 
(hi- fay 2 2[PO PB Pe 


nz 


This is often called a two-sample z interval for a difference between two 
proportions. 
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TWO-SAMPLE z INTERVAL FOR A DIFFERENCE 
BETWEEN TWO PROPORTIONS 
When the conditions are met, an approximate C% confidence interval for 


pi =< po 1S 
ian rae 
(=p) = zn) a Pv AP poll ~ Pr) 


nz 


where z* is the critical value for the standard Normal curve with C% of its 
area between —z* and z*. 


The following example shows how to construct and interpret a confidence in- 
terval for a difference in proportions. As usual with inference problems, we follow 
the four-step process. Because you are expected to include these four steps when- 
ever you construct a confidence interval or perform a significance test, we will 
limit our use of the four-step icon to examples from this point forward. 


Teens and Adults on Social tee 
Networking Sites L. 


Confidence interval for p; — P2 


As part of the Pew Internet and American Life Project, researchers 
conducted two surveys in 2012. The first survey asked a random 
sample of 799 U.S. teens about their use of social media and the 
Internet. A second survey posed similar questions to a random sam- 
ple of 2253 US. adults. In these two studies, 80% of teens and 69% 


of adults used social-networking sites. 


PROBLEM: 


(a) Calculate the standard error of the sampling distribution of the difference in the 
sample proportions (teens — adults). What information does this value provide? 


(b)Construct and interpret a 95% confidence interval for the difference between the 
proportion of all U.S. teens and adults who use social-networking sites. 


SOLUTION: 


(a) The sample proportions of teens and adults who use social-networking sites are p, = 0.80 and 
p. = 0.69, respectively. The standard error of the sampling distribution of p, — pris 


ae ‘| Pall — pa) pol1 — po) _ p cena _ 0.69(0.31) 
ae mt 799 2258 
If we were to take many random samples of 799 teens and 2253 adults, the difference in the sample 


proportions of teens and adults who use social-networking sites will typically be 0.0172 from the 
true difference in proportions of all teens and adults who use social-networking sites. 


= 0.0172 


(b) STATE: Our parameters of interest are p, = the proportion of all U.S. teens who use social- 
networking sites and p, = the proportion of all U.S. adults who use social-networking sites. We want 
to estimate the difference p, — p. ata 95% confidence level. 
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PLAN: We should use a two-sample zinterval for p; — p2 ifthe conditions are met. 

* Random: The data come from independent random samples of 799 U.S. teens and 2253 U.S. adults. 
0 10%: The researchers are sampling without replacement, so we must check the 10% condition: 
there are at least 10(799) = 7990 U.S. teens and at least 10(2253) = 22,530 U.S. adults. 

° Large Counts: We check the counts of “successes” and “failures”: 

4p, = 799(0.80) = 639.2 — 639 n4(1 — p,) = 799(1 — 0.80) = 159.8 — 160 

Nop, = 2253(0.69) = 1554.57 — 1555 n,(1 — p,) = 2253(1 — 0.69) = 698.43 — 698 

Note that the observed counts have to be whole numbers! Because all four values are at least 10, 

this condition is met. 


DO: Weknow that n, = 799, p; = 0.80, tg = 2253, and p, = 0.69. Fora 95% confidence 
level, the critical value is z* = 1.96. So our 95% confidence interval for p, — pis 


p(1—p:) poll —p 0.80(0.20)  0.69(0.31 
(6 — py) = ot fl, PAP 9.80 — 0.69) + 1.96 2s cee 
pip 
Ny No 799 2253 


= 0.11 + 1.96(0.0172) 


= 0.11 + 0.034 

= (0.076, 0.144) 
This interval suggests that more Using technology: Refer to the Technology Corner that follows the example. The calculator’s 
teens than adults in the United 2-PropZInt gives (0.07588, 0.14324). 
States engage in social networking CONCLUDE: Weare 95% confident that the interval from 0.07586 to 0.14324 captures the 
by between about 7.6 and 14.3 P 
percentage points. true difference in the proportion of all U.S. teens and adults who use social-networking sites. 


For Practice Try Exercise 9 | 


The researchers in the previous example selected independent random sam- 
ples from the two populations they wanted to compare. In practice, it’s common 
to take one random sample that includes individuals from both populations of 
interest and then to separate the chosen individuals into two groups. The two- 
sample z procedures for comparing proportions are still valid in such situations, 
provided that the two groups can be viewed as independent samples from their re- 
spective populations of interest. 

You can use technology to perform the calculations in the “Do” step. Remember 
that this comes with potential benefits and risks on the AP® exam. 


CONFIDENCE INTERVAL FOR A DIFFERENCE 


*|coRNER IN PROPORTIONS 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


The TI-83/84 and T1-89 can be used to construct a confidence interval for p; — pz. We'll demonstrate using the previ- 
ous example. Of n; = 799 teens surveyed, X = 639 said they used social-networking sites. Of nz = 2253 adults surveyed, 
X = 1555 said they engaged in social networking. ‘To construct a confidence interval: 
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TI-83/84 TI-89 
Press [STAT], then choose TESTS and e In the Statistics/List Editor, press ({F7]) 


P= Poy OVA ILI, 


and choose 2-PropZInt. 


When the 2-PropZInt screen appears, enter the values shown. 


ni:799 

x2: 1555 

n2: 2253 
C-Level: .95 
Calculate 


Highlight “Calculate” and press [ENTER |. 


NORMAL FLOAT AUTO REAL RADIAN CL fl i eereerercner 
2-Prerortion 2 Interval 


; 
(. 07588, .14324) ; ={.0759s.14323 


B1=. 7997496871 
i2=. 6901908566 
n1=799 

nz=2253 


poe eae 
é-Froportion < Interval 


x1:639 — i] successes xt: 

mt 

Successess xt fis SCSC~™dS 

ne: 

Clavel ) SSS 

CEsc=cnNeel » 
= 


FUNC 


=.105559 
=.033685 
=.79875 
=.650191 
=753. 
=2253. 


AP® EXAM TIP The formula for the two-sample z interval for p, —p. often leads to calculation 
errors by students. As a result, we recommend using the calculator’s 2- PropZInt feature 
to compute the confidence interval on the AP® exam. Be sure to name the procedure (two- 
proportion z interval) and to give the interval (0.076, 0.143) as part of the “Do” step. 


CHECK YOUR UNDERSTANDING 


Are teens or adults more likely to go online daily? The Pew Internet and American Life 
Project asked a random sample of 799 teens and a separate random sample of 2253 adults 
how often they use the Internet. In these two surveys, 63% of teens and 68% of adults 
said that they go online every day. Construct and interpret a 90% confidence interval for 


Pi — p2. 


Significance Tests for p; — p» 


An observed difference between two sample proportions can reflect an actual dif- 
ference in the parameters, or it may just be due to chance variation in random 
sampling or random assignment. Significance tests help us decide which explana- 
tion makes more sense. 

The null hypothesis has the general form 


Ho:p1 — p2 = hypothesized value 
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We'll restrict ourselves to situations in which the hypothesized difference is 0.'Then 
the null hypothesis says that there is no difference between the two parameters: 


Ho: Pp — p2 = 0 or, alternatively, Ho: p; = p2 
The alternative hypothesis says what kind of difference we expect. 


Hungry Children 
Stating hypotheses 


Researchers designed a survey to compare the proportions of children 
who come to school without eating breakfast in two low-income el- 
ementary schools. An SRS of 80 students from School 1 found that 19 
had not eaten breakfast. At School 2, an SRS of 150 students included 
26 who had not had breakfast. More than 1500 students attend each 
school. Do these data give convincing evidence of a difference in the 
population proportions? 


PROBLEM: State appropriate hypotheses for a significance test to answer this 
question. Define any parameters you use. 


SOLUTION: We should carry out a test of 
Ho: py — p2 = O 
H,: Pi az P2 #0 


where p, = the true proportion of students at School 1 who did not eat breakfast and p. = the true 
proportion of students at School 2 who did not eat breakfast. 


For Practice Try Exercise 


The conditions for performing a significance test about p; — pz are the same as 
for constructing a confidence interval. 


CONDITIONS FOR PERFORMING A SIGNIFICANCE 
TEST ABOUT A DIFFERENCE IN PROPORTIONS 
e Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 
° 10%: When sampling without replacement, check that n; = —N 1 
1 
and in = 70.82 


e Large Counts: The counts of “successes” and “failures” in each sample 
or group—)fj, 2\(1 — py), n2p2, n2(1 — pz) —are all at least 10. 
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If the conditions are met, we can proceed with calculations. To do a test, stan- 
dardize fp, — p> to get az statistic: 


statistic — parameter 


test statistic = = —_ 
standard deviation of statistic 


_ (pi — p2) — 0 


= ae ae 
standard deviation of statistic 


If Ho: p) = pz is true, the two parameters are the same. We call their common 
value p. But now we need a way to estimate p, so it makes sense to combine the 
data from the two samples as if they came from one larger sample. This pooled (or 
combined) sample proportion is 

count of successes in both samples combined =X, + X. 
count of individuals in both samples combined —n; + 12 


po= 
In other words, fc gives the overall proportion of successes in the combined 
samples. 

Let’s look at how to calculate fc in the hungry children example. The two-way 
table below summarizes the survey data. We have combined the independent 
SRSs from the two schools in the right-hand Total column. 

Because researchers want to compare the proportions of students 


School at School 1 and School 2 who have not eaten breakfast, we treat 
Breakfast? 1 2 Total =the individuals in the “No” row as successes. It is easy to see from 
No 19 26 45 the table that the overall proportion of successes in the combined 
Yes 61 124 185 ees : . 
samples is fc = ==~ = 0.1957. We can also get this result usin 
Total 80 150 230 : Pe ™ 730 a 


We can use a little algebra to rewrite 
the denominator of the test statistic: 


The final formula looks like the one 
given on the AP® exam formula sheet. 


the formula above: 


. _Xi+X_ 19+26 45 


= = =——=(0] 
San tng 804150 230 wae 


Recall that the standard deviation of fp; — fz is 


- [20 7 pi), poll = po) 
Op, fp. — ny T 


"2 


Use fc in place of both p, and /2 in this expression for the denominator of the 
test statistic: 


When the Large Counts condition is met, this will yield a z statistic that 
has approximately the standard Normal distribution when Hp is true. Here 
are the details for the two-sample z test for the difference between two 
proportions. 
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Some people prefer to use fr to check TWO-SAMPLE z TEST FOR THE DIFFERENCE 
the Large Counts condition. If the BETWEEN TWO PROPORTIONS 
expected counts 1,f¢, m(1 — Bc), 


Mfc, and n,(1 — fic) are all at least - . 
10, the sampling distribution of f; — 6. | Suppose the conditions are met. To test the hypothesis Hp: p; — p2 = 0, first 


is approximately Normal. find the pooled proportion fg of successes in both samples combined. Then 
Checking the observed counts of compute the z statistic 
successes and failures is more 
conservative, as the expected counts (p = 2) = (0 
will always be at least 10 if the Be ae = = = 
observed counts are at least 10. J bc — fc) rh pc(1 — pc) 
ny n2 


Find the P-value by calculating the probability of getting a z statistic this large 
or larger in the direction specified by the alternative hypothesis H,: 


H,:p,—p2>0 H,:p,—p2<90 A, :p,—p2.#0 
z & -|z| [Z| 


Now we can finish the test we started earlier. 


Hungry Children 
Significance test for p; — P2 L. 


Researchers designed a survey to compare the proportions of children who come 
to school without eating breakfast in two low-income elementary schools. An SRS 
of 80 students from School | found that 19 had not eaten breakfast. At School 2, an 
SRS of 150 students included 26 who had not had breakfast. More than 1500 stu- 
dents attend each school. Do these data give convincing evidence at the a = 0.05 
level of a difference in the population proportions? 


STATE: Our hypotheses are 
Ho: pi — p2 = 0 
Hz py — po #O 
where p, = the true proportion of students at School 1 who did not eat breakfast and p. = the true 
proportion of students at School 2 who did not eat breakfast. 
PLAN: If conditions are met, we should perform a two-sample ztest for p, — po. 


* Random: The data were produced using two independent random samples—80 students from 
School 1 and 150 students from School 2. 


° 10%: The researchers are sampling without replacement, so we check the 10% condition: there are 
at least 10(80) = 800 students at School 1 and at least 10(150) = 1500 students at School 2. 
¢ Large Counts: We check the counts of “successes” and “failures”: 


MP, — 19: m(1 = P) = 61, Moo = 26, fo(1 oa Po) = 124 


All four values are at least 10, so this condition is met. 
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19 26 
DO: Weknowthat n, = 80, p, = a = 0.2375, my = 150, and pp = 7150 = 0.1733. Our point 
estimate for the difference in population proportionsis p; — pz = 0.2375 — 0.1733 = 0.0642. 


School 
=e The pooled proportion of students who didn’t eat breakfast in the two samples is 
Breakfast? 1 (2 Total 
No 1925 pees LO To) SPE et 
Yes 61 124 185 60+ 150 230 
ie ee, See the two-way table in the margin for confirmation. 
° Test statistic 
(P1 — P2) — 0 0.0642 — 0 ae 
zi => = — 
(2 — Pd , Bell — Po) a — 0.1957) — 0.1957(1 — 0.1957) 
fo 80 | 150 


* P-value Figure 10.3 displays the P-value as an area under the standard Normal curve for this 
two-tailed test. Using Table A or normal cdf, the desired P-value is 2AZ= 1.17) = 
2(1 — 0.8790) = 0.2420. 


Using technology: Refer to the Technology Corner that follows the example. The calculator’s 
2-PropZTest gives z= 1.1683 and P-value = 0.2427. 


Area = 0.2420 


FIGURE 10.3 The P-value for the Z=-11F Z=11F 
two-sided test. 


CONCLUDE: Because our P-value, 0.2427, is greater than a = 0.05, we fail to reject Hp. There 
is not convincing evidence that the true proportions of students at the two schools who didn’t eat 
breakfast are different. 


For Practice Try Exercise 


Exactly what does the P-value in the previous example tell us? If we repeated the 

random sampling process many times, we’d get a difference in sample proportions 

: as large as or larger than 0.0642 in either direction about 24% of the time when Ho: 
Z interval for the difference between oe aw : . as : f ae 

two proportions don't always give pi — p2 = 0 is true. With such a high probability of getting a result like this just by 

consistent results. That’s because the Chance when the null hypothesis is true, we don’t have enough evidence to reject Hp. 


The two-sample z test and two-sample 


“standard deviation of the statistic” We can get additional information about the difference between the popula- 
used in calculating the test statistic is tion proportions at School | and School 2 with a confidence interval. The TT-84’s 
pal — Ba) boll — Bo 2-PropZInt gives the 95% confidence interval for p, — pz as (—0.047, 0.175). 

J mh Oo That is, we are 95% confident that the difference in the true proportions of stu- 


dents who ate breakfast at the two schools is between 4.7 percentage points lower 

at School | and 17.5 percentage points higher at School 1. This is consistent with 

ve (1 = pr) , Bolt = by) our “fail to reject Ho” conclusion in the example because 0 is included in the 
nm Ne interval of plausible values for p; — pp. 


but for the confidence interval, it’s 
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SIGNIFICANCE TEST FOR A DIFFERENCE 
22.\TECHNOLOGY 1) bROPORTIONS 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


The TI-83/84 and TI-89 can be used to perform significance tests for comparing two proportions. Here, we use the data 
from the hungry children example. To perform a test of Ho: p; — p2 = 0 versus H,:p; — p2 # 0: 


TI-83/84 TI-89 
e =Press|stat], then choose TESTS and e In the Statistics/List Editor, press [2nd][F1] ({F6]) 
2-PropZTest. and choose 2-PropZTest. 


e When the 2-PropZTest screen appears, enter the values x} = 19, n) = 80, x2 = 26, nz = 150. Specify the alter- 
native hypothesis p; # p2, as shown. 


NORMAL FLOAT AUTO REAL RADIAN CL 


2-PropZTest 


Ata Z-PFOPortion 2 Test ae 
Successes id: 


o 


xb na: 
ni:8@ 
x2: 26 Successess 22! 
n2:150 


nz: 
Riternate ve: EERE 
Resuits: 


p1:EEW <p2 >p2 
Color: EMS 
Calculate Draw 


ee 
USE € AND > TO OFEN CHOICES 


e Ifyou select “Calculate” and press [ENTER], you will see that the test statistic is z = 1.168 and the P-value is 0.2427. 
Do you see the combined proportion of students who didn’t eat breakfast? It’s the value labeled f, 0.1957. 


NORMAL FLOAT AUTO REAL RADIAN CL f 


7] pa tpz 
P1*P2 z 
z=1.168347138 F Value 
P=, 2426668816 
B1=. 2375 
6 2=. 1733333333 
p=. 1956521739 
n1=80 
n2=150 


MAIN RAD AUTO FUNC af? 


e Ifyou select the “Draw” option, you will see the screen shown here. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


2-Prop2Test 
2=1.1683 p=.242? MAIN RAD AUTO FUNC 


AP® EXAM TIP The formula for the two-sample z statistic for a test about p, — p» often 
leads to calculation errors by students. As a result, we recommend using the calculator’s 


2-PropZTest feature to perform calculations on the AP® exam. Be sure to name the 
procedure (two-proportion z test) and to report the test statistic (7 = 1.17) and P-value 
(0.2427) as part of the “Do” step. 
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Inference for Experiments 


Most of the examples in this section have involved doing inference about p; — p2 
using data that were produced by random sampling. In such cases, the param- 
eters p; and p2 are the true proportions of successes in the corresponding pop- 
ulations. However, many important statistical results come from randomized 
comparative experiments. Defining the parameters in experimental settings is 
more challenging. 

The “Is Yawning Contagious?” Activity on page 610 describes an experiment 
that used 50 volunteer adults as subjects. Researchers randomly assigned 34 sub- 
jects to get a yawn seed and 16 subjects to get no yawn seed. Then researchers 
compared the proportions of people in the two groups who yawned. The param- 
eters in this setting are: 


pi = the true proportion of people like these who would yawn when given a yawn seed 


p2 = the true proportion of people like these who would yawn when no yawn seed is given 


Most experiments on people use recruited volunteers as subjects. When sub- 
jects are not randomly selected, researchers cannot generalize the results of an 
experiment to some larger populations of interest. But researchers can draw cause- 
and-effect conclusions that apply to people like those who took part in the experi- 
ment. This same logic applies to experiments on animals or things. Also note that 
unless the experimental units are randomly selected, we don’t need to check the 
10% condition when performing inference about an experiment. 

Here is an example that involves comparing two proportions. 


Cholesterol and Heart Attacks ut 


Significance test in an experiment 


High levels of cholesterol in the blood are associated with higher risk of heart 
attacks. Will using a drug to lower blood cholesterol reduce heart attacks? 
The Helsinki Heart Study recruited middle-aged men with high cholesterol 
but no history of other serious medical problems to investigate this question. 
The volunteer subjects were assigned at random to one of two treatments: 
2051 men took the drug gemfibrozil to reduce their cholesterol levels, and 
a control group of 2030 men took a placebo. During the next five years, 56 
men in the gemfibrozil group and 84 men in the placebo group had heart 
attacks. Is this difference statistically significant at the a = 0.01 level? 


STATE: We hope to show that gemfibrozil reduces heart attacks, so we have a one-sided 
alternative: 


Heir = f= 0 or, equivalently, Hot Pa = Pa 
H,: py — p2 <O He: pr < pr 
where p, is the actual heart attack rate for middle-aged men like the ones in this study who 


take gemfibrozil, and p2 is the actual heart attack rate for middle-aged men like the ones in 
this study who take only a placebo. We'll use a = 0.01. 
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Note that we did not need to check PLAN: Ifconditions are met, we will do a two-sample z test for p, — po. 
the 10% condition because the : F : 
subjects in the experiment were not © ° Raftdom: The data come from two groups ina randomized experiment. 


sampled without replacement from ° 10%: Don't need to check because there was no sampling. 
Some larger population. * Large Counts: The number of successes (heart attacks!) and failures in the two groups are 56, 
1995, 54, and 1946. These are all at least 10, 50 this condition is met. 


DO: The proportions of men who had heart attacks in each group are 


56 84 
pi= moe a 0.0273 (gemfibrozil group) and p 2 = 2030 ~ 0.0414 (placebo group) 
Drug taken 
Heart  Gemfibrozil Placebo Tota © /" pooled proportion of heart attacks for the two groups is 
attack? 
ount of heart attacks in both samples combined 
iS se a el é i ie cc 56 + B84 BOO D haus 
No 1995 1946 3941 count of subjects in both samples combined 2051 + 2030 4081 
Uae ul ae) See the two-way table in the margin. 


We'll use the calculator’s 2 - PropZTest to perform calculations. 


a aeeetnrennt 
¢ P-value Thisis the area under the standard Normal 


curve to the left of z= — 2.47, shown in Figure 10.4. pip BEE 


2 
Z=-2.470088266 
p=. 0067539941 
B1=. 0273037543 
62=. 0413793103 
6=. 0343053173 


ni=2051 


n2=2030 


Area = 0.0068 


CONCLUDE: Because the P-value, 0.0068, is less than 
0.01, we can reject Ho. The results are statistically significant at 

y the « = 0.01 level. There is convincing evidence of a lower heart 
J attack rate for middle-aged men like these who take gemfibrozil 


epeesid than for those who take only a placebo. 


FIGURE 10.4 The P-value for the one-sided test. For Practice Try Exercise 


We chose a = 0.01 in the example to reduce the chance of making a Type I 
error —finding convincing evidence that gemfibrozil reduces heart attack risk 
when it actually doesn’t. This error could have serious consequences if an 
ineffective drug was given to lots of middle-aged men with high cholesterol! 

The random assignment in the Helsinki Heart Study allowed researchers to 
draw a cause-and-effect conclusion. They could say that gemfibrozil reduces 
the rate of heart attacks for middle-aged men like those who took part in the 
experiment. Because the subjects were not randomly selected from a larger 
population, researchers could not generalize the findings of this study any 
further. No conclusions could be drawn about the effectiveness of gemfibro- 
zil at preventing heart attacks for all middle-aged men, for older men, or for 
women. 
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THINK Why do the inference methods for random sampling work for 
randomized experiments? Confidence intervals and tests for p) — p2 
ABOUT IT are based on the sampling distribution of f; — f2. But in experiments, we aren’t 
sampling at random from any larger populations. We can think about what would 
happen if the random assignment were repeated many times under the assump- 
tion that Ho: p; — pz = 0 is true. That is, we assume that the specific treatment 

received doesn’t affect an individual subject’s response. 

Let’s see what would happen just by chance if we randomly reassign the 4081 
subjects in the Helsinki Heart Study to the two groups many times, assuming the 
drug received doesn’t affect whether or not each individual has a heart attack. We 
used Fathom software to redo the random assignment 500 times. The approxi- 
mate randomization distribution of 6; — fz is shown in Figure 10.5. It has an 
approximately Normal shape with mean 0 and standard deviation 0.0058. These 
are roughly the same as the shape, center, and spread of the sampling distribution 
of fp; — pz that we used to perform calculations in the previous example because 


Pc — fc) . Pc — fe) reed — 0.0343) — 0.0343(1 — 0.0343) 
+ = + =0. 
| 7 nz 2051 2030 nae 
In 500 random In the Helsinki Heart Study, the difference in the 
ea iar bean proportions of subjects who had a heart attack in the 


a oases panne gemfibrozil and placebo groups was 0.0273 — 0.0414 = 
small as or smaller than —0.0141. How likely is it that a difference this large or 
eee larger would happen just by chance when Hp is true? 
Figure 10.5 provides a rough answer: 5 of the 500 ran- 
dom reassignments yielded a difference in proportions 
less than or equal to —0.0141. That is, our estimate of 
the P-value is 0.01. This is quite close to the 0.0068 

6020 BOIS 0010 0005 0 0.005 0010 0015 0.020 P-value that we calculated in the previous example. 
Figure 10.6 shows the value of the z test statistic for 
each of the 500 re-randomizations, calculated using 

our familiar formula 


Center: Mean = 0 


were only 5 times when 
eet SD = 0.0058 


Shape: Approximately = 


FIGURE 10.5 Fathom simulation showing the approximate ran- 
domization distribution of 6; — > from 500 random reassignments 
of subjects to treatment groups in the Helsinki Heart Study. 


Standard 
Normal curve 


The standard Normal density curve is shown in blue. We 
can see that the z test statistic has approximately the standard 
Normal distribution in this case. 

Whenever the conditions are met, the randomization dis- 
tribution of f; — pz looks much like its sampling distribution. 
We are therefore safe using two-sample z procedures for com- 
paring two proportions in a randomized experiment. 


3 2 4 0 1 2 3 FIGURE 10.6 The distribution of the z test statistic for the 
Zz 500 random reassignments in Figure 10.5. 


_Aoe———_A_,_——— ooo 8 
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CHECK YOUR UNDERSTANDING 


To study the long-term effects of preschool programs for poor children, researchers de- 
signed an experiment. They recruited 123 children who had never attended preschool 
from low-income families in Michigan. Researchers randomly assigned 62 of the children 
to attend preschool (paid for by the study budget) and the other 61 to serve as a control 
group who would not go to preschool. One response variable of interest was the need for 
social services as adults. Over a 10-year period, 38 children in the preschool group and 49 
in the control group have needed social services.* 

Does this study provide convincing evidence that preschool reduces the later need for 
social services? Justify your answer. 


Summary 


e¢ Choose independent SRSs of size n; from Population | with proportion of 
successes Pp; and of size 2 from Population 2 with proportion of successes p2. 
The sampling distribution of f; — fp has the following properties: 


e Shape Approximately Normal if the samples are large enough that 7p, 
ny(1 — p1), nzp2, and n (1 — p2) are all at least 10. 


¢ Center The mean is p; — p>. 
e Spread As long as each sample is no more than 10% of its population, 
eae) 


n2 


ee 
the standard deviation is , pul - Pi 
1 


e Confidence intervals and tests to compare the proportions fp; and p2 of suc- 
cesses for two populations or treatments are based on the difference pf; — f2 
between the sample proportions. 


e Before estimating or testing a claim about p; — p2, check that these condi- 
tions are met: 


e Random: The data come from two independent random samples or 
from two groups in a randomized experiment. 


© 10%:When sampling without replacement, check that the two pop- 
ulations are at least 10 times as large as the corresponding samples. 


e Large Counts: The counts of “successes” and “failures” in each sample 
or group —nif1, n\(1 — fi), n2p2, and nz (1 — pz) —are all at least 10. 


e When conditions are met, an approximate C% confidence interval for 


Pi p2 1S 
; : Tal R mal - 
(f ; j 2 ae - fs pi) pr p ) 


nz 


where z* is the standard Normal critical value with C% of its area between 
—z* and z*. This is called a two-sample z interval for p; — po. 


e Significance tests of Hp:p; — p2 = 0 use the pooled (combined) sample 
proportion in the standard error formula: 


count of successes in both samples combined =X, + X, 


IS 


© count of individuals in both samples combined np tm 
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When conditions are met, the two-sample z test for p; — p2 uses the test statistic 
(fi — p2) — 0 
ce = Pe) , boll = bo) 


nN) nz 


with P-values calculated from the standard Normal distribution. 


e Inference about the difference p; — /2 in the effectiveness of two treatments 
in a completely randomized experiment is based on the randomization distri- 
bution of f; — 62. When conditions are met, our usual inference procedures 


STEP 


TECHNOLOGY 
CORNERS 


based on the sampling distribution of /; — f2 will be approximately correct. 


Be sure to follow the four-step process whenever you construct a confidence 
interval or perform a significance test for comparing two proportions. 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


21. Confidence interval for a difference in proportions 


22. Significance test for a difference in proportions 


Exercises 


4 Remember: We are no longer reminding you to use the four-step 


L. process in exercises that require you to perform inference. 


page 618 
page 624 


1. Goldfish Refer to the example on page 615. Sup- 3. IT want red! A candy maker offers Child and Adult 
615 pose that your teacher decides to take SRSs of 100 bags of jelly beans with different color mixes. The 
& crackers from both bags instead. company claims that the Child mix has 30% red jelly 
(a) What is the shape of the sampling distribution of eee heey mea Ra eae 
pi — pa? Why? Suppose we take a random sample of 50 jelly beans 
(b) Find the mean of the sampling distribution. Show from the Child mix and a separate random sample 
your work. of 100 jelly beans from the Adult mix. Let fc and fa 
; ee ates be the sample proportions of red jelly beans from the 
(c) Find the standard deviation of the sampling distribu- ; . oa 
Tonestiow, otrcore Child and Adult mixes, respectively. 
(a) What is the shape of the sampling distribution of 
2. Homework Refer to page 612. Suppose that both Po pa? Why? 
school counselors decide to take SRSs of 150 stu- (oeiancine mesma tne cammiine cd mibitons Stow 
dents instead. your work 
(a) ee 6. es ithe saropine distubultonok (c) Find the standard deviation of the sampling distribu- 
pie tion. Show your work. 
© ae ae HN eens te SuNORE ON: Sat 4. Literacy A researcher reports that 80% of high 
aoe aie school graduates, but only 40% of high school 
(c) Find the standard deviation of the sampling distribu- dropouts, would pass a basic literacy test.’ Assume 


tion. Show your work. 


that the researcher’s claim is true. Suppose we give 
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(a) 
(b) 


(c) 
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a basic literacy test to a random sample of 60 high 
school graduates and a separate random sample of 
75 high school dropouts. Let fc and fp be the 
sample proportions of graduates and dropouts, re- 
spectively, who pass the test. 


What is the shape of the sampling distribution of 
Pa — po? Why? 

Find the mean of the sampling distribution. Show 
your work. 


Find the standard deviation of the sampling distribu- 
tion. Show your work. 


Explain why the conditions for constructing a two-sample 
z interval for p; — p2 are not met in the settings of 
Exercises 5 through 8. 


b 


Don’t drink the water! ‘The movie A Civil Action 
(Touchstone Pictures, 1998) tells the story of a 

major legal battle that took place in the small town 
of Woburn, Massachusetts. A town well that supplied 
water to eastern Woburn residents was contaminated 
by industrial chemicals. During the period that resi- 
dents drank water from this well, 16 of the 414 babies 
born had birth defects. On the west side of Woburn, 
3 of the 228 babies born during the same time period 
had birth defects. 


In-line skaters A study of injuries to in-line skat- 
ers used data from the National Electronic Injury 
Surveillance System, which collects data from a 
random sample of hospital emergency rooms. The 
researchers interviewed 161 people who came to 
emergency rooms with injuries from in-line skat- 
ing. Wrist injuries (mostly fractures) were the most 
common.° The interviews found that 53 people 
were wearing wrist guards and 6 of these had wrist 
injuries. Of the 108 who did not wear wrist guards, 
45 had wrist injuries. 


Shrubs and fire Fire is a serious threat to shrubs 
in dry climates. Some shrubs can resprout from 
their roots after their tops are destroyed. One study 
of resprouting took place in a dry area of Mexico.’ 
The investigators randomly assigned shrubs to treat- 
ment and control groups. They clipped the tops of 
all the shrubs. They then applied a propane torch 
to the stumps of the treatment group to simulate 

a fire. All 12 of the shrubs in the treatment group 
resprouted. Only 8 of the 12 shrubs in the control 
group resprouted. 


Broken crackers We don’t like to find broken crack- 
ers when we open the package. How can makers re- 
duce breaking? One idea is to microwave the crack- 
ers for 30 seconds right after baking them. Breaks 
start as hairline cracks called “checking.” Randomly 


10. 


11. 


IDE 


assign 65 newly baked crackers to the microwave and 
another 65 to a control group that is not microwaved. 
After one day, none of the microwave group and 16 
of the control group show checking.* 


Who tweets? Do younger people use ‘Twitter more 
often than older people? In a random sample of 316 
adult Internet users aged 18 to 29, 26% used ‘Twitter. 
In a separate random sample of 532 adult Internet 
users aged 30 to 49, 14% used Twitter.’ 


Calculate the standard error of the sampling distri- 
bution of the difference in the sample proportions 
(younger adults — older adults). What information 
does this value provide? 


Construct and interpret a 90% confidence interval for 
the difference between the true proportions of adult 
Internet users in these age groups who use ‘Twitter. 


Listening to rap Is rap music more popular 
among young blacks than among young whites? 
A sample survey compared 634 randomly chosen 
blacks aged 15 to 25 with 567 randomly selected 
whites in the same age group. It found that 368 
of the blacks and 130 of the whites listened to rap 
music every day.” 


Calculate the standard error of the sampling distri- 
bution of the difference in the sample proportions 
(blacks — whites). What information does this value 
provide? 


Construct and interpret a 95% confidence interval 
for the difference between the proportions of black 
and white young people who listen to rap every day. 


Young adults living at home A surprising number 
of young adults (ages 19 to 25) still live in their par- 
ents’ homes. A random sample by the National Insti- 
tutes of Health included 2253 men and 2629 women 
in this age group.'! The survey found that 986 of the 
men and 923 of the women lived with their parents. 


Construct and interpret a 99% confidence interval 
for the difference in the true proportions of men 
and women aged 19 to 25 who live in their parents’ 
homes. 


Does your interval from part (a) give convincing 
evidence of a difference between the population 
proportions? Explain. 


Fear of crime ‘The elderly fear crime more than 
younger people, even though they are less likely to 
be victims of crime. One study recruited separate 
random samples of 56 black women and 63 black 
men over the age of 65 from Atlantic City, New 
Jersey. Of the women, 27 said they “felt vulnerable” 
to crime; 46 of the men said this.” 


14. 


live 


18. 


I), 


Construct and interpret a 90% confidence interval 
for the difference in the true proportions of black 
women and black men in Atlantic City who would 
say they felt vulnerable to crime. 


Does your interval from part (a) give convincing 
evidence of a difference between the population 
proportions? Explain. 


Who owns iPods? As part of the Pew Internet 

and American Life Project, researchers surveyed a 
random sample of 800 teens and a separate random 
sample of +00 young adults. For the teens, 79% said 
that they own an iPod or MP3 player. For the young 
adults, this figure was 67%. Do the data give convinc- 
ing evidence of a difference in the proportions of all 
teens and young adults who would say that they own 
an iPod or MP3 player? State appropriate hypotheses 
for a test to answer this question. Define any param- 
eters you use. 


Steroids in high school A study by the National 
Athletic Trainers Association surveyed random 
samples of 1679 high school freshmen and 1366 
high school seniors in Illinois. Results showed that 
34 of the freshmen and 24 of the seniors had used 
anabolic steroids. Steroids, which are dangerous, are 
sometimes used in an attempt to improve athletic 
performance.!* Do the data give convincing evi- 
dence of a difference in the proportion of all Illinois 
high school freshmen and seniors who have used 
anabolic steroids? State appropriate hypotheses for a 
test to answer this question. Define any parameters 
you use. 


Who owns iPods? Refer to Exercise 13. Carry out a 
significance test at the a = 0.05 level. 


Steroids in high school Refer to Exercise 14. Carry 
out a significance test at the a = 0.05 level. 


Who owns iPods? Refer to Exercise 13. Construct 
and interpret a 95% confidence interval for the dif 
ference between the population proportions. Explain 
how the confidence interval is consistent with the 
results of the test in Exercise 15. 


Steroids in high school Refer to Exercise 14. Con- 
struct and interpret a 95% confidence interval for 
the difference between the population proportions. 
Explain how the confidence interval is consistent 
with the results of the test in Exercise 16. 


Children make choices Many new products intro- 
duced into the market are targeted toward children. 
The choice behavior of children with regard to new 
products is of particular interest to companies that 
design marketing strategies for these products. As part 
of one study, randomly selected children in different 
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20. 


Didhe 


age groups were compared on their ability to sort 
new products into the correct product category (milk 
or juice).!* Here are some of the data: 


Age group N Number who sorted correctly 
4- to 5-year-olds 50 10 
6- to 7-year-olds 53 28 


Did a significantly higher proportion of the 6- to 
7-year-olds than the 4- to 5-year-olds sort correctly? 
Give appropriate evidence to justify your answer. 


Marriage and status “Would you marry a person 
from a lower social class than your own?” Research- 
ers asked this question of a random sample of 385 
black, never-married college students. Of the 149 
men in the sample, 91 said “Yes.” Among the 236 
women, 117 said “Yes.” Did a significantly higher 
proportion of the men than the women who were 
surveyed say “Yes”? Give appropriate evidence to 
justify your answer. 


Driving school A driving school owner believes that 
Instructor A is more effective than Instructor B at 
preparing students to pass the state’s driver's license 
exam. An incoming class of 100 students is randomly 
assigned to two groups, each of size 50. One group is 
taught by Instructor A; the other is taught by Instruc- 
tor B. At the end of the course, 30 of Instructor A’s 
students and 22 of Instructor B’s students pass the 
state exam. 


Do these results give convincing evidence at the 
a = 0.05 level that Instructor A is more effective? 


Describe a Type I and a Type II error in this setting. 
Which error could you have made in part (a)? 


Preventing strokes Aspirin prevents blood from clot- 
ting and so helps prevent strokes. The Second Euro- 
pean Stroke Prevention Study asked whether adding 
another anticlotting drug, named dipyridamole, 
would be more effective for patients who had already 
had a stroke. Here are the data on strokes during the 
two years of the study:' 


Number of Number of 
patients strokes 
Aspirin alone 1649 206 
Aspirin + dipyridamole 1650 allow 


The study was a randomized comparative experiment. 


Is there convincing evidence at the a = 0.05 level that 
adding dipyridamole helps reduce the risk of stroke? 


Describe a Type I and a Type II error in this setting. 
Which is more serious? Explain. 
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Exercises 23 and 24 involve the following setting. Some 
women would like to have children but cannot do so for 
medical reasons. One option for these women is a pro- 
cedure called in vitro fertilization (IVF), which involves 
injecting a fertilized egg into the woman’s uterus. 


PSS 


Prayer and pregnancy ‘lwo hundred women who 
were about to undergo IVF served as subjects in an 
experiment. Each subject was randomly assigned 
to either a treatment group or a control group. 
Women in the treatment group were intentionally 
prayed for by several people (called intercessors) 
who did not know them, a process known as in- 
tercessory prayer. The praying continued for three 
weeks following IVF. The intercessors did not pray 
for the women in the control group. Here are the 
results: 44 of the 88 women in the treatment group 
got pregnant, compared to 21 out of 81 in the con- 
trol group.'” 

Is the pregnancy rate significantly higher for 
women who received intercessory prayer? To 
find out, researchers perform a test of Ho: p) = p2 
versus H,:p; > p2, where p; and p2 are the actual 
pregnancy rates for women like those in the study 
who do and don’t receive intercessory prayer, 
respectively. 


Name the appropriate test and check that the condi- 
tions for carrying out this test are met. 


The appropriate test from part (a) yields a P-value of 
0.0007. Interpret this P-value in context. 


What conclusion should researchers draw at the a = 
0.05 significance level? Explain. 


The women in the study did not know whether they 
were being prayed for. Explain why this is important. 


Acupuncture and pregnancy A study reported in 
the medical journal Fertility and Sterility sought 

to determine whether the ancient Chinese art of 
acupuncture could help infertile women become 
pregnant.'® One hundred sixty healthy women who 
planned to have IVF were recruited for the study. 
Half of the subjects (80) were randomly assigned 

to receive acupuncture 25 minutes before embryo 
transfer and again 25 minutes after the transfer. ‘The 
remaining 80 women were assigned to a control 
group and instructed to lie still for 25 minutes after 
the embryo transfer. Results are shown in the table 
below. 


Acupuncture group Control group 
Pregnant 34 21 
Not pregnant 46 59 
Total 80 80 


Is the pregnancy rate significantly higher for women 
who received acupuncture? To find out, researchers 
perform a test of Ho: p; = p2 versus H,:p; > p2, where 
p; and p are the actual pregnancy rates for women 
like those in the study who do and don’t receive 
acupuncture, respectively. 


(a) Name the appropriate test and check that the condi- 
tions for carrying out this test are met. 


(b) ‘The appropriate test from part (a) yields a P-value of 
0.0152. Interpret this P-value in context. 


(c) What conclusion should researchers draw at the 
a = 0.05 significance level? Explain. 


(d) ‘The women in the study knew whether or not 
they received acupuncture. Explain why this is 
important. 


Multiple choice: Select the best answer for Exercises 25 
to 28. 

Exercises 25 to 27 refer to the following setting. A sample 
survey interviews SRSs of 500 female college students and 
550 male college students. Researchers want to determine 
whether there is a difference in the proportion of male 
and female college students who worked for pay last sum- 
mer. In all, +10 of the females and 484 of the males say 
they worked for pay last summer. 


25. ‘Take py and py to be the proportions of all college 
males and females who worked last summer. ‘The 
hypotheses to be tested are 


a) Ho: pu — pr = 0 versus Hy: pu — pr # 0. 
b) Ho:pm — pr = 0 versus Hy: pu — pr > 9. 


( 
( 
(c) Ho:pm — pr = 0 versus Hy: pu — pr < 0. 
(d) Ho:pu — pr > 0 versus Hy:pu — pr = 0. 
( 


e) Ho:pm — pr # 0 versus Hy: pu — pr = 0. 


26. ‘The researchers report that the results were statisti- 
cally significant at the 1% level. Which of the follow- 
ing is the most appropriate conclusion? 


(a) Because the P-value is less than 1%, fail to reject Hp. 
There is not convincing evidence that the proportion 
of male college students in the study who worked for 
pay last summer is different from the proportion of 
female college students in the study who worked for 
pay last summer. 


(b) Because the P-value is less than 1%, fail to reject 
Ho. There is not convincing evidence that the pro- 
portion of all male college students who worked for 
pay last summer is different from the proportion of 
all female college students who worked for pay last 
summer. 


Palle 


Because the P-value is less than 1%, reject Hp. 
There is convincing evidence that the proportion 
of all male college students who worked for pay last 
summer is the same as the proportion of all female 
college students who worked for pay last summer. 


Because the P-value is less than 1%, reject Ho. There 
is convincing evidence that the proportion of all 
male college students in the study who worked for 
pay last summer is different from the proportion of 
all female college students in the study who worked 
for pay last summer. 


Because the P-value is less than 1%, reject Ho. There 
is convincing evidence that the proportion of all 
male college students who worked for pay last sum- 
mer is different from the proportion of all female 
college students who worked for pay last summer. 


Which of the following is the correct margin of error 
for a 99% confidence interval for the difference in 
the proportion of male and female college students 
who worked for pay last summer? 
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In an experiment to learn whether Substance M 

can help restore memory, the brains of 20 rats were 
treated to damage their memories. First, the rats 
were trained to run a maze. After a day, 10 rats 
(determined at random) were given M and 7 of them 
succeeded in the maze. Only 2 of the 10 control rats 
were successful. The two-sample z test for “no dif- 
ference” against “a significantly higher proportion of 
the M group succeeds” 


gives z = 2.25,P < 0.02. 
gives z = 2.60, P< 0.005. 
gives z = 2.25, P < 0.04 but not < 0.02. 


should not be used because the Random condition is 
violated. 
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(e) should not be used because the Large Counts condi- 
tion is violated. 


Exercises 29 and 30 refer to the following setting. ‘Thirty 
randomly selected seniors at Council High School were 
asked to report the age (in years) and mileage of their 
main vehicles. Here is a scatterplot of the data: 
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Age (years) 


We used Minitab to perform a least-squares regres- 
sion analysis for these data. Part of the computer 
output from this regression is shown below. 


Predictor Coef Stdev t-ratio P 

Constant = iehese)21 8773 = 10 Sis) 0.126 
Age 14954 1546 ORGi7) 0.000 
SoS 22723 RISG eS ii/..10)5 R-sq(adj) = 76.1% 


Drive my car (3.2) 


(a) What is the equation of the least-squares regression 
line? Be sure to define any symbols you use. 


(b) Interpret the slope of the least-squares line in the 
context of this problem. 


(c) One student reported that her 10-year-old car had 
110,000 miles on it. Find and interpret the residual 
for this data value. Show your work. 


. Drive my car (3.2, 4.3) 


(a) Explain what the value of 7’ tells you about how well 
the least-squares line fits the data. 


(b) ‘The mean age of the students’ cars in the sample was 
x = 8 years. Find the mean mileage of the cars in 
the sample. Show your work. 


(c) Interpret the value of s in the context of this setting. 


(d) Would it be reasonable to use the least-squares line 
to predict a car’s mileage from its age for a Council 
High School teacher? Justify your answer. 
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WHAT YOU WILL LEARN 


Comparing Two Means 


By the end of the section, you should be able to: 


Describe the shape, center, and spread of the sampling e Perform a significance test to compare two means. 


distribution of X; — Xo. 


e Determine when it is appropriate to use two-sample 


Determine whether the conditions are met for doing t procedures versus paired ¢ procedures. 
inference about ju — ju2. 
Construct and interpret a confidence interval to 
compare two means. 


ACTIVITY 


In the previous section, we developed methods for comparing two proportions. 
What if we want to compare the mean of some quantitative variable for the indi- 
viduals in Population | and Population 2? Our parameters of interest are the popu- 
lation means 1; and juz. Once again, the best approach is to take separate random 
samples from each population and to compare the sample means X, and x2. 

Suppose we want to compare the average effectiveness of two treatments in a 
completely randomized experiment. In this case, the parameters /1; and ju are the 
true mean responses for Treatment | and ‘Treatment 2, respectively. We use the 
mean response in the two groups, X; and X2, to make the comparison. Here’s a 
table that summarizes these two situations: 


Population or treatment Parameter Statistic Sample size 
1 My x nh 
2 Le Xp M 


We compare the populations or treatments by doing inference about the differ- 
ence [4) — }l2 between the parameters. The statistic that estimates this difference 
is the difference between the two sample means, x, — X2. To use X} — X> for infer- 
ence, we must know its sampling distribution. Here is an Activity that gives you a 
preview of what lies ahead. 


Does Polyester Decay? 


MATERIALS: 


10 small pieces of card 
stock (or index cards) per 
pair of students 


How quickly do synthetic fabrics such as polyester decay in landfills? A researcher 
buried polyester strips in the soil for different lengths of time, then dug up the 
strips and measured the force required to break them. Breaking strength is easy 
to measure and is a good indicator of decay. Lower strength means the fabric has 
decayed. 

The researcher buried 10 strips of polyester fabric in well-drained soil in the 
summer. The strips were randomly assigned to two groups: 5 of them were bur- 
ied for 2 weeks and the other 5 were buried for 16 weeks. Here are the breaking 
strengths in pounds:!? 


-20 


-15 


10 -5 0 
DifferenceilnMeans 
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Group | (2 weeks): 118 126 126 120 129 
Group 2 (16 weeks): 2s 98 110 140 110 


Do the data give convincing evidence that polyester decays more in 16 weeks than 
in 2 weeks? 

1. The Fathom dotplot displays the data from the experiment. Discuss what this 
graph shows with your classmates. 


15 125 135 
Breaking_strength 


For Group 1, the mean breaking strength was x; = 123.8 pounds. For Group 2, 
the mean breaking strength was ¥; = 116.4 pounds. The observed difference in 
average breaking strength for the two groups is X; — x2 = 123.8 — 116.4 = 7.4 
pounds. Is it plausible that this difference is due to the chance involved in the 
random assignment and not to the treatments themselves? To find out, your class 
will perform a simulation. 

Suppose that the length of time in the ground has no effect on the breaking 
strength of the polyester specimens. Then each specimen would have the same 
breaking strength regardless of whether it was assigned to Group | or Group 2. 
In that case, we could examine the results of repeated random assignments of the 
specimens to the two groups. 


2. Write each of the 10 breaking-strength measurements on a separate card. Mix 
the cards well and deal them face down into two piles of 5 cards each. Be sure to 
decide which pile is Group | and which is Group 2 before you look at the cards. 
Calculate the difference in the mean breaking strength (Group 1 — Group 2). 
Record this value. 

3. Your teacher will draw and label axes for a class dotplot. Plot the result you 
got in Step 2 on the graph. 

4. Repeat Steps 2 and 3 if needed to get a total of at least 40 repetitions of the 
simulation for your class. 

5. Based on the class’s simulation results, how surprising would it be to get a 
difference in means of 7.4 or larger simply due to the chance involved in the 
random assignment? 


6. What conclusion would you draw about whether polyester decays more when 
left in the ground for longer periods of time? Explain. 


In this simulation, 14 of the 100 trials (in red) produced a difference 
in means of at least 7.4 pounds, so the approximate P-value is 0.14. It 
is likely that a difference this big could have happened just due to the 


= of edie fisélie., i , chance variation in random assignment. The observed difference is 
5 


10 15 not statistically significant and does not provide convincing evidence 
that polyester decays more in 16 weeks than in 2 weeks. 
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The Sampling Distribution of a Difference 
between Two Means 


To explore the sampling distribution of ¥; — X2, let’s 
start with two Normally distributed populations hav- 
ing known means and standard deviations. Based 
on information from the U.S. National Health and 
Nutrition Examination Survey (NHANES), the 
heights of 10-year-old girls follow a Normal distribu- 
tion with mean jup = 56.4 inches and standard devia- 
tion op = 2.7 inches. The heights of 10-year-old boys 
follow a Normal distribution with mean jy, = 55.7 
inches and standard deviation oy = 3.8 inches.” 

Suppose we take independent SRSs of 12 girls and 
8 boys of this age and measure their heights. What can 
we say about the difference Xp — Xy in the average 
heights of the sample of girls and the sample of boys? 

We used Fathom software to take an SRS of 12 ten-year-old girls and 8 ten-year- 
old boys and to plot the values of Xp, yy, and Xp — X)y for each sample. Our first 
set of simulated samples gave Xp = 56.09 inches and X,, = 54.68 inches, so dots 
were placed above each of those values in Figure 10.7(a) and (b). The difference 
in the sample means is Xp — Xy = 56.09 — 54.68 = 1.41 inches. A dot for this 
value appears in Figure 10.7(c). The three dotplots in Figure 10.7 show the results 
of repeating this process 1000 times. These are the approximate sampling distribu- 
tions of Xp, Xy, and Xp — Xy. 


(a) Approximate sampling distribution of x; (b) Approximate sampling distribution of Xj, (c) Approximate sampling distribution of x- — Xy 


fu. 8 


-4 -2 0 2 4 6 


xbar_f diffmean 
Shape: Approximately Normal Shape: Approximately Normal Shape: Approximately Normal 
Center: Mean = 56.40 inches Center: Mean = 55.73 inches Center: Mean = 0.67 inches 
Spread: SD = 0.80 inches Spread: SD = 1.35 inches Spread: SD = 1.56 inches 


FIGURE 10.7 Simulated sampling distributions of (a) the sample mean height x; in 1000 SRSs 
of size ne = 12 from the population of 10-year-old girls, (b) the sample mean height xy in 1000 
SRSs of size ny = 8 from the population of 10-year-old boys, and (c) the difference in sample 
means X- — Xy for each of the 1000 repetitions. 


In Chapter 7, we saw that the sampling distribution of a sample mean x has the 
following properties: 


Shape: (1) If the population distribution is Normal, then so is the sampling dis- 
tribution of x; (2) if the population distribution isn’t Normal, the sampling distri- 
bution of * will be approximately Normal if the sample size is large enough (say, 
n = 30) by the central limit theorem (CLT). 


THINK 
ABOUT IT 
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Center: pz = 


Spread: 03 = aa if the sample is no more than 10% of the population 
n 


For the sampling distributions of Xp and Xj, in this case: 
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Sampling distribution of x; 


Shape Normal, because the population 
distribution is Normal 


Center = juz, = p= 56.4 inches 
OF 2.7 ‘ 
z= —= = — = 0.78 inches 
Spread 0x, Vm Vi2 


because there are way more than 
10(12) = 120 ten-year-old girls in the 
United States. 


Sampling distribution of x, 


Normal, because the population 
distribution is Normal 


Hx, = Lem = 59.7 inches 


Om 3.8 . 
Zz, = —= = — = 1.34 inches 
ou Vin V8 


because there are way more than 
10(8) = 80 ten-year-old boys in the 
United States. 


The approximate sampling distributions in Figures 10.7(a) and (b) give similar 
results. 

What about the sampling distribution of Xp — Xj)? Figure 10.7(c) suggests that 
it has a roughly Normal shape, is centered at about 0.67 inches, and has standard 
deviation about 1.56 inches. The shape makes sense because we are combining 
two independent Normal random variables, <p» and x. How about the center? 
The actual mean height of 10-year-old girls is wp = 56.4 inches. For 10-year-old 
boys, the actual mean height is jay = 55.7 inches. We'd expect the difference 
Xp — Xy to center on the actual difference in the population means, pup — puy = 
56.4 — 55.7 = 0.7 inches. The spread, however, is a bit more complicated. 


How can we find formulas for the mean and standard devia- 
tion of the sampling distribution of x; — X2? Both x, and x2 are 
random variables. That is, their values would vary in repeated independent SRSs 
of size n, and n2. Independent random samples yield independent random vari- 
ables x; and X2. The statistic x; — X2 is the difference of these two independent 
random variables. 

In Chapter 6, we learned that for any two random variables X and Y, 


Hx—-y ~ Ux ~ Hy 
For the random variables x; and x7, we have 
Mz, -x, =e Ly, a Lx, = by ~~ by 
In the observational study of the heights of 10-year-olds, 
Lz»—%y = Ler — bm = 56.4 — 55.7 = 0.70 inches 
We also learned in Chapter 6 that for independent random variables X and Y, 
oo pe ah, 2 
Ox-y = ox + oY 


For the random variables x; and x7, we have 


2 2 2 2 
2 Dt ol O71 abe O2 SL 
OF = = Oz 0; = — 

ee Vn V2 ny 2 
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In the observational study of the heights of 10-year-olds, 


vA 2 2 2 
OF OM 2.7 3.8 
° yz 1M J I? 8 - 


This is similar to the result from the Fathom simulation. 
ee 


Here are the facts we need. 


THE SAMPLING DISTRIBUTION OF x, — x2 


FIGURE 10.8 Select independent 
SRSs from two populations having 
means ju1 and j2 and standard de- 
viations o; and o>. The two sample 
means are X,; and Xp». If the popu- 
lation distributions are both Normal, 
the sampling distribution of the 
difference x; — X» is Normal. The 
sampling distribution will be ap- 
proximately Normal in other cases 
if both samples are large enough 
(nm, = 30 and no = 30). 


When conditions are met, the sampling distribution of x; — x2 
will be approximately Normal with mean puz,—z, = fu) — fz and 


re 
standard deviation o;,-;, = \/ ot + - Figure 10.8 displays this 
1 7 


distribution. 

The formula for the standard deviation of the sampling dis- 
tribution involves the parameters 0) and 02, which are usually 
unknown. Just as in Chapters 8 and 9, we must replace these by 
estimates to do inference. We'll get to confidence intervals and 
significance tests shortly. For now, let’s focus on the sampling 
distribution of x; — Xp. 


Medium or Large Drink? 
Describing the sampling distribution of x, — x» 


A fast-food restaurant uses an automated filling machine to pour its soft drinks. The 
machine has different settings for small, medium, and large drink cups. According to 
the machine’s manufacturer, when the large setting is chosen, the amount of liquid 
L dispensed by the machine follows a Normal distribution with mean 27 ounces and 
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standard deviation 0.8 ounces. When the medium setting is chosen, the amount 
of liquid M dispensed follows a Normal distribution with mean 17 ounces and 
standard deviation 0.5 ounces. ‘To test the manufacturer’s claim, the restaurant 
manager measures the amount of liquid in each of 20 cups filled with the large 
setting and 25 cups filled with the medium setting. Let x;, — X)y be the differ- 
ence in the sample mean amount of liquid under the two settings. 


PROBLEM: 
(a) What is the shape of the sampling distribution of x, — xy? Why? 


(b) Find the mean of the sampling distribution. Show your work. 
(c) Find the standard deviation of the sampling distribution. Show your work. 


SOLUTION: 


(a) The sampling distribution of x, — xy is Normal because both population distributions are 
Normal. 


(b) The meanis pz —;, = bi — by = 27 — 17 = 10 ounces. 
of , on _ | (0.80)? in (0.50)? 


my 20 25 
Note that we do not need to check the 10% condition because we are not sampling without replace- 
ment from a finite population. 


(c) The standard deviationis o;,—;, = = 0.205 ounces. 


For Practice Try Exercise 


The Two-Sample ¢ Statistic 


When data come from two independent random samples or two groups in a ran- 
domized experiment (the Random condition), the statistic +; — x2 is our best 
guess for the value of ju; — 12. If the 10% condition is met, the standard deviation 
of the sampling distribution of x; — X2 is 


If the Normal/Large Sample condition is met, we can standardize the observed 
difference X; — X2 to obtain a z statistic that is modeled well by a standard Normal 
distribution: 


(4 X2) > (ea — a) 


In the unlikely event that both population standard deviations are known, this 
two-sample z statistic is the basis for inference about ju) — 12. 
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You can thank statisticians B. L. Welch 
and F. E. Satterthwaite for discovering 
this fairly remarkable formula. 
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Suppose now that the population standard deviations 7 and 02 are not known. 
We estimate them by the standard deviations s; and sz from our two samples. The 
result is the standard error (also called the estimated standard deviation) of x; — X2: 


2 2 
Ss] 83 
pear + ees 
nN} nz 


SEz,-z, = 


Now when we standardize the point estimate ¥; — X2, the result is the two-sample 
t statistic: 


pe (X] — Xz) — (ply = faa) 
st 
nN} nz 


The statistic t has the same interpretation as any z or t statistic: it says how far 
X — X2 is from its mean in standard deviation units. When the Normal/Large 
Sample condition is met, the two-sample t statistic has approximately a t distri- 
bution. It does not have exactly a ¢ distribution even if the populations are both 
exactly Normal. In practice, however, the approximation is very accurate. 


CONDITIONS FOR PERFORMING INFERENCE ABOUT ;1; — 2 


There are two practical options for using the two-sample t procedures when the 
conditions are met. The two options are exactly the same except for the degrees of 
freedom used for t critical values and P-values. 


Option 1 (Technology): Use the ¢ distribution with degrees of freedom calcu- 
lated from the data by the formula below. Note that the df given by this formula is 
usually not a whole number. 


a ae) 
4 
ny — 1\ nz — 1\n2 


Option 2 (Conservative): Use the t distribution with degrees of freedom equal 
to the smaller of n; — 1 and nz — 1. With this option, the resulting confidence 
interval has a margin of error as large as or larger than is needed for the desired 
confidence level. The significance test using this option gives a P-value equal to or 
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greater than the true P-value. As the sample sizes increase, confidence levels and 
: 2 
P-values from Option 2 become more accurate.”! 


Confidence Intervals for pz; — pe2 


If the Random, 10%, and Normal/Large Sample conditions are met, we can use 
our standard formula to construct a confidence interval for f4) — [U2: 


statistic + (critical value) - (standard deviation of statistic) 


We can use either technology or the conservative approach with Table B to find 
the critical value t* for the given confidence level. This method is called a two- 
sample t interval for a difference between two means. 


TWO-SAMPLE t INTERVAL FOR A DIFFERENCE BETWEEN TWO MEANS 


When the conditions are met, an approximate C% confidence interval for 


jt) — pz Is 
Fo 2 
= = Ss] 82 
= ae | 4 
(ey = 27) mn 


Here, t* is the critical value with C% of its area between —t* and t* for the 
t distribution with degrees of freedom using either Option | (technology) or 
Option 2 (the smaller of mn; — 1 and n2 — 1). 


The following example shows how to construct and interpret a confidence in- 
terval for a difference in means. As usual with inference problems, we follow the 
. four-step process. 


Big Trees, Small Trees, 4 
Short Trees, Tall Trees L. 


Confidence interval for p44 — p2 


The Wade Tract Preserve in Georgia is an old-growth forest 
of longleaf pines that has survived in a relatively undisturbed 
state for hundreds of years. One question of interest to for- 
fats —{ | = /-— esters who study the area is “How do the sizes of longleaf 

pine trees in the northern and southern halves of the forest 
compare?” ‘To find out, researchers took random samples of 
30 trees from each half and measured the diameter at breast 


nae | | height (DBH) in centimeters.”” Here are comparative box- 
plots of the data and summary statistics from Minitab. 


0 10 20 30 40 50 60 Descriptive Statistics: North, South 
DBH (centimeters) A 
Variable N Mean StDev 
North 30 23.70 L7.50 


South 30 34.53 14.26 
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Upper-tail probability p 
df 10 
28 = =1.313 


301.310 
80% 


Confidence level C 
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PROBLEM: 


(a) Based on the graph and numerical summaries, write a few sentences comparing the sizes of 
longleaf pine trees in the two halves of the forest. 


(b) Construct and interpret a 90% confidence interval for the difference in the mean DBH of longleaf 
pines in the northern and southern halves of the Wade Tract Preserve. 


SOLUTION: 


(a) The distribution of DBH measurements in the northern sample is skewed to the right, while the 
distribution of DBH measurements in the southern sample is skewed to the left. It appears that 
trees in the southern half of the forest have larger diameters. The mean and median DBH for the 
southern sample are both much larger than the corresponding measures of center for the northern 
sample. Furthermore, the boxplots show that more than 75% of the southern trees have diameters 
that are above the northern sample’s median. There is more variability in the diameters of the 
northern longleaf pines, as we can see from the larger range, IQR, and standard deviation for this 
sample. No outliers are present in either sample. 


(b) STATE: Our parameters of interest are ju, = the true mean DBH of all trees in the southern 


half of the forest and 42 = the true mean DBH of all trees in the northern half of the forest. We want 
to estimate the difference [14 — {42 ata 90% confidence level. 


PLAN: Ifconditions are met, we'll construct a two-sample tinterval for 4 — [2. 


* Random: The data came from independent random samples of 30 trees each from the northern 
and southern halves of the forest. 


° 10%: Because sampling without replacement was used, there have to be at least 10(30) = 
300 trees in each half of the forest. This is fairly safe to assume. 


* Normal/Large Sample: The boxplots give us reason to believe that the population distributions of 
DBH measurements may not be Normal. However, because both sample sizes are at least 30, we are 
safe using two-sample t procedures. 

DO: From the Minitab output, x, = 34.53, 5, = 14.26, m = 30, x2 = 23.70, 5, = 17.50, 
and nz = 30. We'lluse the conservative df = the smaller of nj — 1 and np — 1, whichis 29. Fora 
90% confidence level the critical value from Table Bis t* = 1.699. So a 90% confidence interval for 


[1 — fg is 
17.50? 


ae 14.26? 
(iy — a) = vy] + = (5458 — 25:70) = 1.600,/ x 
Ny i) 30 30 


= 10.83 + 7.00 = (3.83, 17.83) 


Using technology: Refer to the Technology Corner that follows the example. The calculator’s 
2-SampTInt gives (3.9362, 17.724) using df = 55.728. 


CONCLUDE: Weare 90% confident that the interval from 3.9362 to 17.724 centimeters captures 
the difference in the actual mean DBH of the southern trees and the actual mean DBH of the northern trees. 


For Practice Try Exercise 


The 90% confidence interval in the example does not include 0. This gives 
convincing evidence that the difference in the mean diameter of northern and 
southern trees in the Wade Tract Preserve isn’t 0. However, the confidence inter- 
val provides more information than a simple reject or fail to reject Hp conclusion. 
It gives a set of plausible values for 4; — j12. The interval suggests that the mean 
diameter of the southern trees is between 3.83 and 17.83 cm larger than the mean 
diameter of the northern trees. 
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oe, 


We chose the parameters in the DBH example so that x; — x2 would be positive. 
What if we had defined ju; as the mean DBH of the northern trees and juz as the 
mean DBH of the southern trees? The 90% confidence interval for jz; — {42 would be 


2 2 
(23,70 — 34,53). = 1.699, /4750 — = —10.83 + 7.00 = (—17.83, —3.83) 
This interval suggests that the mean diameter of the northern trees is between 3.83 
and 17.83 cm smaller than the mean diameter of the southern trees. Changing 
the order of subtraction doesn’t change the result. 
As with other inference procedures, you can use technology to perform the 
calculations in the “Do” step. Remember that technology comes with potential 


benefits and risks on the AP® exam. 


e 


—erprpayees TWO-SAMPLE t INTERVALS 


CORNER ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


You can use the two-sample t interval command on the ‘TI-83/84 or T'l-89 to construct a confidence interval for the dif- 
ference between two means. We'll show you the steps using the summary statistics from the pine trees example. 


TI-83/84 TI-89 
Press |Stat|, then choose ‘TESTS e Press [2nd][F2] ([F'7]) Ints and 
and 2-SampTInt.... choose 2-SampTInt.... 


Choose Stats as the input method and enter the summary statistics as shown. 


Inpt:Data 

%1:34.53 

Sx1:14.26 

n1:3@ 

X2:23.7 

Sx2:17.5 

n2:38 

C-Level: .90 5 
+Pooled:[IK) Yes MAIN RAD AUTO FUME 16 


Enter the confidence level: C-level: .90. For Pooled: choose “No.” We'll discuss pooling later. 
Highlight Calculate and press [ENTER |. 


NORMAL FLOAT AUTO REAL RADIAN CL fi 


=i ={2.936017.722 
(3.9362,17.724) ae =10.B3 
df=5SS.7276914 =6.BS38> 
x1=34.53 
X2=23.7 
Sx1=14.26 
Sx2=17.5 
n1=30 
n2=30 


MAIN RAD AUTO FUNC 1 3 


AP® EXAM TIP The formula for the two-sample tinterval for p11 — ju. often leads to calculation errors by students. As a result, 


we recommend using the calculator’s 2-SampTInt feature to compute the confidence interval on the AP® exam. Be sure to 
name the procedure (two-sample tinterval) and to give the interval (3.9362, 17.724) and df (55.728) as part of the “Do” step. 
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The calculator’s 90% confidence interval for fu; — [2 is (3.936, 17.724). This 
interval is narrower than the one we found by hand earlier: (3.83, 17.83). Why the 
difference? We used the conservative df = 29, but the calculator used df = 55.73. 
With more degrees of freedom, the calculator’s critical value is smaller than our 
t* = 1.699, which results in a smaller margin of error and a narrower interval. 


CHECK YOUR UNDERSTANDING 


The U.S. Department of Agriculture (USDA) conducted a survey to estimate the average 
price of wheat in July and in September of the same year. Independent random samples 
of wheat producers were selected for each of the two months. Here are summary statistics 
on the reported price of wheat from the selected producers, in dollars per bushel:”* 


Month n Xx Sy 
July 90 $2.95 $0.22 
September 45 $3.61 $0.19 


Construct and interpret a 99% confidence interval for the difference in the true mean 
wheat price in July and in September. 


Significance Tests for pz; — po 


An observed difference between two sample means can reflect an actual differ- 
ence in the parameters j) and {12, or it may just be due to chance variation in 
random sampling or random assignment. Significance tests help us decide which 
explanation makes more sense. The null hypothesis has the general form 


Ho: 4) — 42 = hypothesized value 


We're often interested in situations in which the hypothesized difference 
is 0. Then the null hypothesis says that there is no difference between the two 
parameters: 


Ho: — M2 = 0 or, alternatively, Ho: p14) = pu 


The alternative hypothesis says what kind of difference we expect. 

If the Random, 10%, and Normal/Large Sample conditions are met, we can 
proceed with calculations. To do a test, standardize x; — ¥2 to get a two-sample 
t statistic: 


statistic — parameter 


test statistic = — —_ 
standard deviation of statistic 


To find the P-value, use the ¢ distribution with degrees of freedom given by 
Option | (technology) or Option 2 (df = smaller of n; — 1 and n; — 1). Here are 
the details for the two-sample t test for the difference between two means. 
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TWO-SAMPLE ¢ TEST FOR THE DIFFERENCE BETWEEN TWO MEANS 


Suppose the conditions are met. To test the hypothesis Ho: 4; — j42 = hypothesized 
value, compute the two-sample t statistic 


(= 2) (n — 2) 


Fo 2 
Ss] 82 
pats + ee, 
nmi 2 


Find the P-value by calculating the probability of getting a ¢ statistic this large 
or larger in the direction specified by the alternative hypothesis H,. Use the 

t distribution with degrees of freedom approximated by technology or the 
smaller of n} — 1 and n, — 1. 


— 


AZ, : My — 2 > hypothesized value H, : 4, —M>< hypothesized value H, : 1, —M>¥ hypothesized value 
t if 


Here’s an example that shows how to perform a two-sample t test for a randomized 
experiment. 


-lt| i¢| 


Calcium and Blood Pressure 
Comparing two means 


Does increasing the amount of calcium in our diet reduce blood pressure? 
Examination of a large sample of people revealed a relationship between cal- 
cium intake and blood pressure. The relationship was strongest for black men. 
Such observational studies do not establish causation. Researchers therefore 
designed a randomized comparative experiment. 


The subjects were 21 healthy black men who volunteered to take part in the 
experiment. They were randomly assigned to two groups: 10 of the men re- 
ceived a calcium supplement for 12 weeks, while the control group of 11 men 
received a placebo pill that looked identical. The experiment was double- 
blind. The response variable is the decrease in systolic (top number) blood 
pressure for a subject after 12 weeks, in millimeters of mercury. An increase 
appears as a negative number.”* Here are the data: 


Group 1 (calcium): 7 
Group 2 (placebo): = —1 


PROBLEM: 


(a) Do the data provide convincing evidence that a calcium supplement reduces blood pressure more 
than a placebo? Carry out an appropriate test to support your answer. 


(b) Interpret the P-value you got in part (a) in the context of this experiment. 
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AP® EXAM TIP When 
checking the Normal 
condition on an AP® exam 
question involving inference 
about means, be sure 


to include graphs. Don’t 
expect to receive credit for 
describing graphs that you 
made on your calculator but 
didn’t put on paper. 


FIGURE 10.9 Sketches of boxplots of the changes 
in blood pressure for the two groups of subjects in 
the calcium and blood pressure experiment. 
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SOLUTION: 

(a) STATE: We want to perform a test of 
Ho: [4 — fla = O ; Ho: [lq = [a 
fom, — i SO or, equivalently, esse 


where [1 is the true mean decrease in systolic blood pressure for healthy black men like the ones 
in this study who take a calcium supplement and /12 is the true mean decrease in systolic blood 
pressure for healthy black men like the ones in this study who take a placebo. No significance level was 
specified, so we'll use a = 0.05. 


PLAN: Ifconditions are met, we will carry out a two sample ttest for 14 — [U2. 


° Random: The 21 subjects were randomly assigned to the two treatments. 
° 10%: Don’t need to check because there was no sampling. 


* Normal/Large Sample: With such small sample sizes, we need to graph the data to see if it's 
reasonable to believe that the actual distributions of differences in blood pressure when taking 
calcium or placebo are Normal. Figure 10.9 shows hand sketches of calculator boxplots for these 
data. The graphs show no strong skewness and no outliers. So we are safe using two-sample t 
procedures. 


Calcium 


Placebo 


-10 -5 0 5 10 15 20 
Change in BP 


DO: From the data, we calculated summary statistics: 


Group Treatment n xX Sy 
1 Calcium 10 5.000 8.743 
2 Placebo 11 —0.273 5.901 


° Test statistic 


= = = 1.604 
i, ( 8.743? 5.90127 3.2878 
Ny Ng 10 11 
° P-value By the conservative method, the smaller of n, — 1 and Upper-tail probability p 
tl, — 1 gives df = 9. Because H, counts only positive values oftas | gf 025 
evidence against Hp, the Pvalue is the area to the right of 8 1397 1.860 2.306 
t = 1.604 under the t distribution curve with df = 9. Figure 5 1,383 1.833. 9.962 
10.10 illustrates this P-value. Table B shows that the P-value lies , 
between 0.05 and 0.10. IO Vale Lie 22 


Using technology: Refer to the Technology Corner that follows the example. The calculator’s 
2-SampTTest gives t = 1.60 and P-value = 0.0644 using df = 15.59. 


Section 10.2 Comparing Two Means 647 


CONCLUDE: Because the P-value is greater than « = 0.05, we 
fail to reject Ho. The experiment does not provide convincing evidence 
that the true mean decrease in systolic blood pressure is higher for 
men like these who take calcium than for men like these who take a 
placebo. 

(b) Assuming Ho: 144 — [12 = Ois true, there is a 0.0644 probabil- 
ity of getting a difference in mean blood pressure reduction for the two 
groups (calcium — placebo) of 5.273 or greater just by the chance 
involved in the random assignment. 


tdistribution, 
9 degrees of freedom 


T = 1.604 


FIGURE 10.10 The P-value for the one-sided test using the 
conservative method, which leads to the ¢ distribution with 


9 degrees of freedom. For Practice Try Exercise 


When a significance test leads to a fail to reject Ho decision, as in the previous 
example, be sure to interpret the results as “We don’t have convincing evidence 
to conclude H,.” Saying anything that sounds like you believe Ho is (or might be) 
true is incorrect. 


THINK Why didn’t researchers find a significant difference in the 
calcium and blood pressure experiment? The difference in mean 
ABOUT IT systolic blood pressures for the two groups was 5.273 millimeters of mercury. This 
seems like a fairly large difference. With the small group sizes, however, this dif- 
ference wasn’t large enough to reject Ho: 4; — fz = 0 in favor of the one-sided 
alternative. We suspect that larger groups might show a similar difference in mean 
blood pressure reduction, which would indicate that calcium has a significant 
effect. If so, then the researchers in this experiment made a Type II error—failing 
to reject a false Ho. In fact, later analysis of data from an experiment with more 
subjects resulted in a P-value of 0.008. Sample size strongly affects the power of a 
test. It is easier to detect an actual difference in the effectiveness of two treatments 
if both are applied to large numbers of subjects. 


TECHNOLOGY Tia CAMPLE ¢ TESTS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Technology gives smaller P-values for two-sample t tests than the conservative method. That’s because calculators and 
software use the more complicated formula on page 640 to obtain a larger number of degrees of freedom. 
e Enter the Group | (calcium) data in LI /listl and the Group 2 (placebo) data in L2/list2. 


e ‘To perform the significance test, go to STAT/TESTS (Tests menu in the Statistics/List Editor on the TI-89) and 
choose 2-SampTTest. 


e In the 2-SampTTest screen, specify “Data” and adjust your other settings as shown. 
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TI-83/84 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


Inet :DERe) Stats 
List1:Li 

List2:Le 

Freqi:1 

Freq2:1 i= Alternate Wve: EQS 
H1:*u2 <u2 By 

Pooled: Yes + Fooled MD > 
Color: 


Calculate Draw 


e Highlight “Calculate” and press [ENTER], (‘The Pooled option will be discussed shortly.) 


NORMAL FLOAT AUTO REAL RADIAN CL f 


H1>H2 
t=1.603717288 
P=. 0644196844 
df=15.59051297 
x1=5 
X2=~.272727273 
Sx1=8. 74325137 
4Sx2=5. 90069333 


If you select “Draw” instead of “Calculate,” the appropriate ¢ distribution will be displayed, showing the test statistic 
and the shaded area corresponding to the P-value. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


2-SampTTest 
t=1.6037 P=.0644 


AP® EXAM TIP The formula for the two-sample f statistic for a test about 1; — ju. often leads to calculation 
errors by students. As a result, we recommend using the calculator’s 2-SampTTest feature to perform 


calculations on the AP® exam. Be sure to name the procedure (two-sample ¢ test) and to report the test statistic 
(t = 1.60), P-value (0.0644), and df (15.59) as part of the “Do” step. 


Inference for Experiments Confidence intervals and tests for pu) — p17 
are based on the sampling distribution of ¥; — x2. But in experiments, we aren't 
sampling at random from any larger populations. We can think about what would 
happen if the random assignment were repeated many times under the assump- 
tion that Ho: w; — ~2 = 0 is true. That is, we assume that the specific treatment 
received doesn’t affect an individual subject’s response. 

Let’s see what would happen just by chance if we randomly reassign the 21 
subjects in the calcium and blood pressure experiment to the two groups many 
times, assuming the drug received doesn’t affect each individual’s change in sys- 
tolic blood pressure. We used Fathom software to redo the random assignment 
1000 times. The approximate randomization distribution of X; — X2 is shown in 
Figure 10.11. It has an approximately Normal shape with mean 0 (no difference) 
and standard deviation 3.42. 


FIGURE 10.11 Fathom simulation 
showing the approximate random- 
ization distribution of x; — X» 
from 1000 random reassignments 
of subjects to treatment groups in 
the calcium and blood pressure 
experiment. 


-10 8 6 -4 


Difference in means X;—Xo if Hb is true 


Shape: Approx. Normal 


Center: Mean = 0 
Spread: Standard deviation = 3.42 


t distribution 
with df = 15.59 


-2 0 2 4 6 


FIGURE 10.12 The distribution of the two-sample 
ttest statistic for the 1000 random reassignments 
in Figure 10.11. 


Approximate randomization 
distribution of x;—x2 
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In the actual experiment, the differ- 
ence in the mean change in blood pressure 
in the calcium and placebo groups was 
5.000 — (—0.273) = 5.273. How likely 
is it that a difference this large or larger 
would happen just by chance when Hp is 
true? Figure 10.11 provides a rough an- 
swer: 61 of the 1000 random reassignments 
yielded a difference in means greater than 
or equal to 5.273. That is, our estimate of 
the P-value is 0.061. This is quite close to 
the 0.0644 P-value that we obtained in the 
Technology Corner. 

If Figure 10.11 displayed the results of all 
possible random reassignments of subjects 
to treatment groups, it would be the actual 
randomization distribution of x; — x7. The P-value obtained from 
this distribution would be exactly correct. Using the two-sample t 
test to calculate the P-value gives only approximately correct results. 

Figure 10.12 shows the value of the two-sample t test statistic for 
each of the 1000 re-randomizations, calculated using our familiar 
formula 


=~ hint 8 8 
diffmean 


(Ll) — [2) 


sf 83 
1 2 
aie + ue 


— x2) - 


nN] n2 


The density curve for the ¢ distribution with df = 15.59 is shown in 
blue. We can see that the test statistic follows the t distribution quite 
closely in this case. 

Whenever the conditions are met, the randomization distribu- 
tion of X; — X2 looks much like its sampling distribution. We are 
therefore safe using two-sample t procedures for comparing two 
means in a randomized experiment. 


CHECK YOUR UNDERSTANDING 


How quickly do synthetic fabrics such as polyester decay in landfills? A researcher buried 
polyester strips in the soil for different lengths of time, then dug up the strips and mea- 
sured the force required to break them. Breaking strength is easy to measure and is a good 
indicator of decay. Lower strength means the fabric has decayed. 

For one part of the study, the researcher buried 10 strips of polyester fabric in well- 
drained soil in the summer. The strips were randomly assigned to two groups: 5 of them 
were buried for 2 weeks and the other 5 were buried for 16 weeks. Here are the breaking 
strengths in pounds:” 


Group | (2 weeks): 
Group 2 (16 weeks): 


120 
140 


129 
110 


118 
124 


126 
98 


126 
110 


Do the data give convincing evidence that polyester decays more in 16 weeks than in 


2 weeks? 
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Using Two-Sample ¢ Procedures Wisely 


In Chapter 9, we used paired t procedures to compare the mean change in 
depression scores for a group of caffeine-dependent individuals when taking caf- 
feine and a placebo. The inference involved paired data because the same 11 sub- 
jects received both treatments. In this chapter, we used two-sample t procedures 
to compare the mean change in blood pressure for a group of healthy 

black men when taking calcium and a placebo. This time, the inference @ 
involved two distinct groups of subjects. The proper method of analysis 
depends on the design of the study. 


Comparing Tires and 


Comparing Workers 


Independent samples versus paired data 
PROBLEM: Ineach of the following settings, decide whether you should use paired t pro- 
cedures or two-sample t procedures to perform inference.”° Explain your choice. 


(a) To test the wear characteristics of two tire brands, A and B, one Brand A tire is mounted on 
one side of each car in the rear, while a Brand B tire is mounted on the other side. Which side gets 
which brand is determined by flipping a coin. 


(b) Can listening to music while working increase productivity? Twenty factory workers agree to 
take part ina study to investigate this question. Researchers randomly assign 10 workers to 
do a repetitive task while listening to music and the other 10 workers to do the task in silence. 


SOLUTION: 

(a) Paired t procedures. This is a matched pairs experiment, with the two treatments (Brand A 
and Brand B) being randomly assigned to the rear pair of wheels on each car. 

(b) Two-sample t procedures. The data are being produced using two distinct groups of work- 
ers in a randomized experiment. 


For Practice Try Exercise 


The same logic applies when data are produced by random sampling. If in- 
dependent random samples are taken from each of two populations, we should 
use two-sample t procedures to perform inference about ju; — j2 if conditions are 
met. If one random sample is taken, and two data values are recorded for each 
individual, we should use paired t procedures to perform inference about the 
population mean difference pup if conditions are met. 


The Pooled Two-Sample t Procedures (Don’t Use Them!) Most 
software offers a choice of two-sample t statistics. One is often labeled “unequal” 
variances; the other, “equal” variances. The “unequal” variance procedure uses 
our two-sample t statistic. This test is valid whether or not the population variances 
are equal. 


Section 10.2 Comparing Two Means »4651 


The other choice is a special version of the two-sample t statistic that assumes 
that the two populations have the same variance. This procedure combines (the 
statistical term is pools) the two sample variances to estimate the common popula- 
tion variance. The resulting statistic is called the pooled two-sample t statistic. 

The pooled t statistic has exactly the t distribution with n; + n; — 2 degrees of 
freedom if the two population variances really are equal and the population dis- 
tributions are exactly Normal. This method offers more degrees of freedom than 
Option | (technology), which leads to narrower confidence intervals and smaller 
P-values. The pooled t procedures were in common use before software made it 
easy to use Option | for our two-sample t statistic. 

In the real world, distributions are not exactly Normal, and population vari- 

Remember wealwaye lise Me POd.  anceaite not exactly equal. In practice, the Option | two-sample t procedures 
sample proportion p¢ when performing ; 

a significance test for comparing two av almost always more accurate than the pooled procedures. Our advice: 
proportions. But we don’t recommend = Never use the pooled t procedures if you have software that will carry out @ 
pooling when comparing two means. Option it, 


Fast-Food Frenzy! 


i = > VM DRIVE THRE ; Let’s return to the chapter-opening Case Study (page 609) about drive- 
ws 3 Se thru service at fast-food restaurants. Here, once again, are some results 


from the 2012 OSR study. 


e For restaurants with order-confirmation boards, 1169 of 1327 visits 
(88.1%) resulted in accurate orders. For restaurants with no order- 
confirmation board, 655 of 726 visits (90.2%) resulted in accurate 
orders. 


McDonald’s average service time for 362 drive-thru visits was 188.83 
seconds with a standard deviation of 17.38 seconds. Burger King’s 
service time for 318 drive-thru visits had a mean of 201.33 seconds 
and a standard deviation of 18.85 seconds. 


You are now ready to use what you have learned about comparing population 
parameters to perform inference about accuracy and average service time in 
the drive-thru lane. 


1. Is there a significant difference in order accuracy between restau- 
rants with and without order-confirmation boards? Carry out an 
appropriate test at the a = 0.05 level to help answer this question. 


A 95% confidence interval for the difference in the population proportions of 
accurate orders at restaurants with and without order-confirmation boards is 


(-0.049, 0.00649). 


2. Interpret the meaning of “95% confident” in the context of this 
study. 
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3. Explain how the confidence interval is consistent with your con- 
clusion from Question 1. 


Now turn your attention to the speed-of-service data. 


4. Construct and interpret a 99% confidence interval for the differ- 
ence in the mean service times at McDonald’s and Burger King 
drive-thrus. 


Summary 


¢ Choose independent SRSs of size n; from Population | and size nz from Pop- 
ulation 2. The sampling distribution of x; — X2 has the following properties: 
e Shape: Normal if both population distributions are Normal; approxi- 
mately Normal otherwise if both samples are large enough (n; = 30 and 

nz = 30) by the central limit theorem. 


e = Center: Its mean is ft) — [12. 
e Spread: As long as each sample is no more than 10% of its population, 


2 2 
; aw ol. © 
its standard deviation is ,/—- + —. 
nj) nz 
e Confidence intervals and tests for the difference between the means of two 
populations or the mean responses to two treatments j4; and j12 are based on 


the difference x; — x2 between the sample means. 
e Because we almost never know the population standard deviations in prac- 
tice, we use the two-sample t statistic 


= Xa) = (i ee) 


a 2 2 
Ss] 82 
—+ ken, 
ny nz 


This statistic has approximately a t distribution. There are two options for using 
at distribution to approximate the distribution of the two-sample t statistic: 


¢ Option 1 (Technology) Use the t distribution with degrees of freedom 
calculated from the data by a somewhat messy formula. The degrees of 
freedom probably won’t be a whole number. 


¢ Option 2 (Conservative) Use the t distribution with degrees of freedom 
equal to the smaller of ny — 1 and nz — 1. This method gives wider con- 
fidence intervals and larger P-values than Option 1. 


e Before estimating or testing a claim about 4; — ju2, check that these condi- 
tions are met: 
e Random: The data are produced by independent random samples of 
size n, from Population | and of size nz from Population 2 or by two 
groups of size n; and n in a randomized experiment. 


Section 10.2 Comparing Two Means 1s 653 


© 10%:When sampling without replacement, check that the two pop- 
ulations are at least 10 times as large as the corresponding samples. 


¢ Normal/Large Sample: Both population distributions (or the true dis- 
tributions of responses to the two treatments) are Normal or both sample 
sizes are large (nj = 30 and nz = 30). If either population (treatment) 
distribution has unknown shape and the corresponding sample size 
is less than 30, use a graph of the sample data to assess the Normal- 
ity of the population (treatment) distribution. Do not use two-sample 
t procedures if the graph shows strong skewness or outliers. 
e An approximate C% confidence interval for ju; — [12 is 
See eee ee 
ei ep eeeet ay 
where t* is the critical value with C% of its area between —¢* and t* for the 
t distribution with degrees of freedom from either Option | (technology) or 
Option 2 (the smaller of nm; — 1 and nz — 1). This is called a two-sample 
t interval for 4) — p2. 


e To test Ho: 44) — 2 = hypothesized value, use a two-sample t test for j4) — [42. 
The test statistic is 


or a) Ga fa) 


= 2 2 
ST 82 
nN} nz 


P-values are calculated using the t distribution with degrees of freedom from 
either Option | (technology) or Option 2 (the smaller of nj — 1 and n2 — 1). 


e Inference about the difference 1 — ju2 in the effectiveness of two treatments in 
a completely randomized experiment is based on the randomization distribu- 
tion of x; — X27. When the conditions are met, our usual inference procedures 
based on the sampling distribution of ¥; — x2 will be approximately correct. 


e Don’t use two-sample t procedures to compare means for paired data. 


Par. | e Be sure to follow the four-step process whenever you construct a confidence 
E interval or perform a significance test for comparing two means. 


TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


23. ‘Two-sample ¢ intervals on the calculator page 643 


24. ‘Two-sample ¢ tests on the calculator page 647 
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Exercises 


Remember: We are no longer reminding you to use the four-step 
process in exercises that require you to perform inference. 


STEP 


31. 
A] 638 


Cholesterol The level of cholesterol in the blood for 
all men aged 20 to 34 follows a Normal distribution 
with mean 188 milligrams per deciliter (mg/dl) and 
standard deviation +1 mg/dl. For 14-year-old boys, 
blood cholesterol levels follow a Normal distribution 


of 20 male students from their school. Then they 
recorded the number of pairs of shoes that each 
respondent reported having. The back-to-back 
stemplot displays the data. 


with mean 170 mg/dl and standard deviation 30 mg/dl. sas poe nce pe ee Be ae eS Sead hs le 
: ing in households in the United Kingdom (U.K.) and 
Suppose we select independent SRSs of 25 men : : : 
South Africa compare? ‘To help answer this question, 
aged 20 to 34 and 36 boys aged 14 and calculate the eae 1 
sample mean cholesterol levels ¥\; and Xp we used CensusAtSchool’s random data selector to 
; choose independent samples of 50 students from 
(a) What is the shape of the sampling distribution of each country. Here is a Fathom dotplot of the house- 
Xm — Xp? Why? hold sizes reported by the students in the survey. 
(b) Find the mean of the sampling distribution. Show a EEE 
Census at school comparisons 
your work. 
(c) Find the standard deviation of the sampling distribu- 
tion. Show your work. 
32. How tall? The heights of young men follow a Normal 
distribution with mean 69.3 inches and standard devia- 
tion 2.8 inches. The heights of young women follow 
a Normal distribution with mean 64.5 inches and 
standard eee 2.5 inches. Suppose we select inde- 35. Literacy rates Do males have higher average literacy 
pendent SRSs of 16 young men and 9 ye Meren rates than females in Islamic countries? ‘he table 
and calculate the sample mean heights x, and xv. below shows the percent of men and women who 
(a) What is the shape of the sampling distribution of were literate in the major Islamic nations at the time 
Xm — Xw? Why? of this writing.”” (We omitted countries with popula- 
tions of less than 3 million. 
(b) Find the mean of the sampling distribution. Show 
your work. Country Male (%) Female (%) 
(c) Find the standard deviation of the sampling distribu- Afghanistan 43 13 
tion. Show your work. Algeria 80 60 
In Exercises 33 to 36, determine whether or not the Azerbaijan 99.9 99.7 
conditions for using two-sample t procedures are met. Bangladesh 61 52 
33. Shoes How many pairs of shoes do teenagers have? Egypt BD es 
To find out, a group of AP® Statistics students Indonesia 94 86.8 
conducted a survey. ‘They selected a random sample Iran 84 70 
of 20 female students and a separate random sample Iraq 86 7 
Jordan 96 89 
carat a Kazakhstan 100 99 
0 | 555677778 Kyrgyzstan 99.3 98.1 
333 | 1 | 0000124 
95 | 1 Lebanon o8) 82 
ee || || 2 Key: 2/2 represents : 
66 | 2 a a eee with Libya 96 83 
410 | 3 22 pairs of shoes. Malaysia 92 85 
pee | 
4 Morocco 69 44 
es , 
100 | 5 Pakistan 68.6 30.3 
715 


Saudi Arabia 90 81 
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Country Male (%) Female (%) (b) Construct and interpret a 90% confidence interval for 
the difference in mean percent change in polyphenol 


Syria 86 74 
Ee ' a 40 levels for the red wine and white wine treatments. 
ajikistan 
Tee 83 65 (c) Does the interval in part (b) suggest that red wine is 
Turkey 98 00 more effective than white wine? Explain. 
Turkmenistan 99.3 98.3 38. Tropical flowers Different varieties of the tropical 
(lebeuistan 400 99 flower Heliconia are fertilized by different species of 
¥ a ie hummingbirds. 
men 


36. Long words Mary was interested in comparing the 
mean word length in articles from a medical journal 
and an airline’s in-flight magazine. She counted the 
number of letters in the first 400 words of an article in 
the medical journal and in the first 100 words of an ar- 
ticle in the airline magazine. Mary then used Minitab 
statistical software to produce the histograms shown. 
Note that J is for journal and M is for magazine. 


aa JLength MLength 
25 
2 20; i r 
5 Researchers believe that over time, the lengths of the 
Ge [| flowers and the forms of the hummingbirds’ beaks 
104 have evolved to match each other. Here are data on 
= the lengths in millimeters for random samples of two 
color varieties of the same species of flower on the 
O24 6 8 1012140 2 4 6 8 1012 14 island of Dominica:” 


H. caribaea red 


Is red wine better than white wine? Observational 
41.90 42.01 41.93 43.09 41.17 41.69 39.78 40.57 


Bie 
po fal studies suggest that moderate use of alcohol by adults 


& reduces heart attacks and that red wine may have 39.63 42.18 40.66 37.87 39.16 37.40 38.20 38.07 
special benefits. One reason may be that red wine 38.10 37.97 38.79 38.23 38.87 37.78 38.01 
contains polyphenols, substances that do good things 
- Seats the blood and ne) ae the risk H. caribaea yellow 
of heart attacks. In an experiment, healthy men were 36.78 37.02 36.52 36.11 36.03 35.45 3813 37.10 


assigned at random to drink half a bottle of either 
red or white wine each day for two weeks. The level 
of polyphenols in their blood was measured before (a) A Fathom dotplot of the data is shown below. Write a 
and after the two-week period. Here are the percent few sentences comparing the distributions. 

changes in level for the subjects in both groups:”° 


35.17 36.82 36.66 35.68 36.03 34.57 34.63 


Redwine: 3.5 8.1 74 40 07 49 84 7.0 5.5 
White wine: 3.1 0.5 -3.8 41 -06 27 19 -5.9 0.1 


(a) A Fathom dotplot of the data is shown below. Write a 
few sentences comparing the distributions. 


(b) Construct and interpret a 95% confidence interval 
for the difference in the mean lengths of these two 
varieties of flowers. 


3 
© 
z 
= 


° op _eee (c) Does the interval support the researchers’ belief 
asics hiatal 1 @ that the two flower varieties have different average 
os. lengths? Explain. 
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Paying for college College financial aid offices ex- 
pect students to use summer earnings to help pay for 
college. But how large are these earnings? One large 
university studied this question by asking a random 
sample of 1296 students who had summer jobs how 
much they earned. The financial aid office separated 
the responses into two groups based on gender. Here 
are the data in summary form:*” 


breeding next year? Researchers randomly assigned 

7 pairs of birds to have the natural caterpillar supply 
supplemented while feeding their young and another 
6 pairs to serve as a control group relying on natural 
food supply. The next year, they measured how many 
days after the caterpillar peak the birds produced 
their nestlings. ** The investigators expected the 
control group to adjust their breeding date the next 
year, whereas the well-fed supplemented group had 


Group n Xx S no reason to change. Here are the data (days after 

Males 675 $1884.52 $1368.37 cherpill peas: 

Females 621 $1360.39 $1037.46 

Control: A'G ee: O hfe O! On Olle 

How can you tell from the summary statistics that Supplemented: lows ke) Gh gles) WIS) Ie 
the distribution of earnings in each group is strongly 
k the right? Th f£ two-sampl = ; At . : 
SW eto ener Sig ea ae rploce (a) Do the data provide convincing evidence to confirm 


dures is still justified. Why? 


Construct and interpret a 90% confidence interval 
for the difference between the mean summer earn- 
ings of male and female students at this university. 


(b) 


Interpret the 90% confidence level in the context of 
this study. 


Happy customers As the Hispanic population 

in the United States has grown, businesses have 
tried to understand what Hispanics like. One study 
interviewed a random sample of customers leaving a 
bank. Customers were classified as Hispanic if they 
preferred to be interviewed in Spanish or as Anglo 
if they preferred English. Each customer rated the 
importance of several aspects of bank service on a 
10-point scale.*' Here are summary results for the 
importance of “reliability” (the accuracy of account 
records and so on): 


Group n x Sy 
Anglo 92 6.37 0.60 
Hispanic 86 5.91 0.93 


The distribution of reliability ratings in each group 
is not Normal. The use of two-sample t procedures is 


still justified. Why? 


(a) 


Construct and interpret a 95% confidence interval 


b 
for the difference between the mean ratings of the ©) 
importance of reliability for Anglo and Hispanic 
bank customers. 43. 


Interpret the 95% confidence level in the context of 
this study. 


Baby birds Do birds learn to time their breeding? 
Blue titmice eat caterpillars. The birds would like 
lots of caterpillars around when they have young to 
feed, but they must breed much earlier. Do the birds 
learn from one year’s experience when to time their 


Pe 


the researchers’ belief? 


Interpret the P-value from part (a) in the context of 
this study. 


DDT in rats Poisoning by the pesticide DDT 
causes convulsions in humans and other mammals. 
Researchers seek to understand how the convulsions 
are caused. In a randomized comparative experi- 
ment, they compared 6 white rats poisoned with 
DDT with a control group of 6 unpoisoned rats. 
Electrical measurements of nerve activity are the 
main clue to the nature of DDT poisoning. When 

a nerve is stimulated, its electrical response shows a 
sharp spike followed by a much smaller second spike. 
The researchers measured the height of the second 
spike as a percent of the first spike when a nerve in 
the rat’s leg was stimulated.** For the poisoned rats, 
the results were 


12.207 16.869 25.050 22.429 8.456 20.589 


The control group data were 


11.074 9.686 12.064 9.351 8.182 6.642 


Do these data provide convincing evidence that 
DDT affects the mean relative height of the second 
spike’s electrical response? 


Interpret the P-value from part (a) in the context of 
this study. 


Who talks more—men or women? Research- 

ers equipped random samples of 56 male and 

56 female students from a large university with 

a small device that secretly records sound for a 
random 30 seconds during each 12.5-minute period 
over two days. ‘Then they counted the number of 
words spoken by each subject during each record- 
ing period and, from this, estimated how many 
words per day each subject speaks. ‘The female 


44, 


estimates had a mean of 16,177 words per day with 
a standard deviation of 7520 words per day. For 

the male estimates, the mean was 16,569 and the 
standard deviation was 9108. Do these data provide 
convincing evidence of a difference in the average 
number of words spoken in a day by male and 
female students at this university? 


Competitive rowers What aspects of rowing 
technique distinguish between novice and skilled 
competitive rowers? Researchers compared two 
randomly selected groups of female competitive 
rowers: a group of skilled rowers and a group of 
novices. ‘The researchers measured many mechani- 
cal aspects of rowing style as the subjects rowed 

on a Stanford Rowing Ergometer. One important 
variable is the angular velocity of the knee, which 
describes the rate at which the knee joint opens as 
the legs push the body back on the sliding seat. The 
data show no outliers or strong skewness. Here is 
the SAS computer output: ** 


TTEST PROCEDURE 


Variable: KNEE 


GROUP N Mean Std Dev Std Error 
SKILLED 10 4.182 0.479 (0) alsa 
NOVICE 8 3.010 0,959) 0} gshshS) 


45). 


The researchers believed that the knee velocity 
would be higher for skilled rowers. Do the data 
provide convincing evidence to support this 


belief? 


Teaching reading An educator believes that 

new reading activities in the classroom will help 
elementary school pupils improve their reading 
ability. She recruits 44 third-grade students and 
randomly assigns them into two groups. One 
group of 21 students does these new activities for 
an 8-week period. A control group of 23 third- 
graders follows the same curriculum without the 
activities. At the end of the 8 weeks, all students 
are given the Degree of Reading Power (DRP) test, 
which measures the aspects of reading ability that 
the treatment is designed to improve. Comparative 
boxplots and summary statistics for the data from 
Fathom are shown below.” 


Reading study 


10 20 30 40 S50 60 70 80 90 
DRP_score 


51.4762) 41.5217 
21 23 

11.0074) 17.1487 

S1= mean( ) = 

$2=count({ ) 

$3 = stdDev ( ) 


(a) Based on the graph and numerical summaries, write 
a few sentences comparing the DRP scores for the 
two groups. 


(b) Is the mean DRP score significantly higher for the 
students who did the reading activities? Give appro- 
priate evidence to justify your answer. 


(c) Can we conclude that the new reading activities 
caused an increase in the mean DRP score? Explain. 


46. Does breast-feeding weaken bones? Breast-feeding 
mothers secrete calcium into their milk. Some of the 
calcium may come from their bones, so mothers may 
lose bone mineral. Researchers compared a random 
sample of 47 breast-feeding women with a random 
sample of 22 women of similar age who were neither 
pregnant nor lactating. They measured the percent 
change in the bone mineral content (BMC) of the 
women’s spines over three months. Comparative 
boxplots and summary statistics for the data from 
Fathom are shown below. *° 


3 


Oo 
Notpregnant 


-10 8 6 +4 2 0 


Bone mineral study 


S1=mean( ) 
$2 = count( ) 
$3 = stdDev ( ) 


(a) Based on the graph and numerical summaries, write 
a few sentences comparing the percent changes in 
BMC for the two groups. 
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Is the mean change in BMC significantly lower for 
the mothers who are breast-feeding? Give appropri- 
ate evidence to justify your answer. 


Can we conclude that breast-feeding causes a 
mother’s bones to weaken? Why or why not? 


Who talks more—men or women? Refer to 
Exercise +3. Construct and interpret a 95% confi- 
dence interval for the difference in mean number 
of words spoken in a day. Explain how this interval 
provides more information than the significance 
test in Exercise 43. 


DDT in rats Refer to Exercise 42. Construct and 
interpret a 95% confidence interval for the difference 
in mean relative height of the second spike’s electri- 
cal response. Explain how this interval provides more 
information than the significance test in Exercise 42. 


A better drug? In a pilot study, a company’s new 
cholesterol-reducing drug outperforms the currently 
available drug. If the data provide convincing evi- 
dence that the mean cholesterol reduction with the 
new drug is more than 10 milligrams per deciliter 
of blood (mg/dl) greater than with the current drug, 
the company will begin the expensive process of 
mass-producing the new drug. For the 14 subjects 
who were assigned at random to the current drug, 
the mean cholesterol reduction was 54.1 mg/dl with 
a standard deviation of 11.93 mg/dl. For the 15 sub- 
jects who were randomly assigned to the new drug, 
the mean cholesterol reduction was 68.7 mg/dl with 
a standard deviation of 13.3 mg/dl. Graphs of the 
data reveal no outliers or strong skewness. 


Carry out an appropriate significance test. What 
conclusion would you draw? (Note that the null 
hypothesis is not Ho: f4) — [2 = 0.) 


Based on your conclusion in part (a), could you have 
made a ‘Type I error or a ‘Type II error? Justify your 
answer. 


Down the toilet A company that makes hotel 
toilets claims that its new pressure-assisted toilet 
reduces the average amount of water used by more 
than 0.5 gallon per flush when compared to its 
current model. To test this claim, the company 
randomly selects 30 toilets of each type and mea- 
sures the amount of water that is used when each 
toilet is flushed once. For the current-model toilets, 
the mean amount of water used is 1.64 gal with a 
standard deviation of 0.29 gal. For the new toilets, 
the mean amount of water used is 1.09 gal with a 
standard deviation of 0.18 gal. 


Carry out an appropriate significance test. What 
conclusion would you draw? (Note that the null 
hypothesis is not Ho: f4) — (2 = 0.) 


(b) 


mill. 


be 


Based on your conclusion in part (a), could you have 
made a‘lype I error or a ‘Type II error? Justify your 
answer. 


Rewards and creativity Dr. ‘Teresa Amabile con- 
ducted a study involving 47 college students who 
were randomly assigned to two treatment groups. 
The 23 students in one group were given a list of 
statements about external reasons (E) for writing, 
such as public recognition, making money, or 
pleasing their parents. The 24 students in the other 
group were given a list of statements about internal 
reasons (I) for writing, such as expressing yourself 
and enjoying playing with words. Both groups were 
then instructed to write a poem about laughter. Each 
student’s poem was rated separately by 12 different 
poets using a creativity scale.*’ The 12 poets’ ratings 
of each student’s poem were averaged to obtain an 
overall creativity score. 


We used Fathom software to randomly reassign the 
47 subjects to the two groups 1000 times, assuming 
the treatment received doesn’t affect each individu- 
al’s average creativity rating. The dotplot shows the 
approximate randomization distribution of x; — Xp. 


diffmeans 


Why did researchers randomly assign the subjects to 
the two treatment groups? 


In the actual experiment, ¥; — Xg = 4.15. This 
value is marked with a blue line in the figure. What 
conclusion would you draw? Justify your answer with 
appropriate evidence. 


Based on your conclusion in part (b), could you have 
made a‘lype I error or a ‘Type II error? Justify your 
answer. 


Sleep deprivation Does sleep deprivation linger for 
more than a day? Researchers designed a study using 
21 volunteer subjects between the ages of 18 and 

25. All 21 participants took a computer-based visual 
discrimination test at the start of the study. Then the 
subjects were randomly assigned into two groups. 
The 11] subjects in one group, D, were deprived of 
sleep for an entire night in a laboratory setting. The 


10 subjects in the other group, A, were allowed unre- 
stricted sleep for the night. Both groups were allowed as 
much sleep as they wanted for the next two nights. On 
Day 4, all the subjects took the same visual discrimina- 
tion test on the computer. Researchers recorded the 
improvement in time (measured in milliseconds) from 
Day 1 to Day 4 on the test for each subject.** 

We used Fathom software to randomly reassign 
the 21 subjects to the two groups 1000 times, assum- 
ing the treatment received doesn’t affect each indi- 
vidual’s time improvement on the test. The dotplot 
shows the approximate randomization distribution of 
XA Xp: 


-20 -10 0 10 20 


Explain why the researchers didn’t let the subjects 
choose whether to be in the sleep deprivation group 
or the unrestricted sleep group. 


In the actual experiment, ¥4 — Xp = 15.92. This 
value is marked with a blue line in the figure. What 
conclusion would you draw? Justify your answer with 
appropriate evidence. 


Based on your conclusion in part (b), could you have 
made a ‘Type I error or a ‘Type II error? Justify your 
answet. 


Paired or unpaired? In each of the following 
settings, decide whether you should use paired t 
procedures or two-sample t procedures to perform 
inference. Explain your choice.” 


To test the wear characteristics of two tire brands, 
Aand B, each brand of tire is randomly assigned to 
50 cars of the same make and model. 


To test the effect of background music on productivi- 
ty, factory workers are observed. For one month, each 
subject works without music. For another month, the 
subject works while listening to music on an MP3 
player. The month in which each subject listens to 
music is determined by a coin toss. 


A study was designed to compare the effectiveness of 
two weight-reducing diets. Fifty obese women who 
volunteered to participate were randomly assigned 
into two equal-sized groups. One group used Diet 
Aand the other used Diet B. The weight of each 
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woman was measured before the assigned diet and 
again after 10 weeks on the diet. 


Paired or unpaired? In each of the following 
settings, decide whether you should use paired t 
procedures or two-sample t procedures to perform 
inference. Explain your choice.*” 


To compare the average weight gain of pigs fed two 
different rations, nine pairs of pigs were used. The 
pigs in each pair were littermates. A coin toss was 
used to decide which pig in each pair got Ration A 
and which got Ration B. 


Separate random samples of male and female college 
professors are taken. We wish to compare the average 
salaries of male and female teachers. 


To test the effects of a new fertilizer, 100 plots are 
treated with the new fertilizer, and 100 plots are 
treated with another fertilizer. A computer’s random 
number generator is used to determine which plots 
get which fertilizer. 


Exercises 55 and 56 refer to the following setting. Coach- 
ing companies claim that their courses can raise the SAT 
scores of high school students. Of course, students who 
retake the SAT without paying for coaching generally raise 
their scores. A random sample of students who took the 
SAT twice found 427 who were coached and 2733 who 
were uncoached."! Starting with their Verbal scores on the 
first and second tries, we have these summary statistics: 


Try 1 Try 2 Gain 
n x Sy xi Sy x Sy 
Coached 427 500 92 529 97 29 59 
Uncoached DSS 506 101 527 101 21 52. 
55. Coaching and SAT scores Let’s first ask if students 


(a) 


56. 


who are coached increased their scores significantly. 


You could use the information on the Coached line 
to carry out either a two-sample t test comparing 
‘Try | with Try 2 for coached students or a paired 

t test using Gain. Which is the correct test? Why? 


Carry out the proper test. What do you conclude? 


Coaching and SAT scores What we really want to 
know is whether coached students improve more 
than uncoached students, and whether any advan- 
tage is large enough to be worth paying for. Use the 
information above to answer these questions: 


How much more do coached students gain on the aver- 
age? Construct and interpret a 99% confidence interval. 


Does the interval in part (a) give convincing evi- 
dence that coached students gain more, on average, 
than uncoached students? Explain. 


Based on your work, what is your opinion: do you 
think coaching courses are worth paying for? 
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Multiple choice: Select the best answer for Exercises 57 
to 60. 


57. ‘There are two common methods for measuring the 
concentration of a pollutant in fish tissue. Do the two 
methods differ, on average? You apply both methods 
to each fish in a random sample of 18 carp and use 


a) the paired t test for jug, 
b) the one-sample z test for p. 


( 
( 
(c) the two-sample t test for f4) — [u2. 
(d) the two-sample < test for p) — po. 
( 


e) none of these. 


Exercises 58 to 60 refer to the following setting. A study of road 
rage asked random samples of 596 men and 523 women 
about their behavior while driving. Based on their answers, 
each person was assigned a road rage score on a scale of 0 to 
20. The participants were chosen by random digit dialing of 
phone numbers. The researchers performed a test of the fol- 
lowing hypotheses: Ho: fq = [up versus H,: uy # Lp. 


58. Which of the following describes a ‘Type II error in 
the context of this study? 


(a) Finding convincing evidence that the true means are 
different for males and females, when in reality the 
true means are the same 


(b) Finding convincing evidence that the true means are 
different for males and females, when in reality the 
true means are different 


(c) Not finding convincing evidence that the true means 
are different for males and females, when in reality 
the true means are the same 


(d) Not finding convincing evidence that the true means 
are different for males and females, when in reality 
the true means are different 


(e) Not finding convincing evidence that the true means 
are different for males and females, when in reality there 
is convincing evidence that the true means are different 


59. The P-value for the stated hypotheses is 0.002. Inter- 
pret this value in the context of this study. 


(a) Assuming that the true mean road rage score is the 
same for males and females, there is a 0.002 prob- 
ability of getting a difference in sample means. 


(b) Assuming that the true mean road rage score is the 
same for males and females, there is a 0.002 prob- 
ability of getting an observed difference at least as 
extreme as the observed difference. 


(c) Assuming that the true mean road rage score is 
different for males and females, there is a 0.002 prob- 
ability of getting an observed difference at least as 
extreme as the observed difference. 


(d) Assuming that the true mean road rage score is the 
same for males and females, there is a 0.002 prob- 
ability that the null hypothesis is true. 


(e) Assuming that the true mean road rage score is the 
same for males and females, there is a 0.002 prob- 
ability that the alternative hypothesis is true. 


60. Based on the P-value in Exercise 59, which of the 
following must be true? 


a) A90% confidence interval for ju\y — fp will contain 0. 
b) A95% confidence interval for fy — [4p will contain 0. 
c) A99% confidence interval for juyy — fp will contain 0. 


d) A99.9% confidence interval for {uy — fp will 
contain 0). 


(e) Itis impossible to determine whether any of these 
statements is true based only on the P-value. 


In each part of Exercises 61 and 62, state which inference 
procedure from Chapter 8, 9, or 10 you would use. Be 
specific. For example, you might say, “Two-sample z test for 
the difference between two proportions.” You do not need to 
carry out any procedures. 


61. Which inference method? 


(a) Drowning in bathtubs is a major cause of death in 
children less than 5 years old. A random sample of 
parents was asked many questions related to bathtub 
safety. Overall, 85% of the sample said they used 
baby bathtubs for infants. Estimate the percent of all 
parents of young children who use baby bathtubs. 


(b) How seriously do people view speeding in compari- 
son with other annoying behaviors? A large random 
sample of adults was asked to rate a number of be- 
haviors on a scale of | (no problem at all) to 5 (very 
severe problem). Do speeding drivers get a higher 
average rating than noisy neighbors? 


(c) You have data from interviews with a random sample 
of students who failed to graduate from a particular 
college in 7 years and also from a random sample 
of students who entered at the same time and did 
graduate. You will use these data to compare the 
percents of students from rural backgrounds among 
dropouts and graduates. 


(d) Do experienced computer game players earn higher 
scores when they play with someone present to cheer 
them on or when they play alone? Fifty teenagers 
with experience playing a particular computer game 
have volunteered for a study. We randomly assign 25 
of them to play the game alone and the other 25 to 
play the game with a supporter present. Each player's 
score is recorded. 


62. 


Which inference method? 


How do young adults look back on adolescent ro- 
mance? Investigators interviewed 40 couples in their 
midtwenties. ‘The female and male partners were 
interviewed separately. Each was asked about his or 
her current relationship and also about a romantic 
relationship that lasted at least two months when 
they were aged 15 or 16. One response variable was 
a measure on a numerical scale of how much the 
attractiveness of the adolescent partner mattered. You 
want to find out how much men and women differ 
on this measure. 


Are more than 75% of Toyota owners generally 
satisfied with their vehicles? Let’s design a study 

to find out. We'll select a random sample of 400 
‘Toyota owners. Then we'll ask each individual in the 
sample: “Would you say that you are generally satis- 
fied with your Toyota vehicle?” 


Are male college students more likely to binge drink 
than female college students? The Harvard School of 
Public Health surveys random samples of male and 
female undergraduates at four-year colleges and uni- 
versities about whether they have engaged in binge 
drinking. 

A bank wants to know which of two incentive plans 
will most increase the use of its credit cards and by 
how much. It offers each incentive to a group of cur- 
rent credit card customers, determined at random, 
and compares the amount charged during the fol- 
lowing six months. 


Quality control (2.2, 5.3, 6.3) Many manufacturing 
companies use statistical techniques to ensure that the 
products they make meet standards. One common 
way to do this is to take a random sample of products 
at regular intervals throughout the production shift. 
Assuming that the process is working properly, the 
mean measurement x from a random sample varies 
according to a Normal distribution with mean ju; and 
standard deviation o,. For each question that follows, 
assume that the process is working properly. 
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What's the probability that at least one of the next 
two sample means will fall more than 20; from the 
target mean ju;? Show your work. 


What's the probability that the first sample mean that 
is greater than yu; + 20; is the one from the fourth 
sample taken? 


Plant managers are trying to develop a criterion 
for determining when the process is not working 
properly. One idea they have is to look at the 5 most 
recent sample means. If at least 4 of the 5 fall outside 
the interval (j4; — 0%, Lz + 0), they will conclude 
that the process isn’t working. 


Find the probability that at least 4 of the 5 most 
recent sample means fall outside the interval, assum- 
ing the process is working properly. Is this a reason- 
able criterion? Explain. 


Information online (8.2, 10.1) A random digit dial- 
ing sample of 2092 adults found that 1318 used the 
Internet.” Of the users, 1041 said that they expect 
businesses to have Web sites that give product infor- 
mation; 294 of the 774 nonusers said this. 


Construct and interpret a 95% confidence interval 
for the proportion of all adults who use the 
Internet. 


Construct and interpret a 95% confidence interval to 
compare the proportions of users and nonusers who 
expect businesses to have Web sites that give product 
information. 


Coaching and SAT scores: Critique (4.1, 4.3) ‘The 
data in Exercises 55 and 56 came from a random 
sample of students who took the SAT twice. The 
response rate was 63%, which is fairly good for non- 
government surveys. 


Explain how nonresponse could lead to bias in this study. 


We can’t be sure that coaching actually caused the 
coached students to gain more than the uncoached 
students. Explain briefly but clearly why this is so. 
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Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Name-brand Store-brand Name-brand Store-brand 
95 94 90 78 


88 89 97 84 
Directions: Show all your work. Indicate clearly the methods BA 99 93 86 
you use, because you will be scored on the correctness of your 94 89 94 86 
methods as well as on the accuracy and completeness of your 
results and explanations. 81 el 86 90 


Will using name-brand microwave popcorn result in Do the data provide convincing evidence that using name- 
a greater percentage of popped kernels than using store- brand microwave popcorn will result in a greater mean per- 
brand microwave popcorn? To find out, Briana and Maggie centage of popped kernels? 
randomly selected 10 bags of name-brand microwave pop- 
corn and 10 bags of store-brand microwave popcorn. ‘The 
chosen bags were arranged in a random order. Then each 
bag was popped for 3.5 minutes, and the percentage of 
popped kernels was calculated. ‘The results are displayed in 
the following table. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you think 
each solution is “complete,” “substantial,” “developing,” or “mini- 
mal.” If the solution is not complete, what improvements would you 
suggest to the student who wrote it? Finally, your teacher will provide 
you with a scoring rubric. Score your response and note what, if any- 
thing, you would do differently to improve your own score. 


Chapter Review, 


Section 10.1: Comparing Two Proportions ie: : p(l—fi) poll — po) 
In this section, you learned how to construct confidence in- (Pi — p2) +z 7 3 

tervals and perform significance tests for a difference between 
two proportions. Inference for a difference in proportions is 
based on the sampling distribution of f; — p2. When the 
conditions are met, the sampling distribution of 6; — f2 is 
approximately Normal with a mean of fug,-3, = P — pz and 


i) 
The logic of confidence intervals, including how to inter- 
pret the confidence interval and the confidence level, is 
the same as it was in Chapter 8, when you first learned 
about confidence intervals. 
Likewise, a significance test for a difference between two 


aianda:ddeviaionolowen = (2 — Pi) p p21 — pa) proportions uses the same logic as the significance tests you 
n| 12 learned about in Chapter 9. We start by assuming the null 
The conditions for inference about a difference in propor- hypothesis is true and asking how likely it would be to get 
tions are the same for confidence intervals and significance results at least as unusual as the results observed in a study by 
tests. The Random condition says that the data must be from chance alone. If it is plausible that a difference in proportions 
two independent random samples or two groups in a random- could be the result of sampling variability or the chance varia- 
ized experiment. The 10% condition says that each sample tion due to random assignment, we do not have convincing 
size should be less than 10% of the corresponding population evidence that the alternative hypothesis is true. However, if 
size when sampling without replacement. ‘The Large Counts the difference is too big to attribute to chance, there is con- 
condition says that the number of successes and number of vincing evidence to believe that the alternative hypothesis is 
failures from each sample/group should be at least 10. That true. For a test of Ho:p1 — p2 = 0, the test statistic is 


is, 2p), 2\(1 — fi), n2p2, n2(1 — 2) are all =10. Gino 
A confidence interval for a difference between two pro- z= —_ - 

portions provides an interval of plausible values for the true oe — Po) a pc(l — pe) 

difference in proportions. The formula is ny nN? 


where fc is the combined (overall) proportion of successes: 
2 PLD 
Pc Ny] ar nN? ‘ 

Finally, you learned that the inference techniques 
used for analyzing a difference in proportions from two 
independent random samples work very well for analyzing 
a difference in proportions from two groups in a completely 
randomized experiment. 


Section 10.2: Comparing Two Means 


In this section, you learned how to construct confidence in- 
tervals and perform significance tests for a difference in two 
means. Inference for a difference in means is based on the 
sampling distribution of x; — x2. When the conditions are 
met, the sampling distribution of x; — X2 is approximately 
Normal with a mean of juz,-x, = (41 — fz and a standard 

To 
deviation of o;,- x, = es 
Ny} iy) 

The conditions for inference abouta difference in means 
are the same for confidence intervals and significance tests. 
The Random condition says that the data must be from 
two independent random samples or two groups in a ran- 
domized experiment. The 10% condition says that each 
sample size should be less than 10% of the corresponding 
population size when sampling without replacement. The 
Normal/Large Sample condition says that the two popula- 
tions are Normal or that the two sample/group sizes are large 
(n, = 30, nz = 30). Ifthe sample/group sizes are small and 


What Did You Learn? 


Learning Objective 


Section 


the population shapes are unknown, graph both sets of data 
to make sure there is no strong skewness or outliers. 

As in Chapters 8 and 9, inference techniques for means 
are based on the ¢ distributions. There are two options for 
calculating the number of degrees of freedom to use. ‘The 
first option is to use technology to calculate the degrees of 
freedom. The second option is to use the smaller of n — | 
and nz — |. The technology option is preferred because it 
produces a larger number of degrees of freedom, resulting 
in narrower confidence intervals and smaller P-values. If you 
are using technology, always choose the unpooled option. 

A confidence interval for a difference between two 
means provides an interval of plausible values for the true 
difference in means. The formula is 


See ae ST. 
(Xy— x2) 2 aaa 

Use a significance test to decide between two competing hy- 
potheses about a difference in true means. The test statistic is 


= (1 = Xo) — (ei = fp) 


ae 
st 83 


Ny) 12 


where /4; — [12 is the difference specified by the null hypothesis. 

When constructing confidence intervals or performing 
significance tests for a difference in means, make sure that 
the data are not paired. If the data are paired, use the paired 
t procedures from Chapter 9. 


Related Example 
on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Describe the shape, center, and spread of the sampling distribution of 
By = Bo. 


615 R10.2 


Determine whether the conditions are met for doing inference about 
Di — Pr. 


R10.5, R10.6 


Construct and interpret a confidence interval to compare two proportions. 
Perform a significance test to compare two proportions. 


R10.2 
R10.5 


Describe the shape, center, and spread of the sampling distribution of 
iy = Tere 

Determine whether the conditions are met for doing inference for 
HT pe. 


R10.3 


R10.3, R10.4, R10.6 


Construct and interpret a confidence interval to compare two means. 
Perform a significance test to compare two means. 


R10.4 
R10.7 


Determine when it is appropriate to use two-sample ft procedures 
versus paired f procedures. 


R10.1, R10.7 
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Chapter 10 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R10.1 


— 
f 
ma 


Ss 


= 
fe) 
WS 


S 


R10.2 


Which procedure? For each of the following 
settings, say which inference procedure from 
Chapter 8, 9, or 10 you would use. Be specific. For 
example, you might say, “Two-sample < test for the 
difference between two proportions.” You do not 
need to carry out any procedures.” 


Do people smoke less when cigarettes cost more? 
A random sample of 500 smokers was selected. 
The number of cigarettes each person smoked per 
day was recorded over a one-month period before 
a 30% cigarette tax was imposed and again for one 
month after the tax was imposed. 

How much greater is the percent of senior citizens 
who attend a play at least once per year than the 
percent of people in their twenties who do so? 
Random samples of 100 senior citizens and 100 
people in their twenties were surveyed. 

You have data on rainwater collected at 16 loca- 
tions in the Adirondack Mountains of New York 
State. One measurement is the acidity of the water, 
measured by pH ona scale of 0 to 14 (the pH of 
distilled water is 7.0). Estimate the average acidity 
of rainwater in the Adirondacks. 

Consumers Union wants to see which of two 
brands of calculator is easier to use. They recruit 
100 volunteers and randomly assign them to two 
equal-sized groups. ‘The people in one group 

use Calculator A and those in the other group 
use Calculator B. Researchers record the time 
required for each volunteer to carry out the same 
series of routine calculations (such as figuring 
discounts and sales tax, totaling a bill) on the 
assigned calculator. 


Seat belt use The proportion of drivers who use 
seat belts depends on things like age (young people 
are more likely to go unbelted) and gender (wom- 
en are more likely to use belts). It also depends 

on local law. In New York City, police can stop a 
driver who is not belted. In Boston at the time of 
the study, police could cite a driver for not wearing 
a seat belt only if the driver had been stopped for 
some other violation. Here are data from observ- 
ing random samples of female Hispanic drivers in 
these two cities:** 


City Drivers Belted 
New York 220 183 
Boston 117 68 


(a) Calculate the standard error of the sampling distri- 


R10.3 


R10.4 


= 


7 


bution of the difference in the proportions of female 
Hispanic drivers in the two cities who wear seat 
belts. What information does this value provide? 
Construct and interpret a 95% confidence inter- 

val for the difference in the proportions of female 
Hispanic drivers in the two cities who wear seat belts. 


Expensive ads Consumers who think a product’s 
advertising is expensive often also think the prod- 
uct must be of high quality. Can other information 
undermine this effect? ‘To find out, marketing 
researchers did an experiment. ‘he subjects were 
90 women from the clerical and administrative 
staff of a large organization. All subjects read an 

ad that described a fictional line of food products 
called “Five Chefs.” The ad also described the ma- 
jor T’'V commercials that would soon be shown, an 
unusual expense for this type of product. The 45 
women who were randomly assigned to the control 
group read nothing else. ‘The 45 in the “under- 
mine group” also read a news story headlined “No 
Link between Advertising Spending and New 
Product Quality.” All the subjects then rated the 
quality of Five Chefs products on a 7-point scale. 
The study report said, “The mean quality ratings 
were significantly lower in the undermine treat- 
ment (¥4 = 4.56) than in the control treatment 
(te = 505 = Zot P-—0ne 

The 90 women who participated in the study were 
not randomly selected from a population. Explain 
why the Random condition is still satisfied. 

The distribution of individual responses is not Nor- 
mal, because there is only a 7-point scale. What is 
the shape of the sampling distribution of Xo — X,4? 
Explain. 

Interpret the P-value in context. 

Men versus women ‘I'he National Assessment of 
Educational Progress (NAEP) Young Adult Literacy 
Assessment Survey interviewed a random sample 

of 1917 people 21 to 25 years old. ‘The sample con- 
tained 840 men and 1077 women.* The mean and 
standard deviation of scores on the NAEP’s test of 
quantitative skills were ¥; = 272.40 and s; = 59.2 
for the men in the sample. For the women, the 
results were X72 = 274.73 and s2 = 57.5. 


Construct and interpret a 90% confidence inter- 
val for the difference in mean score for male and 
female young adults. 

Based only on the interval from part (a), is there 
convincing evidence of a difference in mean score 
for male and female young adults? 


R10.5 Treating AIDS The drug AZT was the first drug 


R10.6 


(b) 


that seemed effective in delaying the onset of 
AIDS. Evidence for AZ'I’s effectiveness came 
from a large randomized comparative experi- 
ment. The subjects were 870 volunteers who 
were infected with HIV, the virus that causes 
AIDS, but did not yet have AIDS. The study 
assigned +35 of the subjects at random to take 
500 milligrams of AZT each day and another 435 
to take a placebo. At the end of the study, 38 of 
the placebo subjects and 17 of the AZT subjects 
had developed AIDS. 


Do the data provide convincing evidence at the 

a = 0.05 level that taking AZT lowers the propor- 
tion of infected people who will develop AIDS in a 
given period of time? 


Describe a ‘Type I error and a ‘Type II error in 
this setting and give a consequence of each error. 
Based on your conclusion in part (a), which error 
could have been made in this study? 


Conditions Explain why it is not safe to use the 
methods of this chapter to perform inference in 
each of the following settings. 


Lyme disease is spread in the northeastern United 
States by infected ticks. ‘The ticks are infected 
mainly by feeding on mice, so more mice result in 
more infected ticks. The mouse population in turn 
rises and falls with the abundance of acorns, their 
favored food. E:xperimenters studied two similar 
forest areas in a year when the acorn crop failed. 
They added hundreds of thousands of acorns to 
one area to imitate an abundant acorn crop, while 
leaving the other area untouched. The next spring, 
54 of the 72 mice trapped in the first area were 

in breeding condition, versus 10 of the 17 mice 
trapped in the second area.” 

Who texts more —males or females? For their 
final project, a group of AP® Statistics students 
investigated their belief that females text more 
than males. They asked a random sample of 31 
students— 15 males and 16 females—from their 
school to record the number of text messages sent 
and received over a 2-day period. Boxplots of their 
data are shown below. 


oo 
413 214 


Males 


O16 28 883 127 


Females 


0 100 200 300 400 
Number of text messages in 2-day period 
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Chapter Review Exercises ix cas 


Each day I am getting better in math A “sublimi- 
nal” message is below our threshold of awareness 
but may nonetheless influence us. Can subliminal 
messages help students learn math? A group of 18 
students who had failed the mathematics part of 
the City University of New York Skills Assessment 
‘Test agreed to participate in a study to find out. 
All received a daily subliminal message, flashed 
on a screen too rapidly to be consciously read. 
The treatment group of 10 students (assigned at 
random) was exposed to “Each day I am getting 
better in math.” The control group of 8 students 
was exposed to a neutral message, “People are 
walking on the street.” All 18 students participated 
in a summer program designed to raise their math 
skills, and all took the assessment test again at the 
end of the program. The table below gives data on 
the subjects’ scores before and after the program.** 


Treatment Group Control Group 


Pretest Posttest Difference Pretest Posttest Difference 


18 
18 
21 
18 
18 
20 
23 
23 
21 
17 


24 6 18 29 11 
25 7 24 29 5 
33 12 20 24 4 
29 11 18 26 8 
33 15 24 38 14 
36 16 22 27 5 
34 11 15 22 7 
36 13 19 31 12 
34 13 

27 10 


(a) 
(b) 


(c) 
(d) 


Explain why a two-sample t test is more appropri- 
ate than a paired t test for analyzing these data. 
The Fathom boxplots below display the differences 
in pretest and posttest scores for the students in the 
control (Cdiff) and treatment (‘Tdiff) groups. Write 
a few sentences comparing the performance of 
these two groups. 


(Box Piot 9 


Subliminal messages 


Do the data provide convincing evidence that 
subliminal messages help students learn math? 
Can we generalize these results to the population 
of all students who failed the mathematics part of 
the City University of New York Skills Assessment 
‘Test? Why or why not? 
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Chapter 10 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T10.1 A study of road rage asked separate random samples as a nuisance by the operators and deliberately 


of 596 men and 523 women about their behavior 
while driving. Based on their answers, each respon- 
dent was assigned a road rage score on a scale of 0 
to 20. Are the conditions for performing a two- 
sample t test satisfied? 


(a) Maybe; we have independent random samples, but 
we need to look at the data to check Normality. 

(b) No; road rage scores in a range between 0 and 20 
can’t be Normal. 

(c) No; we don’t know the population standard 
deviations. 

(d) Yes; the large sample sizes guarantee that the corre- 
sponding population distributions will be Normal. 

(e) Yes; we have two independent random samples and 
large sample sizes. 


T10.2 Thirty-five people from a random sample of 125 


workers from Company A admitted to using sick 
leave when they weren't really ill. Seventeen em- 
ployees from a random sample of 68 workers from 
Company B admitted that they had used sick leave 
when they weren’t ill. A 95% confidence interval for 
the difference in the proportions of workers at the 
two companies who would admit to using sick leave 
when they weren’t ill is 


_ [@28N0.72) (0250.75) 
o 03 = yf 125 68 


(0.28)(0.72) _ (0.25)(0.75) 


(b) 0.03 + 1.96] 


125 68 
(c) 0.03 + 1.6454) ee ne 

(d) 0.03 + 1.964] — Oe 
(e) 0.03 + 1.645] ee ae? 


T10.3 The power takeoff driveline on tractors used in 


agriculture is a potentially serious hazard to opera- 
tors of farm equipment. The driveline is covered by 
a shield in new tractors, but for a variety of reasons, 
the shield is often missing on older tractors. ‘I'wo 
types of shields are the bolt-on and the flip-up. It 
was believed that the bolt-on shield was perceived 


removed, but the flip-up shield is easily lifted for 
inspection and maintenance and may be left in 
place. In a study initiated by the U.S. National 
Safety Council, random samples of older tractors 
with both types of shields were taken to see what 
proportion of shields were removed. Of 183 trac- 
tors designed to have bolt-on shields, 35 had been 
removed. Of the 136 tractors with flip-up shields, 
15 were removed. We wish to perform a test of 
Ho: pp = peversus Hy: pp > pp where py, and prare 
the proportions of all tractors with the bolt-on and 
flip-up shields removed, respectively. Which of 
the following is not a condition for performing the 
significance test? 


a) Both populations are Normally distributed. 

b) 

(c) 

(d) The counts of successes and failures are large enough 
to use Normal calculations. 


( 
(b) ‘The data come from two independent samples. 


Both samples were chosen at random. 


(e) Both populations are at least 10 times the corre- 
sponding sample sizes. 


T10.4 A quiz question gives random samples of n = 10 
observations from each of two Normally distributed 
populations. Tom uses a table of ¢ distribution criti- 
cal values and 9 degrees of freedom to calculate a 
95% confidence interval for the difference in the 
two population means. Janelle uses her calculator’s 
two-sample t interval with 16.87 degrees of freedom 
to compute the 95% confidence interval. Assume 
that both students calculate the intervals correctly. 
Which of the following is true? 


(a) ‘Tom’s confidence interval is wider. 


) 
(b) 
) 
) 


Janelle’s confidence interval is wider. 


(c) Both confidence intervals are the same. 


(d) There is insufficient information to determine which 
confidence interval is wider. 


(e) Janelle made a mistake; degrees of freedom has to be 
a whole number. 


Exercises T10.5 and T10.6 refer to the following setting. A 
researcher wished to compare the average amount of time 
spent in extracurricular activities by high school students 
in a suburban school district with that in a school district 
of a large city. The researcher obtained an SRS of 60 high 
school students in a large suburban school district and found 
the mean time spent in extracurricular activities per week 


to be 6 hours with a standard deviation of 3 hours. ‘The 
researcher also obtained an independent SRS of 40 high 
school students in a large city school district and found the 
mean time spent in extracurricular activities per week to 
be 5 hours with a standard deviation of 2 hours. Suppose 
that the researcher decides to carry out a significance test of 
Ao: suburban = Heity Versus a two-sided alternative. 


T10.5 The correct test statistic is 


6= 5)=0 
= O28 
Sao 
0 0 
6=5)=—0 
ie ( ) 
32 
60° 40 
Bio >) =v 
eae 
Vo0 V40 
6=5)=0 
3 2 
—— + —— 
60 40 
6=5)—0 
6 fpaes ) 
32 22 
60 | 40 
T10.6 The P-value for the test is 0.048. A correct conclu- 
sion is to 


(a) fail to reject Hp at the a = 0.05 level. ‘There is con- 
vincing evidence of a difference in the average time 
spent on extracurricular activities by students in the 
suburban and city school districts. 


(b) fail to reject Hp at the a = 0.05 level. There is not 
convincing evidence of a difference in the average 
time spent on extracurricular activities by students in 
the suburban and city school districts. 


(c) fail to reject Hp at the a = 0.05 level. There is con- 
vincing evidence that the average time spent on ex- 
tracurricular activities by students in the suburban 
and city school districts is the same. 


(d) reject Hp at the a = 0.05 level. There is not con- 
vincing evidence of a difference in the average time 
spent on extracurricular activities by students in the 
suburban and city school districts. 


(e) reject Ho at the a = 0.05 level. There is convincing 
evidence of a difference in the average time spent on 
extracurricular activities by students in the suburban 
and city school districts. 

110.7 Ata baseball game, 42 of 65 randomly selected 
people own an iPod. At a rock concert occurring 
at the same time across town, 34 of 52 randomly 
selected people own an iPod. A researcher wants 
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to test the claim that the proportion of iPod owners 
at the two venues is different. A 90% confidence 
interval for the difference in population propor- 
tions (game — concert) is (—0.154, 0.138). Which 
of the following gives the correct outcome of the 
researcher's test of the claim? 


(a) Because the confidence interval includes 0, the re- 
searcher can conclude that the proportion of iPod 
owners at the two venues is the same. 


(b) Because the center of the interval is —0.008, the re- 
searcher can conclude that a higher proportion of 
people at the rock concert own iPods than at the 
baseball game. 


(c) Because the confidence interval includes 0, the re- 
searcher cannot conclude that the proportion of iPod 
owners at the two venues is different. 


(d) Because the confidence interval includes more 
negative than positive values, the researcher can 
conclude that a higher proportion of people at 
the rock concert own iPods than at the baseball 
game. 


(e) The researcher cannot draw a conclusion about a 
claim without performing a significance test. 


T10.8 An SRS of size 100 is taken from Population A 
with proportion 0.8 of successes. An indepen- 
dent SRS of size 400 is taken from Population B 
with proportion 0.5 of successes. The sampling 
distribution for the difference (Population A — 
Population B) in sample proportions has what 
mean and standard deviation? 


(a) mean = 0.3; standard deviation = 1.3 
(b) mean = 0.3; standard deviation = 0.40 
(c) mean = 0.3; standard deviation = 0.047 
(d) mean = 0.3; standard deviation = 0.0022 


(e) mean = 0.3; standard deviation = 0.0002 


T10.9 How much mote effective is exercise and drug treat- 
ment than drug treatment alone at reducing the rate 
of heart attacks among men aged 65 and older? To 
find out, researchers perform a completely random- 
ized experiment involving 1000 healthy males in 
this age group. Half of the subjects are assigned to 
receive drug treatment only, while the other half are 
assigned to exercise regularly and to receive drug 
treatment. ‘he most appropriate inference method 
for answering the original research question is 


a) one-sample z test for a proportion. 


( 
(b) two-sample z interval for p; — p2. 
( 
( 


) 
) 

c) two-sample z test for p; — p2. 

d) two-sample t interval for j4) — [up. 
) 


(e) two-sample t test for ju) — po. 
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110.10 Researchers are interested in evaluating the effect 
of a natural product on reducing blood pressure. 
This will be done by comparing the mean reduc- 
tion in blood pressure of a treatment (natural 
product) group and a placebo group using a 
two-sample t test. The researchers would like to 
be able to detect whether the natural product 
reduces blood pressure by at least 7 points more, 
on average, than the placebo. If groups of size 50 
are used in the experiment, a two-sample t test 
using a = 0.01 will have a power of 80% to detect 
a 7-point difference in mean blood pressure 


reduction. If the researchers want to be able to 
detect a 5-point difference instead, then the power 
of the test 


(a) would be less than 80%. 

(b) would be greater than 80%. 

(c) would still be 80%. 
) 


(d) could be either less than or greater than 80%, de- 
pending on whether the natural product is effective. 


a 


(e) would vary depending on the standard deviation of 
the data. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T10.11 Researchers wondered whether maintaining a 
patient’s body temperature close to normal by 
heating the patient during surgery would affect 
wound infection rates. Patients were assigned at 
random to two groups: the normothermic group 
(patients’ core temperatures were maintained at 
near normal, 36.5°C, with heating blankets) and 
the hypothermic group (patients’ core tempera- 
tures were allowed to decrease to about 34.5°C). 
If keeping patients warm during surgery alters the 
chance of infection, patients in the two groups 
should have hospital stays of very different lengths. 
Here are summary statistics on hospital stay (in 
number of days) for the two groups: 


Group n xX Sy 
Normothermic 104 V2.1 44 
Hypothermic 96 14.7 6.5 


(a) Construct and interpret a 95% confidence inter- 
val for the difference in the true mean length of 
hospital stay for normothermic and hypothermic 
patients. 


(b) Does your interval in part (a) suggest that keep- 
ing patients warm during surgery affects the aver- 
age length of patients’ hospital stays? Justify your 
answer. 


(c) Interpret the meaning of “95% confidence” in the 
context of this study. 


T10.12 A random sample of 100 of a certain popular car 
model last year found that 20 had a certain minor 


defect in the brakes. ‘The car company made an 
adjustment in the production process to try to 
reduce the proportion of cars with the brake prob- 
lem. A random sample of 350 of this year’s model 


found that 50 had the minor brake defect. 


(a) Was the company’s adjustment successful? Carry 
out an appropriate test to support your answer. 


(b) Describe a Type I error and a ‘Type II error in this 
setting, and give a possible consequence of each. 


T10.13 Pat wants to compare the cost of one- and two- 
bedroom apartments in the area of her college 
campus. She collects data for a random sample of 
10 advertisements of each type. The table below 
shows the rents (in dollars per month) for the 
selected apartments. 


500 650 600 505 450 550 515 495 650 395 
595 500 580 650 675 675 750 500 495 670 


1 bedroom: 
2 bedroom: 


Pat wonders if two-bedroom apartments rent for 
significantly more, on average, than one-bedroom 
apartments. 


(a) State an appropriate pair of hypotheses for a signifi- 
cance test. Be sure to define any parameters you use. 

(b) Name the appropriate test and show that the condi- 
tions for carrying out this test are met. 

(c) ‘The appropriate test from part (b) yields a P-value 
of 0.029. Interpret this P-value in context. 

(d) What conclusion should Pat draw at the a = 0.05 


significance level? Explain. 


AP3.1 


AP3.2 


Section I: Multiple Choice Choose the best answer. 


Suppose the probability that a softball player gets a 
hit in any single at-bat is 0.300. Assuming that her 
chance of getting a hit on a particular time at bat 
is independent of her other times at bat, what is 
the probability that she will not get a hit until her 
fourth time at bat in a game? 


(5)osro2y (d) (0.3)3(0.7)! 


(5030.7) (e) (0.3)!(0.7)3 


(Foro 


The probability that Color Me Dandy wins a horse 
race at Batavia Downs given good track conditions is 
0.60. The probability of good track conditions on any 
given day is 0.85. What is the probability that Color 
Me Dandy wins or the track conditions are good? 


0.94  (b) 0.51 (ce) 0.49 (d)-(0.06 


The answer cannot be determined from the given 
information. 


Sports Illustrated planned to ask a random sample 
of Division I college athletes, “Do you believe per- 
formance-enhancing drugs are a problem in college 
sports?” How many athletes must be interviewed 

to estimate the proportion concerned about use of 


drugs within +2% with 90% confidence? 


17 (c) 1680 (e) 2401 
21 (d) 1702 
The distribution of grade point averages for a certain 


college is approximately Normal with a mean of 2.5 
and a standard deviation of 0.6. Within which of the 
following intervals would we expect to find approxi- 
mately 81.5% of all GPAs for students at this college? 


(07 31) (enn ea 7) (e) (0.7, 43) 
(bso 0) (qd) (1.9743) 
Which of the following will increase the power of a 


significance test? 


Increase the Type II error probability. 
Decrease the sample size. 


Reject the null hypothesis only if the P-value is 
smaller than the level of significance. 


Increase the significance level a. 


Select a value for the alternative hypothesis closer to 
the value of the null hypothesis. 
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AP3.6 You can find some interesting polls online. Anyone 


can become part of the sample just by clicking on 
a response. One such poll asked, “Do you prefer 
watching first-run movies at a movie theater, or 
waiting until they are available to watch at home or 
ona digital device?” In all, 8896 people responded, 
with only 12% (1118 people) saying they preferred 
theaters. You can conclude that 


(a) American adults strongly prefer watching movies at 


(d 


~ 


home or on their digital devices. 


the high nonresponse rate prevents us from drawing 
a conclusion. 


the sample is too small to draw any conclusion. 


the poll uses voluntary response, so the results tell us 
little about all American adults. 


(e) American adults strongly prefer seeing movies at a 


movie theater. 


AP3.7 A certain candy has different wrappers for various 


(a) 
(b) 


holidays. During Holiday 1, the candy wrappers are 
30% silver, 30% red, and 40% pink. During Holiday 
2, the wrappers are 50% silver and 50% blue. Forty 
pieces of candy are randomly selected from the 
Holiday | distribution, and 40 pieces are randomly 
selected from the Holiday 2 distribution. What are 
the expected value and standard deviation of the 
total number of silver wrappers? 


32,184  (c) 32, 4.29 
32,6.06  (d) 80, 18.4 


(e) 80, 4.29 


AP3.8 A beef rancher randomly sampled 42 cattle from 


her large herd to obtain a 95% confidence interval 
to estimate the mean weight of the cows in the 
herd. The interval obtained was (1010, 1321). If 
the rancher had used a 98% confidence interval 
instead, the interval would have been 


wider and would have less precision than the origi- 
nal estimate. 


wider and would have more precision than the origi- 
nal estimate. 


wider and would have the same precision as the orig- 
inal estimate. 


narrower and would have less precision than the 
original estimate. 


narrower and would have more precision than the 
original estimate. 
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(a 
(b 
(c 


) 
) 
) 
(d) 


(e) 


AP3.10 


AP3.11 
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School A has 400 students and School B has 2700 
students. A local newspaper wants to compare the 
distributions of SAT scores for the two schools. 
Which of the following would be the most useful 
for making this comparison? 


Back-to-back stemplots for A and B 
A scatterplot of A versus B 
Dotplots for A and B drawn on the same scale 


Two relative frequency histograms of A and B 
drawn on the same scale 


Two bar graphs for A and B drawn on the same 
scale 


Let X represent the outcome when a fair six-sided 
die is rolled. For this random variable, py = 3.5 
and ox = 1.71. Ifthe die is rolled 100 times, what 
is the approximate probability that the total score 
is at least 375? 


0.0000 (c) 0.0721 
0.0017 (d) 0.4420 


An agricultural station is testing the yields for six 
different varieties of seed corn. The station has 
four large fields available, which are located in 
four distinctly different parts of the county. The 
agricultural researchers consider the climatic and 
soil conditions in the four parts of the county as 
being unequal but are reasonably confident that 
the conditions within each field are fairly similar 
throughout. The researchers divide each field 
into six sections and then randomly assign one 
variety of corn seed to each section in that field. 
This procedure is done for each field. At the end 
of the growing season, the com will be harvested, 
and the yield, measured in tons per acre, will be 
compared. Which one of the following statements 
about the design is correct? 


(e) 0.9279 


This is an observational study because the research- 
ers are watching the corn grow. 

This a randomized block design with fields as 
blocks and seed types as treatments. 


This is a randomized block design with seed types 
as blocks and fields as treatments. 


This is a completely randomized design because the six 
seed types were randomly assigned to the four fields. 


This is a completely randomized design with 24 
treatments—6 seed types and 4 fields. 


The correlation between the heights of fathers and 
the heights of their (grownup) sons is r = 0.52, 
both measured in inches. If fathers’ heights were 
measured in feet instead, the correlation between 
heights of fathers and heights of sons would be 


(a 


( 


( 
( 


b 


d 


much smaller than 0.52. 
slightly smaller than 0.52. 
unchanged; equal to 0.52. 
slightly larger than 0.52. 


(e) much larger than 0.52. 
AP3.13 A random sample of 200 New York State voters in- 


0.44 — 0.47) + 1.96 


cluded 88 Republicans, while a random sample of 
300 California voters produced 141 Republicans. 
Which of the following represents the 95% confi- 
dence interval that should be used to estimate the 
true difference in the proportions of Republicans 
in New York State and California? 


(0.44)(0.56) + (0.47)(0.53) 


/200 + 300 
Nee re ee ae Oe) 
7a 300 
. (0.44)(0.56)  (0.47)(0.53) 
0.44 — 0.47) + 1.964] ai ah 
a (0.44)(0.56) + (0.47)(0.53) 
0.44 — 0.47) + 1.964] TT 
. (0.45)(0.55)  (0.45)(0.55) 
0.44 — 0.47) + 1.96] =a ai 


AP3.14 Which of the following is not a property of a 


(a) 


( 


b) 


binomial setting? 


Outcomes of different trials are independent. 

The chance process consists of a fixed number of 
trials, 7. 

The probability of success is the same for each trial. 
‘Trials are repeated until a success occurs. 

Each trial can result in either a success or a failure. 


Mrs. Woods and Mrs. Bryan are avid vegetable 
gardeners. They use different fertilizers, and each 
claims that hers is the best fertilizer to use when 
growing tomatoes. Both agree to do a study using 
the weight of their tomatoes as the response vari- 
able. ‘They had each planted the same varieties 
of tomatoes on the same day and fertilized the 
plants on the same schedule throughout the 
growing season. At harvest time, they each 
randomly select 15 tomatoes from their respec- 
tive gardens and weigh them. After performing 

a two-sample t test on the difference in mean 
weights of tomatoes, they get t = 5.24 and P = 
0.0008. Can the gardener with the larger mean 
claim that her fertilizer caused her tomatoes to 
be heavier? 


(a) Yes, because a different fertilizer was used on each 


garden. 


(b) Yes, because random samples were taken from 


each garden. 


(c) Yes, because the P-value is so small. 


AP3.17 


na 


No, because the soil conditions in the two gardens 
is a potential confounding variable. 


No, because there was no replication. 


The Environmental Protection Agency is charged 
with monitoring industrial emissions that pollute 
the atmosphere and water. So long as emission lev- 
els stay within specified guidelines, the EPA does 
not take action against the polluter. [f the polluter 
is in violation of the regulations, the offender can 
be fined, forced to clean up the problem, or pos- 
sibly closed. Suppose that for a particular industry 
the acceptable emission level has been set at no 
more than 5 parts per million (5 ppm). The null 
and alternative hypotheses are Ho: pp = 5 versus 
H,:p > 5. Which of the following describes a 
‘Type I error? 


The EPA fails to find convincing evidence that 
emissions exceed acceptable limits when, in fact, 
they are within acceptable limits. 

The EPA finds convincing evidence that emissions 
exceed acceptable limits when, in fact, they are 
within acceptable limits. 

The EPA finds convincing evidence that emissions 
exceed acceptable limits when, in fact, they do 
exceed acceptable limits. 

The EPA takes more samples to ensure that they 
make the correct decision. 

The EPA fails to find convincing evidence that 
emissions exceed acceptable limits when, in fact, 
they do exceed acceptable limits. 


Which of the following is false? 


(a) A measure of center alone does not completely 


(b 


) 


describe the characteristics of a set of data. Some 
measure of spread is also needed. 


If the original measurements are in inches, con- 
verting them to centimeters will not change the 
mean or standard deviation. 

One of the disadvantages of a histogram is that it 
doesn’t show each data value. 

Between the range and the interquartile range, the 
IQR isa better measure of spread if there are outliers. 
If a distribution is skewed, the median and inter- 
quartile range should be reported rather than the 
mean and standard deviation. 


AP3.18 A 96% confidence interval for the proportion of 


the labor force that is unemployed in a certain city 


— 
oO 
w~ 


AP3.20 


(a) 
(b) 
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is (0.07, 0.10). Which of the following statements 
about this interval is true? 


The probability is 0.96 that between 7% and 10% of 
the labor force is unemployed. 


About 96% of the intervals constructed by this 
method will contain the true proportion of unem- 
ployed in the city. 

In repeated samples of the same size, there is a 96% 


chance that the sample proportion will fall between 
0.07 and 0.10. 


The true rate of unemployment lies within this in- 
terval 96% of the time. 


Between 7% and 10% of the labor force is unem- 
ployed 96% of the time. 


A large toy company introduces a lot of new toys 
to its product line each year. ‘The company wants 
to predict the demand as measured by y, first-year 
sales (in millions of dollars) using x, awareness 
of the product (as measured by the percent of 
customers who had heard of the product by the 
end of the second month after its introduction). 
A random sample of 65 new products was taken, 
and a correlation of 0.96 was computed. Which 
of the following is a correct interpretation of this 
value? 


Ninety-six percent of the time, the least-squares re- 
gression line accurately predicts first-year sales. 


About 92% of the time, the percent of people who 
have heard of the product by the end of the second 
month will correctly predict first-year sales. 


About 92% of first-year sales can be accounted for 
by the percent of people who have heard of the 
product by the end of the second month. 


For each increase of 1% in awareness of the new 
product, the predicted sales will go up by 0.96 mil- 
lion dollars. 


About 92% of the variation in first-year sales can be 
accounted for by the least-squares regression line 
with percent of people who have heard of the prod- 
uct by the end of the second month as the explana- 
tory variable. 


Final grades for a class are approximately Normally 
distributed with a mean of 76 and a standard devia- 
tion of 8. A professor says that the top 10% of the class 
will receive an A, the next 20% a B, the next 40% a 

C, the next 20% a D, and the bottom 10% an F. What 
is the approximate maximum grade a student could 
attain and still receive an F for the course? 


70 (c) 65.75 (e) 57 
69.27 (d) 62.84 
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AP3.21 National Park rangers keep data on the bears that 
inhabit their park. Below is a histogram of the 
weights of 143 bears measured in a recent year. 


Frequency 
8 
| 


Ps 
i= 
! 


40 80 120 160 200 240 280 320 360 400 440 480 520 
Weight (pounds) 


Which statement below is correct? 


(a) ‘The median will lie in the interval (140, 180), and 
the mean will lie in the interval (180, 220). 


(b) The median will lie in the interval (140, 180), and 
the mean will lie in the interval (260, 300). 


(c) ‘The median will lie in the interval (100, 140), and 
the mean will lie in the interval (180, 220). 


(d) ‘The mean will lie in the interval (140, 180), and 
the median will lie in the interval (260, 300). 


(e) ‘The mean will lie in the interval (100, 140), and 
the median will lie in the interval (180, 220). 


AP3.22 A random sample of size n will be selected from 
a population, and the proportion of those in the 
sample who have a Facebook page will be calcu- 
lated. How would the margin of error for a 95% 
confidence interval be affected if the sample size 
were increased from 50 to 200? 


(a) It remains the same. 
(b) It is multiplied by 2. 
(c) It is multiplied by 4. 
(d) Itis divided by 2. 
(e) Itis divided by 4. 
AP3.23 A scatterplot and a least-squares regression line are 
shown in the figure below. What effect does point 


P have on the slope of the regression line and the 
correlation? 


(a) Point P increases the slope and increases the 
correlation. 


(b) Point P increases the slope and decreases the 
correlation. 


(c) Point P decreases the slope and decreases the 
correlation. 


(d) Point P decreases the slope and increases the 
correlation. 


(e) No conclusion can be drawn because the other co- 
ordinates are unknown. 


AP3.24 The following dotplots show the average high tem- 
peratures (in degrees Fahrenheit) for a sample of 
tourist cities from around the world. Both the Janu- 
ary and July average high temperatures are shown. 
What is one statement that can be made with 
certainty from an analysis of the graphical display? 


Temperatures in Tourist Cities Dot Piot 


0 20 40 60 80 100 


(a) Every city has a larger average high temperature in 
July than in January. 

(b) The distribution of temperatures in July is skewed 
right, while the distribution of temperatures in Jan- 
uary is skewed left. 

(c) The median average high temperature for January 
is higher than the median average high tempera- 
ture for July. 

(d) There appear to be outliers in the average high 
temperatures for January and July. 

(e) There is more variability in average high tempera- 
tures in January than in July. 

AP3.25 Suppose the null and alternative hypotheses for a 
significance test are defined as 


Ho: = 40 
Hy: < 40 
Which of the following specific values for H, will 
give the highest power? 
(a) w= 38 (c) w= 40 (e) w= 42 
(b) = 39 (d) w= 41 


AP3.26 


(a) 0.4975 
(b) 0.2475 


A large university is considering the establishment of 
a schoolwide recycling program. ‘To gauge interest in 
the program by means of a questionnaire, the univer- 
sity takes separate random samples of undergraduate 
students, graduate students, faculty, and staff. This is 
an example of what type of sampling design? 


Simple random sample 

Stratified random sample 

Convenience sample 

Cluster sample 

Randomized block design 

Suppose the true proportion of people who use 
public transportation to get to work in the Wash- 
ington, D.C., area is 0.45. In a simple random 
sample of 250 people who work in Washington, 


about how far do you expect the sample propor- 
tion to be from the true proportion? 


(c) 0.0315 (e) 0 
(d) 0.0009 


Questions 28 and 29 refer to the following setting. Ac- 
cording to sleep researchers, if you are between the ages 
of 12 and 18 years old, you need 9 hours of sleep to be 
fully functional. A simple random sample of 28 students 
was chosen from a large high school, and these students 
were asked how much sleep they got the previous night. 
The mean of the responses was 7.9 hours, with a standard 
deviation of 2.1 hours. 


AP3.28 


If we are interested in whether students at this 
high school are getting too little sleep, which of 
the following represents the appropriate null and 
alternative hypotheses? 


Ho: = 7.9 and Hy: pp < 7.9 
Ho: = 7.9 and H,: p # 7.9 
Ho:u = 9 and H,:u #9 
Ho: = 9 and H,:u<9 
Ho:u 9 and H,:~=9 


AP3.29 


(a) 


(d) 


AP3.30 
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Which of the following is the test statistic for the 
hypothesis test? 


79-9 9-79 79-9 
aS Oo Oe 
VB V28 38 
Uis=9 3-79 
a en oe 
Vi Vi 


Shortly before the 2012 presidential election, a 
survey was taken by the school newspaper at a very 
large state university. Randomly selected students 
were asked, “Whom do you plan to vote for in the 
upcoming presidential election?” Here is a two- 
way table of the responses by political persuasion 
for 1850 students: 


Candidate of Political Persuasion 

choice Democrat Republican Independent Total 
Obama 925 78 26 1029 
Romney 78 598 19 695 
Other 2 8 11 21 
Undecided 32 28 45 105 
Total 1037 712 101 1850 


Which of the following statements about these 
data is true? 


The percent of Republicans among the respon- 
dents is 41%. 

The marginal distribution of the variable choice 
of candidate is given by Obama: 55.6%; Romney: 
37.6%; Other: 1.1%; Undecided: 5.7%. 

About 11.2% of Democrats reported that they 
planned to vote for Romney. 


About 44.6% of those who are undecided are 
Independents. 

The conditional distribution of political persuasion 
among those for whom Romney is the candidate 
of choice is Democrat: 7.5%; Republican: 84.0%; 
Independent: 18.8% 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


AP3.31 A researcher wants to determine whether or not a 


five-week crash diet is effective over a long period 
of time. A random sample of 15 dieters is selected. 
Each person’s weight is recorded before starting 
the diet and one year after it is concluded. Based 
on the data shown at right (weight in pounds), 

can we conclude that the diet has a long-term 
effect, that is, that dieters manage to not regain the 
weight they lose? Include appropriate statistical 
evidence to justify your answer. 


1 2 3 4 5 6 7 8 


Before 158 185 176 172 164 234 258 200 

After 163 182 188 #150 161 220 235 191 
9 10 11 12 #13 14 =~ «15 

Before 228 246 198 221 236 255 231 

After 228 38237 =6209) = 220) 222 268 = 234 
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32 Starting in the 1970s, medical technology allowed 
babies with very low birth weight (VLBW, less than 
1500 grams, or about 3.3 pounds) to survive without 
major handicaps. It was noticed that these children 
nonetheless had difficulties in school and as adults. 
A long study has followed 242 randomly selected 
VLBW babies to age 20 years, along with a control 
group of 233 randomly selected babies from the 
same population who had normal birth weight.’ 


(a) Is this an experiment or an observational study? Why? 


At age 20, 179 of the VLBW group and 193 of the 
control group had graduated from high school. Is 
the graduation rate among the VLBW group sig- 
nificantly lower than for the normal-birth-weight 
controls? Give appropriate statistical evidence to 
justify your answer. 


33 A nuclear power plant releases water into a nearby 
lake every afternoon at 4:51 p.m. Environmental 
researchers are concerned that fish are being driven 
away from the area around the plant. They believe 
that the temperature of the water discharged may 
be a factor. The scatterplot below shows the tem- 
perature of the water (°C) released by the plant and 
the measured distance (in meters) from the outflow 
pipe of the plant to the nearest fish found in the 
water on eight randomly chosen afternoons. 


aS 

D 

i) 
! 
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Temperature (°C) 


Computer output from a least-squares regres- 
sion analysis on these data and a residual plot are 
shown below. 


Predictor Coef SE Coef T. P 
Constant -73.64 15.48 -4.76 0.003 
Temperature 5.7188 '0'5602 —-1O219) — 09000 


S = 11.4175 R-Sq = 94.5% R-Sq(adj) = 93.6% 
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Fitted value 


(a) Write the equation of the least-squares regression 
line. Define any variables you use. 


(b 


a 


Interpret the slope of the regression line in 

context. 

(c) Isa linear model appropriate for describing the re- 
lationship between temperature and distance to 
the nearest fish? Justify your answer. 

(d) Compute the residual for the point (29, 78). Inter- 

pret this residual in context. 


AP3.34 The Candy Shoppe assembles gift boxes that 


contain 8 chocolate truffles and 2 handmade 
caramel nougats. The truffles have a mean 
weight of 2 ounces with a standard deviation of 
0.5 ounce, and the nougats have a mean weight 
of 4 ounces with a standard deviation of 1 ounce. 
The empty boxes weigh 3 ounces with a standard 
deviation of 0.2 ounce. 


(a) 


(b) 


AP3.35 


Assuming that the weights of the truffles, nougats, 
and boxes are independent, what are the mean and 
standard deviation of the weight of a box of candy? 


Assuming that the weights of the truffles, nougats, 
and boxes are approximately Normally distributed, 
what is the probability that a randomly selected box 
of candy will weigh more than 30 ounces? 


If five gift boxes are randomly selected, what is 
the probability that at least one of them will weigh 
more than 30 ounces? 


If five gift boxes are randomly selected, what is the 
probability that the mean weight of the five boxes 
will be more than 30 ounces? 


An investor is comparing two stocks, A and B. 

She wants to know if over the long run, there is a 
significant difference in the return on investment 
as measured by the percent increase or decrease in 
the price of the stock from its date of purchase. ‘The 
investor takes a random sample of 50 annualized 
daily returns over the past five years for each stock. 
The data are summarized below. 


Stock Mean return Standard deviation 
A 11.8% 12.9% 
B 71% 9.6% 


Is there a significant difference in the mean return 
on investment for the two stocks? Support your an- 
swer with appropriate statistical evidence. Use a 5% 
significance level. 


The investor believes that although the return 
on investment for Stock A usually exceeds that of 
Stock B, Stock A represents a riskier investment, 
where the risk is measured by the price volatility 
of the stock. The standard deviation is a statistical 
measure of the price volatility and indicates how 
much an investment’s actual performance during 
a specified period varies from its average perfor- 
mance over a longer period. Do the price fluctua- 


SF 
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tions in Stock A significantly exceed those of Stock 
B, as measured by their standard deviations? Iden- 
tify an appropriate set of hypotheses that the inves- 
tor is interested in testing. 


To measure this, we will construct a test statistic 


defined as 


large sample variance 


smaller sample variance 


What value(s) of the statistic would indicate that the 
price fluctuations in Stock A significantly exceed 
those of Stock B? Explain. 


Calculate the value of the F statistic using the in- 
formation given in the table. 


Two hundred simulated values of this test statistic, 
F, were calculated assuming no difference in the 
standard deviations of the returns for the two stocks. 
The results of the simulation are displayed in the 
following dotplot. 


S  eeecccccenccocoscsccence 
Ss  eeeecccccvccoces 


= 
ma 


bo ] eeeeee 
e 


1 


Larger variance/smaller variance 


Use these simulated values and the test statistic 
that you calculated in part (d) to determine 
whether the observed data provide convincing 
evidence that Stock A is a riskier investment than 
Stock B. Explain your reasoning. 
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Inference for Distributions 
of Categorical Data 


Do Dogs Resemble Their Owners? 


Some people look a lot like their pets. Maybe they deliberately choose animals that match their 
appearance. Or maybe we’re perceiving similarities that aren’t really there. Researchers at the University 
of California, San Diego, decided to investigate. They designed an experiment to test whether or not dogs 
resemble their owners. The researchers believed that resemblance between dog and owner might differ 
for purebred and mixed-breed dogs. 

A random sample of 45 dogs and their owners was photographed separately at three dog parks. Then, 
researchers “constructed triads of pictures, each consisting of one owner, that owner’s dog, and one other 
dog photographed at the same park.” The subjects in the experiment were 28 undergraduate psychology 
students. Each subject was presented with the individual sets of photographs and asked to identify which 
dog belonged to the pictured owner. A dog was classified as resembling its owner if more than half of the 
28 undergraduate students matched dog to owner.! 

The table below summarizes the results. There is some support for the researchers’ belief that 
resemblance between dog and owner might differ for purebred and mixed-breed dogs. 


Breed status 
Resemblance? Purebred dogs Mixed-breed dogs 
Resemble owner 16 7 
Don’t resemble 9 13 
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Introduction 


In the previous chapter, we discussed inference procedures for comparing the 
proportion of successes for two populations or treatments. Sometimes we want 
to examine the distribution of a single categorical variable in a population. The 
chi-square test for goodness of fit allows us to determine whether a hypothesized 
distribution seems valid. This method is useful in a field like genetics, where the 
laws of probability give the expected proportion of outcomes in each category. 

We can decide whether the distribution of a categorical variable differs for 
two or more populations or treatments using a chi-square test for homogeneity. 
In doing so, we will often organize our data in a two-way table. It is also possible 
to use the information in a two-way table to study the relationship between two 
categorical variables. The chi-square test for independence allows us to determine 
if there is convincing evidence of an association between the variables in the 
population at large. 

The methods of this chapter help us answer questions such as these: 


e Are the birthdays of NHL players evenly distributed throughout the year? 
¢ Does background music influence customer purchases? 


e Is there an association between anger level and heart disease? 


Of course, we have to do a careful job of describing the data before we proceed 
to statistical inference. In Chapter 1, we discussed graphical and numerical 
methods of data analysis for categorical variables. You may want to quickly review 
Section 1.1 now. 

Here’s an Activity that gives you a taste (pardon the pun) of what lies ahead. 


ACTIVITY | The Candy Man Can 


MATERIALS: Mars, Incorporated, which is headquartered in McLean, Virginia, makes milk 
Large bag of M&M’S® Milk chocolate candies. Here’s what the company’s Consumer Affairs Department says 
Chocolate Candies for the about the color distribution of its M&M’S Milk Chocolate Candies: 


pea einen ile wa On average, MG@M’S Milk Chocolate Candies will contain 13 percent of each 
each team of 3 to 4 students 
of browns and reds, 14 percent yellows, 16 percent greens, 20 percent oranges 
and 24 percent blues. 


icmilk chocolate” = 
—— ‘I 


The purpose of this activity is to determine whether the company’s claim is 


believable. 


1. Your teacher will take a random sample of 60 M&M’S from a large bag and 
give one or more pieces of candy to each student. As a class, count the number 
of M&M’S® Chocolate Candies of each color. Make a table on the board that 
summarizes these observed counts. 


2. How can you tell if the sample data give convincing evidence to dispute the 
company’s claim? Each team of three or four students should discuss this ques- 
tion and devise a formula for a test statistic that measures the difference between 
the observed and expected color distributions. The test statistic should yield a 
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single number when the observed and expected values are plugged in. Here are 
some questions for your team to consider: 


e Should we look at the difference between the observed and expected 
proportions in each color category or between the observed and expected 
counts in each category? 
e Should we use the differences themselves, the absolute value of the 
differences, or the square of the differences? 
e Should we divide each difference value by the sample size, expected 
count, or nothing at all? 
3. Each team will share its proposed test statistic with the class. Your teacher 
will then reveal how the chi-square statistic x’ is calculated. 
4. Discuss as a class: If your sample is consistent with the company’s claimed 
distribution of M&M’S® Chocolate Candies colors, will the value of x” be large 
or small? If your sample is not consistent with the company’s claimed color 
distribution, will the value of x7 be large or small? 
5. Compute the value of the chi-square test statistic for the class’s data. Is this val- 
ue large or small? To find out, you and your classmates will perform a simulation. 


6. Suppose that the company’s claimed color distribution is correct. We'll use 
numerical labels from 1 to 100 to represent the color of a randomly chosen 


M&M’S Milk Chocolate Candy: 


1-13 =brown 14-26=r1ed 27-40 =yellow 41-56 = green 
57-76 = orange 77—100 = blue 


Use the calculator command below to simulate choosing a random sample of 


60 candies. 


TI-83/84: RandInt (1,100,60) — Ll 
TL89: tistat.randint (1,100,60) — listi 


Sort the list in ascending order. Then record the observed counts in each color 
category and compute the value of 7 for your simulated sample. 


7. Your teacher will draw and label axes for a class dotplot. Plot the result you 
got in Step 6 on the graph. 


8. Repeat Steps 6 and 7 if needed to get a total of at least 40 repetitions of the 
simulation for your class. 


9. Based on the class’s simulation results, how surprising would it be to get a 


x?-value as large as or larger than the one you did in Step 5 by chance alone 
when sampling from the claimed distribution? What conclusion would you draw? 


oo 


33 Here is an example of what the class dotplot in the Activity might 
one look like after 100 trials. ‘The graph shows what values of the chi-square 
Sogee Se, statistic are likely to occur by chance alone when sampling from the 

SSSase sess | & company’s claimed M&M’S® Chocolate Candies color distribution. 
OHOOWDOCOCHOOD OC O oe °¢ ° . ; D) 4 
6 5 10 15 20 25 Where did your class’s y*-value fall? You will learn more about the 


chisquare sampling distribution of the chi-square statistic shortly. 
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TD Chi-Square Tests for 


WHAT YOU WILL LEARN 


Goodness of Fit 


By the end of the section, you should be able to: 


e State appropriate hypotheses and compute expected e Perform a chi-square test for goodness of fit. 


counts for a chi-square test for goodness of fit. © Conduct a follow-up analysis when the results of a 
e Calculate the chi-square statistic, degrees of freedom, chi-square test are statistically significant. 
and P-value for a chi-square test for goodness of fit. 


Note that the correct alternative 
hypothesis H, is two-sided. A sample 
proportion of blue M&M’S much 
higher or much lower than 0.24 would 
give Jerome reason to be suspicious 
about the company’s claim. It’s not 
appropriate to adjust H, after looking 
at the sample data! 


Jerome’s class did the Candy Man Can Activity. The one-way table below 
summarizes the data from the class’s sample of M&M’S® Milk Chocolate Can- 
dies. In general, one-way tables display the distribution of a single categorical 
variable for the individuals in a sample. 


Color: Blue Orange Green Yellow Red Brown Total 
Count: 9 8 12 15 10 6 60 


x9 
The sample proportion of blue M&M’S is p = 60 7 0.15. Because the company 


claims that 24% of all M&M’S Milk Chocolate Candies are blue, Jerome might 
believe that something fishy is going on. We could use the one-sample z test for a 
proportion from Chapter 9 to test the hypotheses 


Ho: p = 0.24 
Hy: p # 0.24 


where is the true population proportion of blue M&M’S® Chocolate Candies. 
We could then perform additional significance tests for each of the remaining 
colors. 

Not only would this method be fairly inefficient, it would also lead to the prob- 
lem of multiple comparisons, which we’ll discuss in Section 11.2. More impor- 
tant, this approach wouldn’t tell us how likely it is to get a random sample of 
60 candies with a color distribution that differs as much from the one claimed by 
the company as the class’s sample does, taking all the colors into consideration at 
one time. For that, we need a new kind of significance test, called a chi-square 
test for goodness of fit. 


Comparing Observed and Expected Counts: 
The Chi-Square Statistic 


As with any test, we begin by stating hypotheses. The null hypothesis in a chi- 
square test for goodness of fit should state a claim about the distribution of a 
single categorical variable in the population of interest. In the case of the Candy 
Man Can Activity, the categorical variable we’re measuring is color and the 


THINK 
ABOUT IT 
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population of interest is all M&M’S® Milk Chocolate Candies. The appropriate 
null hypothesis is 


Ho: The company’s stated color distribution for all 
M&M’S Milk Chocolate Candies is correct. 


The alternative hypothesis in a chi-square test for goodness of fit is that the 
categorical variable does not have the specified distribution. For the M&M’S, our 
alternative hypothesis is 


H,: The company’s stated color distribution for all 
M&M’S Milk Chocolate Candies is not correct. 


Why did we state hypotheses in words for a chi-square test 
for goodness of fit? We can also write the hypotheses in symbols as 


Ho: Pbiue = 0.24, Porange = 0.20, Pereen = 0.16, 
Pyellow = O14, Pred = 0.13, Pbrown = 0.1 3, 


H,; At least two of the p;’s are incorrect 


where Peolor = the true population proportion of M&M’S Milk Chocolate Can- 
dies of that color. Why don’t we write the alternative hypothesis as H,: At least one 
of the ;’s is incorrect? If the stated proportion in one category is wrong, then the 
stated proportion in at least one other category must be wrong because the sum of 
the p;’s must be 1. 
Don’t state the alternative hypothesis in a way that suggests that all the 

proportions in the hypothesized distribution are wrong. For instance, it rT) 
would be incorrect to write 


Ea Pblue # O24; Porange # 0.20, Pereen # 0.16, 
Pycllow # 0.14, Pred # 0.13, Pbrown # 0.13 


The idea of the chi-square test for goodness of fit is this: we compare the ob- 
served counts from our sample with the counts that would be expected if Hp is 
true. (Remember: we always assume that Hp is true when performing a significance 
test.) The more the observed counts differ from the expected counts, the more 
evidence we have against the null hypothesis. In general, the expected counts can 
be obtained by multiplying the sample size by the proportion in each category 
according to the null hypothesis. Here’s an example that illustrates the process. 


Return of the M&M’S® 


Chocolate Candies 


Computing expected counts 


PROBLEM: Jerome's class collected data from a random sample of 60 M&M’S Milk Chocolate 
Candies. Calculate the expected counts for each color. Show your work. 


SOLUTION: Assuming that the color distribution stated by Mars, Inc., is true, 24% of all M&M’S 
Milk Chocolate Candies produced are blue. For random samples of 60 candies, the average number of 
blue M&M’S should be (60)(0.24) = 14.40. This is our expected count of blue M&M’S® Chocolate 
Candies. Using this same method, we find the expected counts for the other color categories: 
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Orange: (60)(0.20) = 12.00 Color Observed Expected 
Green: (60)(0.16) = 9.60 is ic 
a 6 Ae ary Orange 8 12.00 
Ele EOE) =e Green 12 9.60 
Red: (60)(0.13) = 7.80 Yellow 15 8.40 
Brown: (60)(0.13) = 7.80 Red 10 7.80 
Brown 6 7.80 


For Practice Try Exercise 


Did you notice that the expected count sounds a lot like the expected value of a 
random variable from Chapter 6? That’s no coincidence. The number of M&M’S® 
Chocolate Candies of a specific color in a random sample of 60 candies is a bino- 
mial random variable. Its expected value is np, the average number of candies of 
this color in many samples of 60 M&M’S Milk Chocolate Candies. The expected 
value is not likely to be a whole number. 
To see if the data give convincing evidence for the alter- 
16 - native hypothesis, we compare the observed counts from our 


- WE Observed sample with the expected counts. If the observed counts are 
12 - WH Expected far from the expected counts, that’s the evidence we were seek- 
‘eel ing. The table in the example gives the observed and expected 
z. counts for the sample of 60 M&M’S in Jerome’s class. Figure 
6 11.1 shows a side-by-side bar graph comparing the observed 
gl and expected counts. 

We see some fairly large differences between the observed 
: il and expected counts in several color categories. How likely is it 

T T T T T 1 


that differences this large or larger would occur just by chance 


Count 


i a = — see Brow in random samples of size 60 from the population distribution 
FIGURE 11.1 Bar graph comparing observed and expected claimed by Mars, Inc.? To answer this question, we calculate a 
counts for Jerome's class sample of 60 M&M’S® Milk statistic that measures how far apart the observed and expected 
Chocolate Candies. counts are. The statistic we use to make the comparison is the 


chi-square statistic 


(Observed — Expected)? 
Expected 


y=> 


(The symbol x is the lowercase Greek letter chi, pronounced “kye.”) 


DEFINITION: Chi-square statistic 


The chi-square statistic is a measure of how far the observed counts are from the 
expected counts. The formula for the statistic is 


(Observed — Expected)? 
v=> 


Expected 
where the sum is over all possible values of the categorical variable. 


THINK 
ABOUT IT 
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Let’s use this formula to compare the observed and expected counts for Jerome’s 
class sample. 


Return of the M&M’S® 


Chocolate Candies 


Calculating the chi-square statistic 


The table shows the observed and expected Golor — Observed —_— Expected 
counts for the random sample of 60 M&M’S pig 14.40 
Milk Chocolate Candies in Jerome’s class. 


Orange 12.00 


PROBLEM: Calculate the chi-square statistic. Green 9.60 
SOLUTION: Theformulafor the chi-square statistic is 


Yellow 8.40 


; Red as 
(Observed — Expected) Brown 7.80 


x= > 


For Jerome’s data, we add six terms—one for each color category: 


Expected 


9-14.40)? (8-12.00) | (12 — 9.60)’ 


1440 ~—«- 12.00 9.60 
(ib 2A0)> (10 — 7.80)? (6 7.60) 


8.40 7.60 7.60 
= 2.025 + 1.333 + 0.600 + 5.186 + 0.621 + 0.415 = 10.180 


ot 


xX 


For Practice Try Exercise 


Why do we divide by the expected count when calculat- 
ing the chi-square statistic? Suppose you obtain a random sample of 
60 M&M’S Milk Chocolate Candies. Which would be more surprising: getting 
18 blue candies or 12 yellow candies in the sample? Earlier, we computed the ex- 
pected counts for these two categories as 14.4 and 8.4, respectively. The difference 
in the observed and expected counts for the two colors would be 


Blue: 18 — 14.4 = 3.6 Yellow: 12 — 8.4 = 3.6 


In both cases, the number of M&M’S® Chocolate Candies in the sample exceeds 
the expected count by the same amount. But it’s much more surprising to be off 
by 3.6 out of an expected 8.4 yellow candies (almost a 50% discrepancy) than to 
be off by 3.6 out of an expected 14.4 blue candies (a 25% discrepancy). For that 
reason, we want the category with a larger relative difference to contribute more 
heavily to the evidence against Hy and in favor of H, measured by the y’ statistic. 

If we just computed (Observed — Expected)’ for each category instead, the 
contributions of these two color categories would be the same: 


Blue: (18 — 14.40)? = 12.96 Yellow: (12 — 8.40)? = 12.96 
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(Observed — Expected)? 


Expected 
larger relative difference will contribute more heavily to the total: 


By using , we guarantee that the color category with the 


(18 — 14.40)? 7 (12 — 8.40)" 
ag 0.90 Yellow: 340. 
ooo a 


Blue: = 1.54 


Think of x? as a measure of the distance of the observed counts from the ex- 
pected counts. Like any distance, it is always zero or positive, and it is zero only 
when the observed counts are exactly equal to the expected counts. Large values 
of y” are stronger evidence for H, because they say that the observed counts are 
far from what we would expect if Hy were true. Small values of y” suggest that 
the data are consistent with the null hypothesis. Is x7 = 10.180 a large value? 
You know the drill: compare the observed value 10.180 against the sampling dis- 
tribution that shows how x7 would vary in repeated random sampling if the null 
hypothesis were true. 


CHECK YOUR UNDERSTANDING 

Mars, Inc., reports that their M&M’S® Peanut Chocolate Candies are produced accord- 
ing to the following color distribution: 23% each of blue and orange, 15% each of green 
and yellow, and 12% each of red and brown. Joey bought a randomly selected bag of 
Peanut Chocolate Candies and counted the colors of the candies in his sample: 12 blue, 
7 orange, 13 green, 4 yellow, 8 red, and 2 brown. 

1. State appropriate hypotheses for testing the company’s claim about the color distribu- 
tion of M&M’S Peanut Chocolate Candies. 

2. Calculate the expected count for each color, assuming that the company’s claim is 
true. Show your work. 


3. Calculate the chi-square statistic for Joey’s sample. Show your work. 


The Chi-Square Distributions and P-Values 


We used Fathom software to simulate taking 500 random samples of size 60 from 
the population distribution of M&M’S Milk Chocolate Candies given by Mars, 
Inc. Figure 11.2 shows a dotplot of the values of the chi-square statistic for these 
500 samples. The blue vertical line is plotted at the value of x7 = 10.180 from 
Jerome’s class data. 

Recall that larger values of y” give more convincing evidence against Hy and 
in favor of H,. According to Fathom, 37 of the 500 simulated samples 
resulted in a chi-square statistic of 10.180 or higher. Our estimated 


0 5 10 15 20 25 


sina P-value is 37/500 = 0.074. Because the P-value exceeds the default 
FIGURE 11.2 Fathom dotplot showing values a = 0.05 significance level, we fail to reject Hp. We do not have convinc- 
of the chi-square statistic in 500 simulated ing evidence that the company’s claimed color distribution is incorrect. 
samples of size n = 60 from the population As Figure 11.2 suggests, the sampling distribution of the chi-square 


distribution of M&M’S® Milk Chocolate Candies __ statistic is not a Normal distribution. It is a right-skewed distribution 
stated by the company. that allows only nonnegative values because x? can never be negative. 
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The sampling distribution of y” differs depending on the number of possible val- 
ues for the categorical variable (that is, on the number of categories). 

When the expected counts are all at least 5, the sampling distribution of the x7 
statistic is modeled well by a chi-square distribution with degrees of freedom (df) 
equal to the number of categories minus 1. As with the t distributions, there is a 
different chi-square distribution for each possible value of df. Here are the facts. 


THE CHI-SQUARE DISTRIBUTIONS 


The chi-square distributions are a family of density curves that take only non- 
negative values and are skewed to the right. A particular chi-square distri- 
bution is specified by giving its degrees of freedom. The chi-square test for 
goodness of fit uses the chi-square distribution with degrees of freedom = the 
number of categories — 1. 


Figure 11.3 shows the density curves for three members of 
the chi-square family of distributions. As the degrees of free- 
dom (df) increase, the density curves become less skewed, and 
larger values become more probable. Here are two other inter- 
esting facts about the chi-square distributions: 


e The mean of a particular chi-square distribution is equal 
to its degrees of freedom. 


e For df > 2, the mode (peak) of the chi-square density 
curve is at df — 2. 


Chi-square When df = 8, for example, the chi-square distribution has a 
FIGURE 11.3 The density curves for three members of the mean of 8 anda mode of 6. 
chi-square family of distributions. To get P-values from a chi-square distribution, we can use 


technology or Table C in the back of the book. The following example shows how 
to use the table. 


Return of the M&M’S® 
Chocolate Candies 


Finding the P-value 


In the last example, we computed the chi-square statistic 
for the random sample of 60 M&M’S Milk Chocolate 
Candies in Jerome’s class: x7 = 10.180. Now let’s find 
the P-value. Because all the expected counts are at least 
5, the x? statistic will be modeled well by a chi-square 

distribution when Hp is true. There are 6 color categories 


for M&M’S Milk Chocolate Candies, so df = 6 — 1 = 5. 


Chi-square 
distribution 
with df = 5 


as 0) ee The P-value is the probability of getting a value of 7 as 


large as or larger than 10.180 when Hp is true. Figure 114 
shows this probability as an area under the chi-square 
density curve with 5 degrees of freedom. 


X? = 10.180 


FIGURE 11.4 The P-value for a chi-square test for goodness 
of fit using Jerome’s M&M’S® Chocolate Candies class data. 
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To find the P-value using Table C, look in the df = 5 P 

row. The value x7 = 10.180 falls between the critical | gf 45 10 05 
values 9.24 and 11.07. The corresponding areasinthe | 4 674 778 9,49 
right tail of the chi-square distribution with 5 degrees 
of freedom are 0.10 and 0.05. (See the excerpt from Ee 9241107 | 
Table C on the right.) So the P-value for a test based LB Ay US 12a | 
on Jerome’s data is between 0.05 and 0.10. 


Now let’s look at how to find the P-value with your calculator. 


FINDING P-VALUES FOR CHI-SQUARE 
TECHNOLOGY = TecTS ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


To find the P-value in the M&M’S® example with your calculator, use the y7cdf(Chi-square Cdf on the TI-89) com- 
mand. We ask for the area between x7 = 10.180 and a very large number (we’ll use 10,000) under the chi-square density 
curve with 5 degrees of freedom. 

TI-83/84 TI-89 
¢ Press [2nd][VARS](DISTR) and choose x7cdf (. In the Stats/List Editor, Press [F5](Distr) and 


OS 2.55 or later: In the dialog box, enter these values: choose Chi-square Cdf.... 

lower:10.18, upper:10000, d£:5, choose In the dialog box, enter these values: Lower 
Paste, and then press [ENTER]. Older OS: Complete value:10.18, Upper value:10000, Deg of 
the command y*cdf(10.180,10000,5) and press Freedom, df£:5, and then choose | ENTER |. 
ENTER |, 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


X*cdf (10.180, 10000,5) 
- 9782927523 


As the calculator screen shots show, this method gives a more precise P-value than ‘Table C. 


Table C gives us an interval in which the P-value falls. The calculator’s x7ca£ 
(Chi-square Cdf on the T1-89) command gives a result that is consistent with 
Table C but more precise. For that reason, we recommend using your calculator 
to compute P-values from a chi-square distribution. 

Based on Jerome’s sample, what conclusion can we draw about Ho:the com- 
pany’s stated color distribution for all M&M’S® Milk Chocolate Candies is cor- 
rect? Because our P-value of 0.07 is greater than a = 0.05, we fail to reject Hp. We 
don’t have convincing evidence that the company’s claimed color distribution is 
incorrect. 
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Failing to reject Hp does not mean that the null hypothesis is true! That 
is, we can’t conclude that the color distribution claimed by Mars, Inc., is 
correct. All we can say is that the sample data did not provide convincing 
evidence to reject Hp. 


CHECK YOUR UNDERSTANDING 


Let’s continue our analysis of Joey’s sample of M&M’S® Peanut Chocolate Candies from 
the previous Check Your Understanding (page 684). 


1. Confirm that the expected counts are large enough to use a chi-square distribution. 
Which distribution (specify the degrees of freedom) should we use? 


2. Sketch a graph like Figure 11.4 on page 685 that shows the P-value. 
3. Use ‘Table C to find the P-value. Then use your calculator’s 
y’cd£ command. 
4. What conclusion would you draw about the company’s 
claimed color distribution for M&M’S® Peanut Chocolate 
© Candies? Justify your answer. 


Carrying Out a Test 


Like our test for a population proportion, the chi-square test for goodness of fit 
uses some approximations that become more accurate as we take larger samples. 
The Large Counts condition says that all expected counts must be at least 5. Be- 
fore performing a test, we must also check that the Random and 10% conditions 
are met. 


CONDITIONS FOR PERFORMING A CHI-SQUARE 
TEST FOR GOODNESS OF FIT 


Before we start using the chi-square test for goodness of fit, we have two impor- 
tant cautions to offer. 
1. The chi-square test statistic compares observed and expected counts. 
Don’t try to perform calculations with the observed and expected pro- @ 
portions in each category. 
2. When checking the Large Counts condition, be sure to examine the expected 
counts, not the observed counts. 
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We can also write these hypotheses 
symbolically using p; to represent 
the proportion of individuals in the 
population that fall in category /: Suppose the conditions are met. To determine whether a categorical vari- 
able has a specified distribution in the population of interest, expressed as the 
proportion of individuals falling into each possible category, perform a test of 


THE CHI-SQUARE TEST FOR GOODNESS OF FIT 


FD) = — 4a = __j.niesy Py = __- 
H,:At least two of the p;’s are incorrect. 


Ho: The stated distribution of the categorical variable in the population of 
interest is correct. 

H,: The stated distribution of the categorical variable in the population of 
interest is not correct. 


Start by finding the expected count for each category assuming that Ho is 
true. Then calculate the chi-square statistic 


(Observed — Expected)? 
Expected 


=> 


where the sum is over the k different categories. The P-value is the area to the 
right of x? under the density curve of the chi-square distribution with k — 1 
degrees of freedom. 


The next example shows the chi-square test for goodness of fit in action. As 
always, we follow the four-step process when performing inference. 


Birthdays and Hockey 
A test for equal proportions L. 


7 In his book Outliers, Malcolm Gladwell suggests that a hock- 
; ‘ fy ey player’s birth month has a big influence on his chance to 
3 $45 i Rai rae Se make it to the highest levels of the game. Specifically, because 
\ aie January | is the cut-off date for youth leagues in Canada (where 
many National Hockey League (NHL) players come from), 
players born in January will be competing against players up to 
12 months younger. The older players tend to be bigger, stron- 
ger, and more coordinated and hence get more playing time, 
more coaching, and have a better chance of being successful. 


To see if birth date is related to success (judged by whether a 
player makes it into the NHL), a random sample of 80 NHL players from a recent 
season was selected and their birthdays were recorded. The one-way table below 
summarizes the data on birthdays for these 80 players: 


Birthday: Jan—Mar Apr—Jun Jul-Sep Oct-Dec 
Number of players: 32 20 16 12 


Do these data provide convincing evidence that the birthdays of NHL players are 
not uniformly distributed throughout the year? 


The null hypothesis says that 

NHL players’ birthdays are evenly 
distributed across the four quarters of 
the year. In that case, all 4 proportions 
must be 1/4. So we could write the 
hypotheses in symbols as 


Hp: Dian-mar Papr-Jun = Pyul-Sep 
Poct-dec = 1/4 
H,; At least two of the proportions are 


not 1/4 
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STATE: We want to perform a test of 
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~~ 


Ho: The birthdays of all NHL players are evenly distributed across the four quarters of the year. 


H,;: The birthdays of all NHL players are not evenly distributed across the four quarters of the year. 


No significance level was specified, so we'll use a = 0.05. 


PLAN: Ifthe conditions are met, we will perform a chi-square test for goodness of fit. 


* Random: The data came from a random sample of NHL players. 


° 10%: Because we are sampling without replacement, there 


must be at least 10(80) = 800 


NHL players. In the season when the data were collected, there were 879 NHL players. 
* Large Counts: \f birthdays are evenly distributed across the four quarters of the year, then the 
expected counts are all 80(1/4) = 20. These counts are all at least 5. 


DO: 


Chi-square 
distribution, 
df =3 


a2 


° Test statistic: 


J (Observed — Expected)? 
s Expected 


_ (82 — 20) ,, (20- 20)? me 


6—20)* (12 — 20)? 
be ) 


20 20 
= 175 ar 0) ar Oe ar ohy74 = WN 


FIGURE 11.5 The P-value for the chi-square 
test for goodness of fit with y* = 11.2 and 


df = 3. 


TECHNOLOGY 
CORNER 


distribution with 4 — 1 = 3 degrees of freedom. 


20 20 


* Pvalue: Figure 11.5 displays the P-value for this test as an area under the chi-square 


As the excerpt at right shows, x? = 11.2 corresponds toa P-value between 0.01 


and 0.02. 


Using Technology: Refer to the Technology Corner that follows 
the example. The calculator's x? GOF-Test gives 

\? = 11.2 and P-value = 0.011 using df = 3. 
CONCLUDE: Because the P-value, 0.011, is less than 

a = 0.05, we reject Ho. We have convincing evidence that the 
birthdays of NHL players are not evenly distributed across the 
four quarters of the year. 


p 

df 0.02 0.01 0.005 
2 782 9.21 10.60 
4 11.67 13.28 14.86 


For Practice Try Exercise 


You can use your calculator to carry out the “Do” step for a chi-square test for 
goodness of fit. Remember that this comes with potential benefits and risks on the 


AP® exam. 


CHI-SQUARE TEST FOR GOODNESS 


OF FIT ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


You can use the TI-83/84 or TI-89 to perform the calculations for a chi-square test for goodness of fit. We'll use the data 
from the hockey and birthdays example to illustrate the steps. 
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1. Enter the counts. Birthday Observed Expected 
e Enter the observed counts in LI/listl. Enter the expected counts in Jan—Mar 32 20 
L2/list2. Apr-Jun 20 20 
2. Performa chi-square test for goodness of fit. Jul-Sep 16 20 
Oct-Dec 12 20 


Note: Some older TI-83s and TI-84s don’t have this test. TI-84 users can get this 
functionality by upgrading their operating systems. 

TI-83/84: Press [STAT], arrow over to TESTS and choose y*GOF-Test.... 

TI-89: In the Stats/List Editor APP, press ([F6]) and choose Chi2GOF... . 


Enter the inputs shown below. If you choose Calculate, you'll get a screen with the test statistic, P-value, and df. If 
you choose the Draw option, you'll get a picture of the appropriate chi-square distribution with the test statistic marked 
and shaded area corresponding to the P-value. 


NORMAL FLOAT AUTO REAL RADIAN CL fl NORMAL FLOAT AUTO REAL RADIAN CL f NORMAL FLOAT AUTO REAL RADIAN CL fl 


x2GOF-Test X2GOF-Test 


Observed:Li X2=11.2 
Expected:L2 p=. 0106921291 
df:3 


df=3 
Color: i —U=l CNTRB={7.2 9 .8 3.2} 


Calculate Draw 


\ 
L \ 
\ 
\ 
eee 5 ell 


*%2G0F-Test 
%2=11.2 


P=.0107 


vik its 


Obrorved Lick 
Expected List: 


allt sctelin 
Chi-square Goodness of Fit 


(iste __] 


Chi-square Goodness of Fit 


112 
=.010682 


bed of Freedoms dé: E——_] se? 2h 82 

Results: DELL > 
list2[(5]= list2(5l= chi2=11.2 p=.610692 
USE € AND + TO OPEN CHOICES MAIN kat auto FUNC 2/6 MAIN Rab auto FUNC 


We'll discuss the CNTRB and Comp Lst results shortly. 


AP® EXAM TIP You can use your calculator to carry out the mechanics of a significance test on the AP® exam. But there’s 
a risk involved. If you just give the calculator answer with no work, and one or more of your values is incorrect, you will 
probably get no credit for the “Do” step. We recommend writing out the first few terms of the chi-square calculation followed 


by “...”. This approach might help you earn partial credit if you enter a number incorrectly. Be sure to name the procedure 
(y?GOF-Test) and to report the test statistic (vy? =11.2), degrees of freedom (df = 3), and P-value (0.011). 


Follow-up Analysis In the chi-square test for goodness of fit, we test the null 
hypothesis that a categorical variable has a specified distribution in the population 
of interest. If the sample data lead to a statistically significant result, we can con- 
clude that our variable has a distribution different from the one stated. When this 
happens, start by examining which categories of the variable show large deviations 
between the observed and expected counts. Then look at the individual terms 
(Observed — Expected)? 


Expected 
These components show which terms contribute most to the chi-square statistic. 


that are added together to produce the test statistic x7. 
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Let’s return to the hockey and birthdays example. The table of observed and 
expected counts for the 80 randomly selected NHL players is repeated below. 
We have added a column that shows the components of the chi-square test 
statistic. Looking at the counts, we see that there were many more players born 
in January through March than expected and far fewer players born in October 
through December than expected. The component for January to March birth- 
days made the largest contribution to the chi-square statistic. These results sup- 
port Malcolm Gladwell’s claim that NHL players are more likely to be born 


early in the year. 


Birthday 


Jan—Mar 
Apr—Jun 
Jul-Sep 
Oct-Dec 


32 
20 


Expected 
20 
20 
20 
20 


0-E 
12 


Observed (O—E)/E 


7.2 
0.0 
0.8 
3.2 


Note: When we ran the chi-square test for goodness of fit on the calculator, a 
list of these individual components was stored. On the TI-83/84, the list is called 
CNTRB (for contribution). On the TT-89, it’s called Comp Lst (component list). 


CHECK YOUR UNDERSTANDING 


Biologists wish to mate pairs of fruit flies having genetic makeup RrCc, indicating that 
each has one dominant gene (R) and one recessive gene (r) for eye color, along with one 
dominant (C) and one recessive (c) gene for wing type. Each offspring will receive one 
gene for each of the two traits from each parent. The following table, known as a Punnett 
square, shows the possible combinations of genes received by the offspring: 


Parent 1 passes on: 


RC 
Rc 
rC 
re 


Parent 2 passes on: 


Rc 


rC 


Any offspring receiving an R gene will have red eyes, and any offspring receiving a 
C gene will have straight wings. So based on this Punnett square, the biologists predict a 
ratio of 9 red-eyed, straight-winged (x):3 red-eyed, curly-winged (y):3 white-eyed, straight- 
winged (z):1 white-eyed, curly-winged (w) offspring. 

To test their hypothesis about the distribution of offspring, the biologists mate a random 
sample of pairs of fruit flies. Of 200 offspring, 99 had red eyes and straight wings, 42 had 
red eyes and curly wings, 49 had white eyes and straight wings, and 10 had white eyes and 
curly wings. Do these data differ significantly from what the biologists have predicted? 
Carry out a test at the a = 0.01 significance level. 
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Summary 


e A one-way table is often used to display the distribution of a single categorical 
variable for a sample of individuals. 


e The chi-square test for goodness of fit tests the null hypothesis that a cat- 
egorical variable has a specified distribution in the population of interest. 


e This test compares the observed count in each category with the counts that 
would be expected if Hp were true. The expected count for any category is 
found by multiplying the sample size by the proportion in each category ac- 
cording to the null hypothesis. 


e The chi-square statistic is 


(Observed — Expected)? 
Expected 


v=> 


where the sum is over all possible categories. 
e The conditions for performing a chi-square test for goodness of fit are: 


¢ Random: The data were produced by a well-designed random sample or 
randomized experiment. 
© 10%: When sampling without replacement, check that the popula- 
tion is at least 10 times as large as the sample. 
e Large Counts: All expected counts must be at least 5. 

e When the conditions are met, the sampling distribution of the statistic x7 can 
be modeled by a chi-square distribution. 

e Large values of x? are evidence against Hy and in favor of H,. The P-value is 
the area to the right of 7 under the chi-square distribution with degrees of 
freedom df = number of categories — 1. 

e If the test finds a statistically significant result, consider doing a follow-up 
analysis that compares the observed and expected counts and that looks for 
the largest components of the chi-square statistic. 


TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


25. Finding P-values for chi-square tests on the calculator 


26. Chi-square test for goodness of fit on the calculator 
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Exercises 


Aw, nuts! A company claims that each batch of 

its deluxe mixed nuts contains 52% cashews, 27% 
almonds, 13% macadamia nuts, and 8% brazil nuts. 
‘To test this claim, a quality-control inspector takes 
a random sample of 150 nuts from the latest batch. 
The one-way table below displays the sample data. 


Nut: Cashew Almond Macadamia Brazil 
Count: 83 29 20 18 


State appropriate hypotheses for performing a test of 
the company’s claim. 


Calculate the expected counts for each type of nut. 
Show your work. 


Roulette Casinos are required to verify that their 
games operate as advertised. American roulette 
wheels have 38 slots— 18 red, 18 black, and 2 green. 
In one casino, managers record data from a random 
sample of 200 spins of one of their American roulette 
wheels. The one-way table below displays the results. 


Color: Red Black Green 
Count: 85 99 16 


State appropriate hypotheses for testing whether these 
data give convincing evidence that the distribution of 
outcomes on this wheel is not what it should be. 


Calculate the expected counts for each color. Show 
your work. 


Aw, nuts! Calculate the chi-square statistic for the 
data in Exercise 1. Show your work. 


Roulette Calculate the chi-square statistic for the 
data in Exercise 2. Show your work. 


Aw, nuts! Refer to Exercises | and 3. 


Confirm that the expected counts are large enough 
to use a chi-square distribution to calculate the 
P-value. What degrees of freedom should you use? 


Sketch a graph like Figure 11.4 (page 685) that 
shows the P-value. 


Use Table C to find the P-value. Then use your 
calculator’s y7cd£ command. 


What conclusion would you draw about the com- 
pany’s claimed distribution for its deluxe mixed nuts? 
Justify your answer. 
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Roulette Refer to Exercises 2 and 4. 


Confirm that the expected counts are large enough 
to use a chi-square distribution to calculate the 
P-value. What degrees of freedom should you use? 


Sketch a graph like Figure 11.4 (page 685) that 
shows the P-value. 


Use Table C to find the P-value. Then use your 
calculator’s x7cd£ command. 


What conclusion would you draw about whether or 
not the roulette wheel is operating correctly? Justify 
your answer. 


Birds in the trees Researchers studied the behavior 
of birds that were searching for seeds and insects in 
an Oregon forest. In this forest, 54% of the trees were 
Douglas firs, 40% were ponderosa pines, and 6% 
were other types of trees. At a randomly selected time 
during the day, the researchers observed 156 
red-breasted nuthatches: 70 were seen in Douglas 
firs, 79 in ponderosa pines, and 7 in other types of 
trees.” Do these data provide convincing evidence 
that nuthatches prefer particular types of trees when 
they're searching for seeds and insects? 


Seagulls by the seashore Do seagulls show a prefer- 
ence for where they land? To answer this question, 
biologists conducted a study in an enclosed outdoor 
space with a piece of shore whose area was made up 
of 56% sand, 29% mud, and 15% rocks. The biolo- 
gists chose 200 seagulls at random. Each seagull 
was released into the outdoor space on its own and 
observed until it landed somewhere on the piece 

of shore. In all, 128 seagulls landed on the sand, 61 
landed in the mud, and 11 landed on the rocks. Do 
these data provide convincing evidence that seagulls 
show a preference for where they land? 


No chi-square A school’s principal wants to know 
if students spend about the same amount of time 
on homework each night of the week. She asks a 
random sample of 50 students to keep track of their 
homework time for a week. The following table 
displays the average amount of time (in minutes) 
students reported per night: 


Night: 


Sunday Monday Tuesday Wednesday Thursday Friday Saturday 


Average 130 108 115 104 99 37 62 


time: 
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Explain carefully why it would not be appropriate 
to perform a chi-square test for goodness of fit using 
these data. 


10. No chi-square The principal in Exercise 9 also 
asked the random sample of students to record 
whether they did all of the homework that was as- 
signed on each of the five school days that week. 
Here are the data: 


School day: Monday Tuesday Wednesday Thursday Friday 
No. who did 34 29 32 28 19 
homework: 


Explain carefully why it would not be appropriate 
to perform a chi-square test for goodness of fit using 
these data. 


11. Benford’s law Faked numbers in tax returns, 
invoices, or expense account claims often display 
patterns that aren’t present in legitimate records. 
Some patterns are obvious and easily avoided by a 
clever crook. Others are more subtle. It is a striking 
fact that the first digits of numbers in legitimate re- 
cords often follow a model known as Benford’s law.’ 
Call the first digit of a randomly chosen record X for 
short. Benford’s law gives this probability model for X 
(note that a first digit can’t be 0): 


First digit: 1 2 3 4 5 6 7 8 9 
Probability: 0.301 0.176 0.125 0.097 0.079 0.067 0.058 0.051 0.046 


A forensic accountant who is familiar with Benford’s 
law inspects a random sample of 250 invoices from 
a company that is accused of committing fraud. ‘The 
table below displays the sample data. 


First digit: 1 A Ss @ 8 & ¢ B 8 
Count: 61 50 43 34 2 167 8 6 


(a) Are these data inconsistent with Benford’s law? 
Carry out an appropriate test at the a = 0.05 level to 
support your answer. If you find a significant result, 
perform a follow-up analysis. 


(b) Describe a Type I error and a Type II error in this 
setting, and give a possible consequence of each. 
Which do you think is more serious? 


12. Housing According to the Census Bureau, the 
distribution by ethnic background of the New York 
City population in a recent year was 

Hispanic: 28% Black: 24% White: 35% 
Asian: 12% Others: 1% 


The manager of a large housing complex in the 
city wonders whether the distribution by race of the 
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complex’s residents is consistent with the population 
distribution. ‘To find out, she records data from a 
random sample of 800 residents. The table below 
displays the sample data.* 


Race: Hispanic Black White Asian Other 
Count: 212 202 270 94 22 


Are these data significantly different from the 

city’s distribution by race? Carry out an appropriate 
test at the a = 0.05 level to support your answer. 

If you find a significant result, perform a follow-up 
analysis. 


13. Skittles Statistics teacher Jason Molesky contacted 
Mars, Inc., to ask about the color distribution for 
Skittles candies. Here is an excerpt from the response 
he received: “The original flavor blend for the 
SKITTLES BITE SIZE CANDIES is lemon, lime, 
orange, strawberry and grape. ‘They were chosen as 
a result of consumer preference tests we conducted. 
The flavor blend is 20 percent of each flavor.” 


(a) State appropriate hypotheses for a significance test of 
the company’s claim. 


(b) Find the expected counts for a bag of Skittles with 
60 candies. 


(c) How large a x’ statistic would you need to have sig- 
nificant evidence against the company’s claim at the 
a = 0.05 level? At the a = 0.01 level? 


(d) Create a set of observed counts for a bag with 60 can- 
dies that gives a P-value between 0.01 and 0.05. Show 
the calculation of your chi-square statistic. 


14. Is your random number generator working? Use 
your calculator’s RandInt function to generate 200 
digits from 0 to 9 and store them in a list. 


(a) State appropriate hypotheses for a chi-square test 
for goodness of fit to determine whether your 
calculator’s random number generator gives each 
digit an equal chance to be generated. 


(b) Carry outa test at the a = 0.05 significance level. 


For parts (c) and (d), assume that the students’ random 
number generators are all working properly. 


(c) What is the probability that a student who does this 
exercise will make a Type I error? 


(d) Suppose that 25 students in an AP Statistics class in- 
dependently do this exercise for homework. Find the 
probability that at least one of them makes a Type | 
error. 


What's your sign? The University of Chicago’s Gen- 


is: 
688 eral Social Survey (GSS) is the nation’s most impor- 


& tant social science sample survey. For reasons known 
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only to social scientists, the GSS regularly asks a 
random sample of people their astrological sign. 
Here are the counts of responses from a recent GSS: 


Sign: Aries Taurus Gemini Cancer Leo Virgo 
Count: 321 360 367 374 383 402 
Sign: Libra Scorpio Sagittarius Capricorn Aquarius Pisces 
Count: 392 329 331 354 376 355 


If births are spread uniformly across the year, we 
expect all 12 signs to be equally likely. Do these data 
provide convincing evidence that all 12 signs are 
not equally likely? If you find a significant result, 
perform a follow-up analysis. 


16. Munching Froot Loops Kellogg’s Froot Loops ce- 
real comes in six fruit flavors: orange, lemon, cherry, 
raspberry, blueberry, and lime. Charise poured 
out her morning bow! of cereal and methodically 
counted the number of cereal pieces of each flavor. 
Here are her data: 


Flavor: Orange Lemon Cherry Raspberry Blueberry Lime 
Count: 28 21 16 25 14 16 


Do these data provide convincing evidence that 
Kellogg’s Froot Loops do not contain an equal pro- 
portion of each flavor? If you find a significant result, 
perform a follow-up analysis. 


17. Mendel and the peas Gregor Mendel (1822-1884), 
an Austrian monk, is considered the father of 
genetics. Mendel studied the inheritance of vari- 
ous traits in pea plants. One such trait is whether 
the pea is smooth or wrinkled. Mendel predicted a 
ratio of 3 smooth peas for every | wrinkled pea. In 
one experiment, he observed 423 smooth and 133 
wrinkled peas. Assume that the conditions for infer- 
ence were met. Carry out an appropriate test of the 
genetic model that Mendel predicted. What do you 
conclude? 


18. You say tomato The paper “Linkage Studies of the 
Tomato” (Transactions of the Canadian Institute, 
1931) reported the following data on phenotypes 
resulting from crossing tall cut-leaf tomatoes with 
dwarf potato-leaf tomatoes. We wish to investigate 
whether the following frequencies are consistent 
with genetic laws, which state that the phenotypes 
should occur in the ratio 9:3:3:1. 


Phenotype: Tall Tall Dwarf Dwarf 
cut potato cut potato 


Frequency: 926 288 293 104 


Assume that the conditions for inference were met. 
Carry out an appropriate test of the proposed genetic 
model. What do you conclude? 


Multiple choice: Select the best answer for Exercises 19 
to 22. 

Exercises 19 to 21 refer to the following setting. The man- 
ager of a high school cafeteria is planning to offer several 
new types of food for student lunches in the following 
school year. She wants to know if each type of food will 
be equally popular so she can start ordering supplies and 
making other plans. To find out, she selects a random 
sample of 100 students and asks them, “Which type of 
food do you prefer: Asian food, Mexican food, pizza, or 
hamburgers?” Here are her data: 


Type of Food: Asian Mexican Pizza Hamburgers 
Count: 18 22 39 21 


19. An appropriate null hypothesis to test whether the 
food choices are equally popular is 


(a) Ho: = 25, where y = the mean number of students 
that prefer each type of food. 


(b) Ho:p = 0.25, where p = the proportion of all students 
who prefer Asian food. 


(c) Ho:nq = ny = np = ny= 25, where ng is the number 
of students in the school who would choose Asian 
food, and so on. 


(d) Ho:pa = pu = pe= pu= 9.25, where pa is the pro- 
portion of students in the school who would choose 
Asian food, and so on. 


(e) Ho:ba = fu = fp = pu = 9.25, where fy is the pro- 
portion of students in the sample who chose Asian 
food, and so on. 


20. The chi-square statistic is 


(a) 18 — 25)? is (22 — 25)? ts (39 — 25)? n (21 — 25)? 
25 25 25 25 
b) 25 — 18) i. (25 — 22)? : (25 — 39)? _ (25 — 21)? 
18 22 39 21 
fe) 18 — 25 : (22 — 25) i (39 — 25) & (21 — 25) 
25 25 a 25 
(a) 18 — 25) r (22 — 25)? : (39 — 25)? ‘ (21 — 25)? 
100 100 100 100 
0.18 — 0.25)? (0.22 -0.25)? (0.39 — 0.25) 
(e) a3 
0.25 0.25 0.25 
(0.21 — 0.25) 
0.25 


21. The P-value for a chi-square test for goodness of fit is 
0.0129. Which of the following is the most appropri- 


ate conclusion? 
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(a) Because 0.0129 is less than a = 0.05, reject Hp. 
There is convincing evidence that the food choices 
are equally popular. 


(b) Because 0.0129 is less than a = 0.05, reject Hp. 
There is not convincing evidence that the food 
choices are equally popular. 


(c) Because 0.0129 is less than a = 0.05, reject Hp. 
There is convincing evidence that the food choices 
are not equally popular. 


(d) Because 0.0129 is less than a = 0.05, fail to reject 
Ho. There is not convincing evidence that the food 
choices are equally popular. 


(e) Because 0.0129 is less than a = 0.05, fail to reject 
Ho. There is convincing evidence that the food 
choices are equally popular. 


22. Which of the following is false? 


(a) Achi-square distribution with k degrees of freedom 
is more right-skewed than a chi-square distribution 
with k + | degrees of freedom. 


(b) Achi-square distribution never takes negative values. 


(c) ‘The degrees of freedom for a chi-square test is deter- 
mined by the sample size. 


(d) P(x? > 10) is greater when df = k + 1 than when 
df=k 


(e) The area under a chi-square density curve is always 
equal to 1. 


Exercises 23 through 25 refer to the following setting. Do 
students who read more books for pleasure tend to earn 
higher grades in English? The boxplots below show data 
from a simple random sample of 79 students at a large 
high school. Students were classified as light readers if 
they read fewer than 3 books for pleasure per year. Other- 
wise, they were classified as heavy readers. Each student’s 
average English grade for the previous two marking peri- 
ods was converted to a GPA scale where A+ = 4.3, 
A=4.0, A— =3.7, B + =3.3, and so on. 


English Grades vs Type of Reader 


Heavy 


i 
3 
: 


Light 


28 30 32 34 36 38 40 42 
English_Grades_GPA_scale 
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Reading and grades (1.3) Write a few sentences 
comparing the distributions of English grades for 
light and heavy readers. 


Reading and grades (10.2) Summary statistics for 
the two groups from Minitab are provided below. 


Type of reader N Mean StDev SE Mean 
Heavy 47 3.640 0.324 0.047 
Light 32 34356 0.380 0.067 


Explain why it is acceptable to use two-sample t pro- 
cedures in this setting. 


Construct and interpret a 95% confidence interval 
for the difference in the mean English grade for light 
and heavy readers. 


Does the interval in part (b) provide convincing 
evidence that reading more causes a difference in 
students’ English grades? Justify your answer. 


Reading and grades (3.2) The Fathom scatterplot 
below shows the number of books read and the 
English grade for all 79 students in the study. A least- 
squares regression line has been added to the graph. 


2 ££ 6 & 
Books_Read 
— GPA = 3.42 + 0.024Books_Read: r2 = 0.083 


10 12 14 16 18 20 22 


Interpret the meaning of the slope and y intercept in 
context. 
The student who reported reading 17 books for plea- 


sure had an English GPA of 2.85. Find this student's 
residual and interpret this value in context. 


How strong is the relationship between English 
grades and number of books read? Give appropriate 
evidence to support your answer. 

Yahtzee (5.3, 6.3) In the game of Yahtzee, 5 six- 
sided dice are rolled simultaneously. To get a Yahtzee, 
the player must get the same number on all 5 dice. 


Luis says that the probability of getting a Yahtzee in one 
5 


roll of the dice is (z) . Explain why Luis is wrong. 


Nassir decides to keep rolling all 5 dice until he gets 
a Yahtzee. He is surprised when he still hasn’t gotten 
a Yahtzee after 25 rolls. Should he be? Calculate an 


appropriate probability to support your answer. 


Section 11.2 Inference for Two-Way Tables 4,697 


Inference for 
Two-Way Tables 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Compare conditional distributions for data in a two-way e Perform a chi-square test for homogeneity. 


table. e Perform a chi-square test for independence. 
State appropriate hypotheses and compute expected counts e Choose the appropriate chi-square test. 


for a chi-square test based on data in a two-way table. 


Calculate the chi-square statistic, degrees of freedom, 
and P-value for a chi-square test based on data in a 
two-way table. 


The two-sample z procedures of Chapter 10 allow us to compare the proportions of 
successes in two populations or for two treatments. What if we want to compare more 
than two samples or groups? More generally, what if we want to compare the distribu- 
tions of a single categorical variable across several populations or treatments? We need 
a new statistical test. ‘The new test starts by presenting the data in a two-way table. 
‘Two-way tables have more general uses than comparing distributions of a single 
categorical variable. As we saw in Section 1.1, they can be used to describe rela- 
tionships between any two categorical variables. In this section, we will start by 
developing a test to determine whether the distribution of a categorical variable 
is the same for each of several populations or treatments. This test is called a chi- 
square test for homogeneity. Then we'll examine a related test to see whether 
there is convincing evidence of an association between the row and column vari- 
ables in a two-way table. This test is known as a chi-square test for independence. 


Comparing Distributions of a 
Categorical Variable 


We'll start with an example involving a randomized experiment. 


Does Background Music Influence 
What Customers Buy? 


Comparing conditional distributions 


Market researchers suspect that background music may affect the mood and buy- 
ing behavior of customers. One study in a European restaurant compared three 
randomly assigned treatments: no music, French accordion music, and Italian 
string music. Under each condition, the researchers recorded the number of 
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customers who ordered French, Italian, and other entrees.’ Here is a table that 
summarizes the data: 


Type of Music 
Entree ordered None French Italian Total 
French 30 39 30 99 
Italian 11 1 19 31 
Other 43 35 35 113 
Total 84 75 84 243 


PROBLEM: 
(a) Calculate the conditional distribution (in proportions) of entrees ordered for each treatment. 
(b) Make an appropriate graph for comparing the conditional distributions in part (a). 


(c) Write afew sentences comparing the distributions of entrees ordered under the three music 
treatments. 


SOLUTION: 


(a) When no music was playing, the distribution of entree orders was 


30 tal 43 
French: —- = 0.357 Italian: —- = 0.131 Other: —— = 0.512 
84 84 84 
When French accordion music was playing, the distribution of entree orders was 
French a8 0.520  Itali i 0.013 Oth *” 0.467 
rench: —— = 0. alian: —~ = 0. Ca — 0; 
ie 75 75 
When Italian string music was playing, the distribution of entree orders was 


30 19 35 
French:—— = 0.357 Italian: = 0.226 Other: = 0.417 
84 84 84 


(b) The bar graphs in Figure 11.6 compare the distributions of entrees ordered for each of the 
three music treatments. 


Music = None Musi¢ = French Music = Italian 

60 60 60 

50 50 50 

40 
— coy 7 
§ 8 8 

s s 50 

20 

10 

0 0 

French Italian Other French Italian Other French Italian Other 
Entree Entree Entree 
(a) (b) (¢) 


FIGURE 11.6 Bar graphs comparing the distributions of entrees ordered for different music conditions. 
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(c) The type of entree that customers order seems to differ considerably across the three music 
treatments. Orders of Italian entrees are very low (1.3%) when French music is playing but are higher 
when Italian music (22.6%) or no music (13.1%) is playing. French entrees seem popular in this 
restaurant, as they are ordered frequently under all music conditions but notably more often when 
French music is playing. For all three music treatments, the percent of Other entrees ordered was 
similar. 


For Practice Try Exercise 


The researchers in the restaurant study expected that music would influence 
customer orders, so the type of music played is the explanatory variable and the 
type of entree ordered is the response variable. A good general strategy is to com- 
pare the conditional distributions of the response variable for each value of the 
explanatory variable. That’s why we compared the conditional distributions of 
entrees ordered for each type of music played. 

It is common practice to describe a two-way table by its number of rows and 
columns (not including totals). For instance, the data in the previous example 
were given ina 3 X 3 table. The following Check Your Understanding involves 
a3 X 2 table. 


CHECK YOUR UNDERSTANDING 


The Pennsylvania State University has its main campus in the town of State College and 
more than 20 smaller “commonwealth campuses” around the state. ‘The Penn State Divi- 
sion of Student Affairs polled separate random samples of undergraduates from the main 
campus and commonwealth campuses about their use of online social networking. Face- 
book was the most popular site, with more than 80% of students having an account. Here 
is a comparison of Facebook use by undergraduates at the main campus and common- 
wealth campuses who have a Facebook account:° 


Use Facebook Main campus Commonwealth 
Several times a month or less 55 76 
At least once a week 215 157 
At least once a day 640 394 
Total Facebook users 910 627 


1. Calculate the conditional distribution (in proportions) of Facebook use for each 
campus setting. 


2. Why is it important to compare proportions rather than counts in Question 1? 


3. Make a bar graph that compares the two conditional distributions. What are the 
most important differences in Facebook use between the two campus settings? 


Stating Hypotheses The null hypothesis in the restaurant example is 


Ho: There is no difference in the true distributions of entrees ordered at this 
restaurant when no music, French accordion music, or Italian string 
music is played. 
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If the null hypothesis is true, the observed differences in the distributions of en- 
trees ordered for the three groups are due to the chance involved in the random 
assignment of treatments. The alternative hypothesis says that there is a difference 
but does not specify the nature of that difference: 


H,: There is a difference in the true distributions of entrees ordered at this 
restaurant when no music, French accordion music, or Italian string 
music is played. 


Any difference among the three true distributions of entrees ordered when no 
music, French accordion music, or Italian string music is played means that the 
null hypothesis is false and the alternative hypothesis is true. The alternative hy- 
pothesis is not one-sided or two-sided. We might call it “many-sided” because it 
allows any kind of difference. 

With only the methods we already know, we might start by comparing the pro- 
portions of French entrees ordered when no music and French accordion music 
are played. We could similarly compare other pairs of proportions, ending up with 
many tests and many P-values. This is a bad idea. The P-values belong to each test 
separately, not to the collection of all the tests together. 


Type of Music 
Entree ordered None French Italian Total 
French 30 39 30 99 
Italian 11 1 19 31 
Other 43 35 35 113 
Total 84 75 84 243 


When we do many individual tests or construct many confidence inter- 
vals, the individual P-values and confidence levels don’t tell us how con- @ 
fident we can be in all the inferences taken together. Because of this, it’s 
cheating to pick out one large difference from the two-way table and then perform 
a significance test as if it were the only comparison we had in mind. For example, 
the proportions of French entrees ordered under the no music and French ac- 
cordion music treatments are 30/84 = 0.357 and 39/75 = 0.520, respectively. A 
two-sample z test shows that the difference between the proportions is statistically 
significant (z = 2.06, P = 0.039) if we make just this one comparison. 

But we could also pick a comparison that is not significant. For example, the 
proportions of Italian entrees ordered for the no music and Italian string music 
treatments are 11/84 = 0.131 and 19/84 = 0.226, respectively. These two propor- 
tions do not differ significantly (z = 1.61, P = 0.107). Individual comparisons 
can’t tell us whether the three distributions of the categorical variable (in this case, 
type of entree ordered) are significantly different. 

The problem of how to do many comparisons at once with an overall measure 
of confidence in all our conclusions is common in statistics. This is the problem 
of multiple comparisons. Statistical methods for dealing with multiple compari- 
sons usually have two parts: 


1. An overall test to see if there is convincing evidence of any differences among 
the parameters that we want to compare. 


2. A detailed follow-up analysis to decide which of the parameters differ and to 
estimate how large the differences are. 
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The overall test uses the familiar chi-square statistic. But in this new setting the 
test will be used to compare the distribution of a categorical variable for several 
populations or treatments. The follow-up analysis can be quite elaborate. We will 
concentrate on the overall test and do a follow-up analysis only when the observed 
results are statistically significant. 


Expected Counts and the 
Chi-Square Statistic 


A chi-square test for homogeneity begins with the hypotheses 


It would also be correct to state the Ho: There is no difference in the distribution of a categorical variable for 
null hypothesis as Ho: The distribution several populations or treatments. 

of a categorical variable is the same . ; ; — . : 2 

for each of several populations H,: There is a difference in the distribution of a categorical variable for 
or treatments. We prefer the “no several populations or treatments. 


difference” wording because it’s more 

consistent with the language we used") perform a test, we compare the observed counts in a two-way table with the 
counts we would expect if Hp were true. Finding the expected counts is not that 
difficult, as the following example illustrates. 


in the significance tests of Chapter 10. 


Does Background Music Influence 


What Customers Buy? 


Computing expected counts 


The null hypothesis in the restaurant experiment is that there’s no difference in 
the distribution of entrees ordered when no music, French accordion music, or 
Italian string music is played. To find the expected counts, we start by assuming 
that Ho is true. We can see from the two-way table that 99 of the 243 entrees 
ordered during the study were French. 


Type of Music 
Entree ordered None French Italian Total 
French 30 39 30 99 
Italian 11 1 19 31 
Other 43 35 35 113 
Total 84 75 84 243 


If the specific type of music that’s playing has no effect on entree orders, the pro- 
portion of French entrees ordered under each music condition should be 99/243 = 
0.4074. For instance, there were 84 total entrees ordered when no music was play- 
ing. We would expect 


2 
84 - 7437 84(0.4074) = 34.22 
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Although any count of entrees 
ordered must be a whole number, 
an expected count need not be. The 
expected count gives the average 
number of entrees ordered if Hp is 
true and the random assignment 
process is repeated many times. 
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of those entrees to be French, on average. The expected counts of French entrees 
ordered under the other two music conditions can be found in a similar way: 


French music: 75(0.4074) = 30.56 Italian music: 84(0.4074) = 34.22 


We repeat the process to find the expected counts for the other two types of entrees. 
The overall proportion of Italian entrees ordered during the study was 31/243 = 
0.1276. So the expected counts of Italian entrees ordered under each treatment are 


No music: 84(0.1276) = 10.72 
Italian music: 84(0.1276) = 10.72 


French music: 75(0.1276) = 9.57 


The overall proportion of Other entrees ordered during the experiment was 113/243 = 
0.465. So the expected counts of Other entrees ordered for each treatment are 


No music: 84(0.465) = 39.06 
Italian music: 84(0.465) = 39.06 


French music: 75(0.465) = 34.88 


The following table summarizes the expected counts for all three treatments. Note 
that the values for no music and Italian music are the same because 84 total entrees 
were ordered under each condition. We can check our work by adding the expected 


counts to obtain the (Ow 200 


column totals, as in the table. 


These should be the same as Type of Music 

those in the table of observed _ | Entree ordered None French _ italian Total 
counts except for small round- _ | French 34.22 30.56 34.22 99 
off errors, such as 75.01 rather __| !talian 10.72 9.57 10.72 31 
than 75 for the total number of _ | Other 39.06 34.88 39.06 113 
French entrees ordered. Total 84 15 84 243 


Let’s take a look at the two-way table from the restaurant study one more time. 
In the example, we found the expected count of French entrees ordered when no 
music was playing as follows: 


99 
84 543 34.22 
Type of Music 
Entree ordered None French Italian Total 
French 30 39 30 
Italian 11 1 19 31 
Other 43 35 35 113 


Total 


75 84 


We marked the three numbers used in this calculation in the table. These values 
are the row total for French entrees ordered, the column total for entrees ordered 
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when no music was playing, and the table total of entrees ordered during the ex- 
periment. We can rewrite the original calculation as 


84-99 99-84 
243.243 


= 34.22 


This suggests a more general formula for the expected count in any cell of a two- 
way table: 


row total - column total 
table total 


FINDING EXPECTED COUNTS 


All the expected counts in the restaurant study are at least 5. This satisfies the 
Large Counts condition. The Random condition is met because the treatments 
were assigned at random. We don’t need to check the 10% condition because 
the researchers were not sampling without replacement from some population of 
interest. They just performed an experiment using customers who happened to be 
in the restaurant at the time. 


CONDITIONS FOR PERFORMING A CHI-SQUARE 
TEST FOR HOMOGENEITY 


Just as we did with the chi-square test for goodness of fit, we compare the ob- 
served counts with the expected counts using the statistic 


(Observed — Expected)? 
Expected 


x=> 


This time, the sum is over all cells (not including the totals!) in the two-way table. 
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Does Background Music Influence 
What Customers Buy? 


The chi-square statistic 


PROBLEM: The tables below show the observed and expected counts for the restaurant 
experiment. Calculate the chi-square statistic. Show your work. 


Type of Music Type of Music 
Entree ordered None French italian ‘Total Entree ordered None French Italian Total 
French 30 30 99 French 34.22 30.56 34.22 99 
Italian 11 19 31 Italian 10.72 9.57 10.72 31 
Other 43 35 113 Other 39.06 3488 39.06 
Total 84 84 243 Total 84 75 84 


SOLUTION: For French entrees with no music, the observed count is 30 orders and the expected 


AP® EXAM TIP In the “Do” 
count is 34.22. The contribution to the x” statistic for this cell is 


step, you aren’t required 
to show every term in the (Observed — Expected)? (30 — 34.22)" 
chi-square statistic. Writing = 

the first few terms of the Expected 54.22 
sum followed by “....” is The x? statistic is the sum of nine such terms: 

considered as “showing , 
work.” We suggest that ran . (Observed — Expected) _ (30 — 34.22)? (39 — 30.56)? _ (35 — 39.06)? 
you do this and then let a Expected 34.22 30.56 39.06 


Sie en = 0.52 + 2.33 +--+ + 042 = 18.28 


= Oe” 


For Practice Try Exercise 


As in the test for goodness of fit, you should think of the chi-square statistic 
x? as a measure of how much the observed counts deviate from the expected 
counts. Once again, large values of y’ are evidence against Hy and in favor of H,. 
The P-value measures the strength of this evidence. When conditions are met, 
P-values for a chi-square test for homogeneity come from a chi-square distribution 
with df = (number of rows — 1) X (number of columns — 1). 


Does Background Music Influence 
What Customers Buy? 


P-value and conclusion 


Earlier, we started a significance test of 


Ho: There is no difference in the true distributions of entrees ordered at this 
restaurant when no music, French accordion music, or Italian string 
music is played. 
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H,: There is a difference in the true distributions of entrees ordered at this 
restaurant when no music, French accordion music, or Italian string 
music is played. 


Type of Music 
Entree ordered None French Italian Total 
French 30 39 30 99 
Italian 11 1 19 31 
Other 43 35 35 113 
Total 84 75 84 243 


We already checked that the conditions are met. Our calculated test statistic is 


18.28, 


PROBLEM: 

(a) Use Table C to find the P-value. Then use your calculator’s y*cd£ command. 
(b) Interpret the P-value from the calculator in context. 

(c) What conclusion would you draw? Justify your answer. 


SOLUTION: 


(a) Because the two-way table that summarizes the data from the study has three rows and three 
columns, we use a chi-square distribution with df = (3 — 1)(3 — 1) = 4 to find the P-value. 


° Table: Look at the df = 4 rowin Table C. The calculated value P 
\7 = 18.28 lies between the critical values 16.42 and 18.47. The df 0025 001 
corresponding P-value is between 0.001 and 0.0025. 4 


* Calculator: The command y*cdf£ (18.28,10000, 4) gives 
0.0011. 


(b) Assuming that there is no difference in the true distributions of entrees ordered in this restau- 
rant when no music, French accordion music, or Italian string music is played, there is a0.00111 prob- 
ability of observing a difference in the distributions of entrees ordered among the three treatment 
groups as large as or larger than the one in this study. 

(c) Because the P-value, 0.0011, is less than our default c. = 0.05 significance level, we reject 
Ho. We have convincing evidence of a difference in the true distributions of entrees ordered at this 
restaurant when no music, French accordion music, or Italian string music is played. Furthermore, the 
random assignment allows us to say that the difference is caused by the music that’s played. 


For Practice Try Exercise 


CHECK YOUR UNDERSTANDING 


In the previous Check Your Understanding (page 699), we presented data on the use of 
Facebook by two randomly selected groups of Penn State students. Here are the data once 
again. 


Use Facebook Main campus Commonwealth 
Several times a month or less 55 76 
At least once a week 215 157 
At least once a day 640 394 


Total Facebook users 910 627 
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Do these data provide convincing evidence of a difference in the distributions of Facebook 
use among students in the two campus settings? 


1. State appropriate null and alternative hypotheses for a significance test to help answer 
this question. 


2. Calculate the expected counts. Show your work. 
Calculate the chi-square statistic. Show your work. 
Use Table C to find the P-value. Then use your calculator’s x7cdf command. 


Interpret the P-value from the calculator in context. 


ONE ae 


What conclusion would you draw? Justify your answer. 


Calculating the expected counts and then the chi-square statistic by hand is a 
bit time-consuming. As usual, technology saves time and gets the arithmetic right. 


CHI-SQUARE TESTS FOR TWO-WAY 


CORNER TABLES ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


You can use the T1-83/84 or T1-89 to perform calculations for a chi-square test for homogeneity. We'll use the data from 
the restaurant study to illustrate the process. 


1. Enter the observed counts in matrix [A]. 


TI-83/84 TI-89 
e Press [2nd]| x"? | (MATRIX), arrow to EDIT, and e Press [Apps], select Data/Matrix Editor and 
choose A. then New. ... 
e Enter the dimensions of the matrix: 3 X 3. e Adjust your settings to match those shown. 
NORMAL FLOAT ALITO REAL RADIAN CL fl 
NAMES MATH (eae —————_——— 
RICA] : Matrix? 
2: (BI : mdi > 
3:[C] 
a a Row dimension: [= | 
6:LF] Coldimansion: [F__] 
7:[G] 
8: CH] 
9VCT] MaIN RAD AUTO FUME 


NORMAL FLOAT ALTO REAL RADIAN CL 


- Uae 
P fis eee 
MATRIXCA] 3 x3 
( KOM 9 30 
ce 4a 1 19 1 
c 4S 35 35 1 


taK1,19= 30 


2. Specify the chi-square test, the matrix where the observed counts are found, and the matrix where the expected 
counts will be stored. 
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e §=©Press [STAT], arrow to TESTS, and e In the Statistics/List Editor, press [2nd 
choose y*-Test. [F1]({F6]), and choose Chi2 2-way.... 
e Adjust your settings as shown. e Adjust your settings as shown. 
X2-Test 
Observed: [A] Dbserved Mat: [a +d 
Expected: [B] 
Color: EUS Store Exrected ko: 
Calculate Draw 4 StoreCompMatte: [itatuarssc | 


Results: Calculate + 


1isti={0,1,2,35,4,5,6,7,5,.. 
MAIN RAD AUTO FUNG 176 


3. Choose “Calculate” or “Draw” to carry out the test. If you choose “Calculate,” you should get the test statistic, 
P-value, and df shown below. If you specify “Draw,” the chi-square distribution with + degrees of freedom will be 
drawn, the area in the tail will be shaded, and the P-value will be displayed. 


X%2=18.27921151 a Chi-e Si8.2792 
p=, 0010882802 | =.001088 
df=4 1 | a zh, 


SUC34.2222730.5... 
SIL S20824/2,32... 


list1={0,1,2,3 5 F 
MAIN RAD AUTO FUNC iv 6 


x2-Test 
RF=18.2792 


4. To see the expected counts, go to the home screen and ask for a display of the matrix [B]. 


e Press [2nd][ x"? | (MATRIX), arrow to EDIT, e Press [2nd]|-] (Var-LINK) and 
and choose [B]. choose B. 


waiafeiaaeeemualea ar 


MATRIXCB] 3 x3 

k 34.222 30,556 34.222 3 

{ 16.716 9.5679 10.716 J 

[ 39.062 34.877 39.062 1 a 


34.2222 30.5556 34.222 
19.716 F.sore Is. icp 
39.0617 34.8765 39,061 


Man RAD AUTO FUNC i/ta 


AP® EXAM TIP You can use your calculator to carry out the mechanics of a significance test on the AP® exam. But there’s a 
risk involved. If you just give the calculator answer with no work, and one or more of your values is incorrect, you will probably 
get no credit for the “Do” step. We recommend writing out the first few terms of the chi-square calculation followed by “... ”. 


This approach might help you earn partial credit if you enter a number incorrectly. Be sure to name the procedure (x?-Test for 
homogeneity) and to report the test statistic (y? =18.279), degrees of freedom (df = 4), and P-value (0.0011). 


The Chi-Square Test for Homogeneity 


In Section 11.1, we used a chi-square test for goodness of fit to test a hypothesized 
model for the distribution of a categorical variable. Our P-values came from a 
chi-square distribution with df = the number of categories —1. When the Ran- 
dom, 10%, and Large Counts conditions are met, the x’ statistic calculated from 
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a two-way table can be used to perform a test of Ho: There is no difference in the 
distribution of a categorical variable for several populations or treatments. This 
new procedure is known as a chi-square test for homogeneity. 


This test is also known as a chi-square 
test for homogeneity of proportions. We CHI-SQUARE TEST FOR HOMOGENEITY 
prefer the simpler name. 
Suppose the conditions are met. You can use the chi-square test for homo- 


geneity to test 


Ho: There is no difference in the distribution of a categorical variable 
for several populations or treatments. 


H,: There is a difference in the distribution of a categorical variable 
for several populations or treatments. 


Start by finding the expected counts. Then calculate the chi-square statistic 


(Observed — Expected)? 
Expected 


=> 


where the sum is over all cells (not including totals) in the two-way table. If 
Hy is true, the y? statistic has approximately a chi-square distribution with 
degrees of freedom = (number of rows — 1)(number of columns — 1). The 
P-value is the area to the right of y* under the corresponding chi-square 
density curve. 


Let’s look at an example of a chi-square test for homogeneity from start to 
finish. As usual, we follow the four-step process when performing a signifi- 
cance test. 


Are Cell-Only Telephone 


Users Different? 4 
The chi-square test for homogeneity L. 


Random digit dialing telephone surveys used to exclude cell phone numbers. If the 
opinions of people who have only cell phones differ from those of people who have 
landline service, the poll results may not represent the entire adult population. The 
Pew Research Center interviewed separate random samples of cell-only and land- 
line telephone users who were less than 30 years old. Here’s what the Pew survey 
found about how these people describe their political party affiliation:’ 


Cell-only sample Landline sample 


Democrat or lean Democratic 49 47 
Refuse to lean either way 15 27 
Republican or lean Republican 32 30 


Total 96 
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PROBLEM: 
(a) Compare the distributions of political party affiliation for cell-only and landline phone users. 


(b) Do these data provide convincing evidence at the cv = 0.05 level that the distribution of party 
affiliation differs in the under-30 cell-only and landline user populations? 


SOLUTION: 


(a) Because the sample sizes are different, we should compare the proportions of individuals in each 
political affiliation category in the two samples. The table below shows the conditional distributions 
of political party affiliation for cell-only and landline phone users. We made a segmented bar graph to 
compare these two distributions. Cell-only users appear slightly more likely to declare themselves 
as Democrats or Republicans than people who have landlines. People with landlines seem much more 
likely to say they don’t lean Democratic or Republican than those who use only cell phones. 


1.00 
0.90 
0.80 
0.70 
0.60 
0.50 
0.40 
0.30 
0.20 
0.10 
0.00 


Proportion 


Cell-only 


Phone status 


Di Pemocrat No lean @ Republican 


Phone Status 
Political affiliation Cell only Landline 
Democrat 0.51 0.45 
No lean 0.16 0.26 
Republican 0.33 0.29 


(b) STATE: We want to perform a test of 


Ho: There is no difference in the distribution of party affiliation 


Landline in the under-30 cell-only and landline populations. 


H,: There is a difference in the distribution of party affiliation 
in the under-30 cell-only and landline populations. 


NORMAL FLOAT AUTO REAL RADIAN CL 


MATRIXCBI 3 x2 


c 46.68 49.92 
kK 26.146 21.84 J] 
f 29.76 32.24 J] 


at the a = 0.05 level. 
Ty] «PLAN: If conditions are met, we should use a chi-square test for homogeneity. 


* Random: The data came from independent random samples of 96 cell-only and 104 
landline users. 
° 10%: Sampling without replacement was used, so there need to be at least 10(96) = 
960 cell-only users under age 30 and at least 10(104) = 1040 landline users under 
age 30. This is safe to assume. 
* LargeCounts: We followed the steps in the Technology Corner on page 706 to get the 
expected counts. The calculator screen shot confirms that all expected counts are at least 5. 


NORMAL FLOAT AUTO REAL RADIAN CL f 


%2-Test 
%2=3.2199 


p=.1999 


DO: Achi-square test on the calculator gave 


° Test statistic: 
= 2 
as (Observed — Expected) 
Expected 
_ (49 — 46.08) (47 — 49.92)? | 
~ 4608 4992 


* Pyalue: Using df = (number of rows — 1)(number of columns — 1) = (3 — 1)(2 — 1) = 2, 
the P-value is 0.1999. 


7° = 3.22 


CONCLUDE: Because our P-value, 0.1999, is greater than « = 0.05, we fail to reject Ho. There 
is not convincing evidence that the distribution of party affiliation differs in the under-30 cell-only 
and landline user populations. 


For Practice Try Exercise 
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FIGURE 11.7 Minitab output for 

the two-way table in the restaurant 
study. The output gives the observed 
counts, the expected counts, and 
the individual components of the 
chi-square statistic. 


THINK 
ABOUT IT 


Follow-up Analysis The chi-square test for homogeneity allows us to com- 
pare the distribution of a categorical variable for any number of populations or treat- 
ments. Ifthe test allows us to reject the null hypothesis of no difference, we may want 
to do a follow-up analysis that examines the differences in detail. Start by examining 
which cells in the two-way table show large deviations between the observed and ex- 

(Observed — Expected)? 


Expected 


pected counts. Then look at the individual components 


to see which terms contribute most to the chi-square statistic. 

Our earlier restaurant study found significant differences among the true distri- 
butions of entrees ordered under each of the three music conditions. We entered 
the two-way table for the study into Minitab software and requested a chi-square test. 
The output appears in Figure 11.7. Minitab repeats the two-way table of observed 
counts and puts the expected count for each cell below the observed count. Finally, 
the software prints the 9 individual components that contribute to the x” statistic. 


Chi-Square Test: None, French, Italian 


Expected counts are printed below observed counts 
Chi-Square contributions are printed below expected counts 


None French Italian Total 
1 30 39 30 99 
34.22 30.56 34.22 
0.521 2.334 0.521 
2 ni 1 19 31 
10.72 9.57 10.72 
0.008 (7.672) (6.404) 
3 43 35 35 113 
39.06 34.88 39.06 
0.397 0.000 0.422 
Total 84 75 84 243 


Chi-Sq = 18.279, DF = 4, P-Value = 0.001 


Looking at the output, we see that just two of the nine components that make 
up the chi-square statistic contribute about 14 (almost 77%) of the total y* = 
18.28. Comparing the observed and expected counts in these two cells, we see 
that orders of Italian entrees are much below expectation when French music is 
playing and well above expectation when Italian music is playing. We are led to 
a specific conclusion: orders of Italian entrees are strongly affected by Italian and 
French music. More advanced methods provide tests and confidence intervals 
that make this follow-up analysis more complete. 


What if we want to compare several proportions? Many studies 
involve comparing the proportion of successes for each of several populations or 
treatments. The two-sample z test from Chapter 10 allows us to test the null hy- 
pothesis Ho: p; = p2, where p; and p are the true proportions of successes for the 
two populations or treatments. The chi-square test for homogeneity allows us to 
test Ho: pi = p2 = +++ = px. This null hypothesis says that there is no difference 
in the proportions of successes for the k populations or treatments. The alterna- 
tive hypothesis is H,: at least two of the ps are different. Many students incorrectly 
state H, as “all the proportions are different.” Think about it this way: the opposite 
of “all the proportions are equal” is “some of the proportions are not equal.” 


ey 
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CHECK YOUR UNDERSTANDING 


Canada has universal health care. The United States does not but often offers more elabo- 
rate treatment to patients with access. How do the two systems compare in treating heart 
attacks? Researchers compared random samples of U.S. and Canadian heart attack pa- 
tients. One key outcome was the patients’ own assessment of their quality of life relative to 
what it had been before the heart attack. Here are the data for the patients who survived 
a year: 


Quality of life Canada United States 
Much better 75 541 
Somewhat better 71 498 
About the same 96 779 
Somewhat worse 50 282 
Much worse 19 65 
Total 311 2165 


1. Construct an appropriate graph to compare the distributions of opinion about 
quality of life among heart attack patients in Canada and the United States. 

2. Is there a significant difference between the two distributions of quality-of life 
ratings? Carry out an appropriate test at the a = 0.01 level. 


Relationships between 
Two Categorical Variables 


‘Two-way tables can arise in several ways. The restaurant experiment compared en- 
trees ordered for three music treatments. The phone use and political party affilia- 
tion observational study compared independent random samples from the cell-only 
and landline user populations. In both cases, we are comparing the distributions of 
a categorical variable for several populations or treatments. We use the chi-square 
test for homogeneity to perform inference in such settings. 

Another common situation that leads to a two-way table is when a single ran- 
dom sample of individuals is chosen from a single population and then classified 
based on two categorical variables. In that case, our goal is to analyze the relation- 
ship between the variables. The next example describes a study of this type. 


Angry People and Heart Disease 


Relationships between two categorical variables 


A study followed a random sample of 8474 people with normal blood pressure for 
about four years.® All the individuals were free of heart disease at the beginning of 
the study. Each person took the Spielberger Trait Anger Scale test, which measures 
how prone a person is to sudden anger. Researchers also recorded whether each in- 
dividual developed coronary heart disease (CHD). This includes people who had 
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heart attacks and those who needed medical treatment for heart disease. 
Here is a two-way table that summarizes the data: 


Low anger Moderate anger High anger Total 
CHD 53 110 27 190 
No CHD 3057 4621 606 8284 
Total 3110 4731 633 8474 


PROBLEM: 
(a) Is this an observational study or an experiment? Justify your answer. 


(b) Make a well-labeled bar graph that compares CHD rates for the different anger levels. 
Describe what you see. 


SOLUTION: 


(a) This is an observational study. Researchers did not deliberately impose any treat- 
ments. They just recorded data about two variables—anger level and whether or not the 
person developed CHD—for each randomly chosen individual. 


(b) Inthis setting, anger level is the explanatory variable and whether or not a person gets heart 
disease is the response variable. So we compare the percents of people who did and did not get 
heart disease in each of the three anger categories: 


CHD no CHD 
5S 3057 
Lowanger: =—~= 0.0170 = 1.704 =— = 0.9830 = 98.30% 
3110 3110 
110 4621 
Moderate anger: ——— = 0.0233 = 2.33% —— = 0.9767 = 97.67% 
4731 4731 
27 606 
High anger: —-— = 0.0427 =4.27% ——=0.9573 = 95.73% 
633 633 


The bar graph in Figure 11.8 shows the percent of people in each of the three anger catego- 
ries who developed CHD. There is a clear trend: as the anger score increases, so does the 
percent who suffer heart disease. A much higher percent of people in the high anger category 
developed CHD (4.277%) than in the moderate (2.33%) and low (1.70%) anger categories. 


s 
2 4 
= 3 
& 
8 2 
° 1 
FIGURE 11.8 Bar graph comparing 
the percents of people in each i Hedene High 
anger category who got coronary Aagee catina 
heart disease (CHD). 


For Practice Try Exercise 


Anger rating on the Spielberger scale is a categorical variable that takes three 
possible values: low, medium, and high. Whether or not someone gets heart 
disease is also a categorical variable. The two-way table in the example shows 
the relationship between anger rating and heart disease for a random sample of 


We could substitute the word 
“dependent” in place of “not 
independent” in the alternative 
hypothesis. We'll avoid this practice, 
however, because saying that two 
variables are dependent sounds too 
much like saying that changes in one 
variable cause changes in the other. 
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8474 people. Do these data provide convincing evidence of an association be- 
tween the variables in the larger population? ‘To answer that question, we work 
with a new significance test. 


The Chi-Square Test for Independence 


We often gather data from a random sample and arrange them in a two-way table 
to see if two categorical variables are associated. The sample data are easy to inves- 
tigate: turn them into percents and look for a relationship between the variables. 
Is the association in the sample evidence of an association between these variables 
in the entire population? Or could the sample association easily arise just from the 
luck of random sampling? This is a question for a significance test. 

Our null hypothesis is that there is no association between the two categorical 
variables in the population of interest. The alternative hypothesis is that there is an 
association between the variables. For the observational study of anger level and 
coronary heart disease, we want to test the hypotheses 


Ho: There is no association between anger level and heart-disease status 
in the population of people with normal blood pressure. 


H,: There is an association between anger level and heart-disease status 
in the population of people with normal blood pressure. 


No association between two variables means that knowing the value of one 
variable does not help us predict the value of the other. That is, the variables are 
independent. An equivalent way to state the hypotheses is therefore 


Ho: Anger and heart-disease status are independent in the population 
of people with normal blood pressure. 

H,: Anger and heart-disease status are not independent in the population 
of people with normal blood pressure. 


As with the two previous types of chi-square tests, we begin by comparing the ob- 
served counts in a two-way table with the expected counts if Ho is true. 


Angry People and Heart Disease 
Finding expected counts 


The null hypothesis is that there is no association between anger level and heart-disease 
status in the population of interest. If we assume that Hp is true, then anger level and 
CHD status are independent. We can find the expected cell counts in the two-way table 
using the definition of independent events from Chapter 5: P(A | B) = P(A). The chance 
process here is randomly selecting a person and recording his or her anger level and 
CHD status. 


Low anger Moderate anger High anger Total 
CHD 53 110 27 190 
No CHD 3057 4621 606 8284 


Total 3110 4731 633 8474 
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Let’s start by considering the events “CHD” and “low anger.” We see from the 
two-way table that 190 of the 8474 people in the study had CHD. If we imagine 
choosing one of these people at random, P(;CHD) = 190/8474. Because anger level 
and CHD status are independent, knowing that the selected individual is low an- 
ger does not change the probability that this person develops CHD. That is to say, 
P(CHD | low anger) = P(CHD) = 190/8474 = 0.02242. 


Of the 3110 low-anger people in the study, we’d expect 


190 
a0 3474 ~ 3110(0.02242) = 69.73 


to get CHD. You can see that the general formula we developed earlier for a test 
for homogeneity applies in this situation also: 
row total: column total _ 190-3110 


expected count = BES el coon a 69.73 


To find the expected count in the “low anger, no CHD” cell, we begin by not- 
ing that P(no CHD) = 8284/8474 = 0.97758 for a randomly selected person in the 
study. Of the 3110 low-anger people in the study, we would expect 


8284 
Ss 3474 ~ 3110(0.97758) = 3040.27 


to not develop CHD. 


We find the expected counts for the remaining cells in the two-way table in a 
similar way. 


CHD, Low CHD, Moderate CHD, High 
3110(0.02242) = 69.73 4731(0.02242) = 106.08 633(0.02242) = 14.19 


no CHD, Low no CHD, Moderate no CHD, High 
3110(0.97758) = 3040.27 4731(0.97758) = 4624.92 633(0.97758) = 618.81 


The 10% and Large Counts conditions for the chi-square test for indepen- 
dence are the same as for the homogeneity test. There is a slight difference in the 
Random condition for the two tests: a test for independence uses data from one 
sample but a test for homogeneity uses data from two or more samples/groups. 


CONDITIONS FOR PERFORMING A CHI-SQUARE 

TEST FOR INDEPENDENCE 

e Random: The data come from a well-designed random sample or ran- 
domized experiment. 


1 
© 10%: When sampling without replacement, check that n = 10 


e Large Counts: All expected counts are at least 5. 
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If the Random, 10%, and Large Counts conditions are met, the y” statistic 
calculated from a two-way table can be used to perform a test of Ho: There is no as- 
sociation between two categorical variables in the population of interest. P-values 
for this test come from a chi-square distribution with df = (number of rows — 1) X 
(number of columns — 1). This new procedure is known as a chi-square test for 
independence. 


CHI-SQUARE TEST FOR INDEPENDENCE 


The chi-square test for independence Suppose the conditions are met. You can use the chi-square test for inde- 
is also known as the chi-square test for 


association. 


pendence to test 
Ho: There is no association between two categorical variables 


in the population of interest. 


H,: There is an association between two categorical variables 
in the population of interest. 


Or, alternatively, 


Ho: Two categorical variables are independent in the 
population of interest. 


H,: Two categorical variables are not independent in 
the population of interest. 


Start by finding the expected counts. Then calculate the chi-square statistic 


(Observed — Expected)? 
Expected 


== 


where the sum is over all cells in the two-way table. If Hp is true, the y? sta- 
tistic has approximately a chi-square distribution with degrees of freedom = 
(number of rows — 1)(number of columns — 1). The P-value is the area to 
the right of x? under the corresponding chi-square density curve. 


Now we're ready to complete the significance test for the anger and heart dis- 
ease study. 


Angry People and Heart Disease «= A 


Chi-square test for independence iL 


Here is the complete table of observed and expected counts for the CHD and 
anger study side by side: 


Observed Expected 
Low Moderate High Low Moderate High 

53 110 27 69.73 106.08 14.19 
3057 4621 606 3040.27 4624.92 618.81 


CHD 
No CHD 
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Do the data provide convincing evidence of an association between anger level 
and heart disease in the population of interest? 
STATE: We want to perform a test of 


Ho: There is no association between anger level and heart-disease status in the 
population of people with normal blood pressure. 

H,: There is an association between anger level and heart-disease status in the 
population of people with normal blood pressure. 


Because no significance level was stated, we'll use a = 0.05. 


PLAN: Ifconditions are met, we should carry out a chi-square test for independence. 


Random: The data came from a random sample of 8474 people with normal blood pressure. 


° 10%: Because the researchers sampled without replacement, we need to check that the total 
number of people in the population with normal blood pressure is at least 10(8474) = 64,740. This 
seems reasonable to assume. 

Large Counts: Allthe expected counts are at least 5 (the smallest is 14.19), so this condition is met. 


DO: We perform calculations assuming Hp is true. 


° Test statistic: 
= 2 
pe ee Expected) 
Expected 
_ (53 — 69.73)" (110 — 106.08)? | _ (606 — 618.81)? 


69.73 106.08 j 618.81 
= 4.014 + 0.145 + +--+ 0.265 = 16.077 


¢ P-value: The two-way table of anger level versus heart disease has 2 rows and 3 columns. We 
will use the chi-square distribution with df = (2 — 1)(3 — 1) = 2 to find the P-value. Look at 


%2=16.07676213 the df = 2 line in Table C. The observed statistic x” = 16.077 is larger than the critical value 
eo 15.20 for c = 0.0005. So the P-value is less than 0.0005. 


Using Technology: The calculator’s \7-Test gives y” = 16.077 and P-value = 0.00032 using df = 2. 
CONCLUDE: Because the P-value of 0.00032 is less than a = 0.05, we reject Ho. We have 


convincing evidence of an association between anger level and heart-disease status in the population 
of people with normal blood pressure. 


For Practice Try Exercise 


A follow-up analysis reveals that two cells contribute most of the chi-square 
statistic: Low anger, CHD (4.014) and High anger, CHD (11.564). A much 
smaller number of low-anger people developed CHD than expected. And a much 
larger number of high-anger people got CHD than expected. 

Can we conclude that proneness to anger causes heart disease? No. 
The anger and heart-disease study is an observational study, not an experi- 
ment. It isn’t surprising that some other variables are confounded with 
anger level. For example, people prone to anger are more likely than others to be 
men who drink and smoke. We don’t know whether the increased rate of heart 
disease among those with higher anger levels in the study is due to their anger or 
perhaps to their drinking and smoking or maybe even to gender. 


AP® EXAM TIP If you have 
trouble distinguishing the two 
types of chi-square tests for 
two-way tables, you’re better 


off just saying “chi-square 
test” than choosing the wrong 
type. Better yet, learn to tell the 
difference! 
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CHECK YOUR UNDERSTANDING 


Many popular businesses are franchises—think of McDonald’s. ‘The owner of a local fran- 
chise benefits from the brand recognition, national advertising, and detailed guidelines 
provided by the franchise chain. In return, he or she pays fees to the franchise firm and 
agrees to follow its policies. The relationship between the local owner and the franchise 
firm is spelled out in a detailed contract. 

One clause that the contract may or may not contain is the entrepreneur’s right to an 
exclusive territory. ‘This means that the new outlet will be the only representative of the 
franchise in a specified territory and will not have to compete with other outlets of the 
same chain. How does the presence of an exclusive-territory clause in the contract relate 
to the survival of the business? 

A study designed to address this question collected data from a random sample of 170 
new franchise firms. ‘Two categorical variables were measured for each franchisor. First, 
the franchisor was classified as successful or not based on whether or not it was still offer- 
ing franchises as of a certain date. Second, the contract each franchisor offered to fran- 
chisees was classified according to whether or not there was an exclusive-territory clause. 
Here are the count data, arranged in a two-way table: 


Exclusive Territory 


Success Yes No Total 
Yes 108 15 123 
No 34 13 47 
Total 142 28 170 


Do these data provide convincing evidence at the a = 0.01 level of an association be- 
tween an exclusive-territory clause and business survival for new franchise firms? 


Using Chi-Square Tests Wisely 


Both the chi-square test for homogeneity and the chi-square test for 
independence start with a two-way table of observed counts. They even 
calculate the test statistic, degrees of freedom, and P-value in the same 

way. The questions that these two tests answer are different, however. A chi-square 
test for homogeneity tests whether the distribution of a categorical variable is 
the same for each of several populations or treatments. The chi-square test for 
independence tests whether two categorical variables are associated in some 
population of interest. 

Unfortunately, it is quite common to see questions asking about association 
when a test for homogeneity applies and questions asking about differences be- 
tween proportions or the distribution of a variable when a test of independence 
applies. Sometimes, people avoid the distinction altogether and pose questions 
about the “relationship” between two variables. 

Instead of focusing on the question asked, it’s much easier to look at how the 
data were produced. If the data come from two or more independent random 
samples or treatment groups in a randomized experiment, then do a chi-square 
test for homogeneity. If the data come from a single random sample, with the 
individuals classified according to two categorical variables, use a chi-square test 
for independence. 
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Scary Movies and Fear 
Choosing the right type of chi-square test 


Are men and women equally likely to suffer lingering fear from watching 
scary movies as children? Researchers asked a random sample of 117 col- 
lege students to write narrative accounts of their exposure to scary movies 
before the age of 13. More than one-fourth of the students said that some 
of the fright symptoms are still present when they are awake.'” The follow- 
ing table breaks down these results by gender. 


Gender 
Fright symptoms? Male Female Total 
Yes 7 29 36 
No 31 50 81 
Total 38 79 117 


Minitab output for a chi-square test using these data is shown below. 


Chi-Square Test: Male, Female 
Expected counts are printed below observed counts 


Chi-Square contributions are printed below expected counts 


Male Female Total 
il a 29 26 
dlak 69) 24.31 
ig tehtsis} 0.906 
2 suk 50 81 
PS. 5 Sil 54.69 
02837 0.403 
Total 38 50) uy 


Chi-Sq = 4.028, DF = 1, P-Value = 0.045 


PROBLEM: Assume that the conditions for performing inference are met. 

(a) Explain why a chi-square test for independence and not a chi-square test for homogeneity should 
be used in this setting. 

(b) State an appropriate pair of hypotheses for researchers to test in this setting. 

(c) Which cell contributes most to the chi-square statistic? In what way does this cell differ from 
what the null hypothesis suggests? 

(4) Interpret the P-value in context. What conclusion would you draw at « = 0.01? 


SOLUTION: 

(a) The data were produced using a single random sample of college students, who were then 
classified by gender and whether or not they had lingering fright symptoms. The chi-square test 

for homogeneity requires independent random samples from each population. 

(b) The null hypothesis is Ho: There is no association between gender and ongoing fright symptoms in 
the population of college students. The alternative hypothesis is H,: There is an association between 
gender and ongoing fright symptoms in the population of college students. 
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(c) Menwho admit to having lingering fright symptoms account for the largest component of the 
chi-square statistic: 1.633 of the total 4.028. Far fewer men in the sample admitted to fright 
symptoms (7) than we would expect if Ho were true (11.69). 


(4) If gender and ongoing fright symptoms really are independent in the population of interest, there 
isa 0.045 chance of obtaining a random sample of 117 students that gives a chi-square statistic 
of 4.026 or higher. Because the P-value, 0.045, is greater than 0.01, we would fail to reject Ho. We 
do not have convincing evidence that there is an association between gender and fright symptoms in 
the population of college students. 


For Practice Try Exercise 


What if we want to compare two proportions? Shopping at second- 
hand stores is becoming more popular and has even attracted the attention of busi- 
ness schools. A study of customers’ attitudes toward secondhand stores interviewed 
separate random samples of shoppers at two secondhand stores of the same chain 
in two cities. The two-way table shows the breakdown of respondents by gender."! 


City 1 City 2 
Men 38 68 
Women 203 150 
Total 241 218 


Do the data provide convincing evidence of a difference in the true gender distri- 
butions of shoppers at the two stores? 

To answer this question, we could perform a chi-square test for homogeneity. 
Our hypotheses are 


Ho: There is no difference in the true gender distributions of shoppers 
at the two stores. 


H,: There is a difference in the true gender distributions of shoppers 
at the two stores. 


But a difference in gender distributions would mean that there is a difference in 
the true proportions of female shoppers at the two stores. So we could also use a 


two-sample z test from Section 10.1 to compare two proportions. The hypotheses 


oe for this test are 
z=3. 915874113 
pres aazazaesis i, 
i=. 
62=, 6880733945 Ag: pi — p2 # 9 
b=, 7690631808 . 
ni=241 where p and p2 are the true proportions of women shoppers at Store | and Store 
nines 2, respectively. 
The TI-84 screen shots in the margin show the results from a two-sample z test for 
p; — p2 and from a chi-square test for homogeneity. (We checked that the Random, 
10%, and Large Counts conditions are met before carrying out the calculations.) 
NORMAL FLOAT AUTO REAL RADIAN CL fy Note that the P-values from the two tests are the same except for rounding er- 
rors. You can also check that the chi-square statistic is the square of the two-sample 
X2=15,. 33407007 z statistic: (3.915...)* = 15.334. 
are © As the previous example suggests, the chi-square test for homogeneity 4 
based on a 2 X 2 two-way table is equivalent to the two-sample z test for 


p1 — p2 with a two-sided alternative hypothesis. We cannot use a chi-square 
test for a one-sided alternative hypothesis. ‘The two-sample z procedures allow us 
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to perform one-sided tests and to construct confidence intervals for the difference 
between proportions. For that reason, we recommend the Chapter 10 methods for 
comparing two proportions whenever you are given a choice. 


Grouping quantitative data into categories As we mentioned in 
Chapter 1, it is possible to convert a quantitative variable to a categorical variable 
by grouping together intervals of values. Here’s an example. Researchers surveyed 
independent random samples of shoppers at two secondhand stores of the same 
chain in two cities. The two-way table below summarizes data on the incomes of 
the shoppers in the two samples. 


Income 

Under $10,000 
$10,000 to $19,999 
$20,000 to $24,999 
$25,000 to $34,999 
$35,000 or more 


City 1 
70 
52 
69 
22 
28 


Personal income is a quantitative variable. But by grouping the 


City 2 values of this variable, we create a categorical variable. We could use 
62 these data to carry out a chi-square test for homogeneity because the 
63 data came from independent random samples of shoppers at the two 
: stores. Comparing the distributions of income for shoppers at the 


two stores would give more information than simply comparing their 
24 mean incomes. 

What can we do if the expected cell counts aren’t all at 
least 5? Let’s look at a situation where this is the case. A sample survey asked a 
random sample of young adults, “Where do you live now? That is, where do you 
stay most often?” A two-way table of all 2984 people in the sample (both men and 
women) classified by their age and by where they lived is shown below.!” Living 
arrangement is a categorical variable. Even though age is quantitative, the two-way 
table treats age as dividing the young adults into four categories. The table gives the 
observed counts for all 20 combinations of age and living arrangement. 


Age (years) 
Living arrangement 19 20 21 22 Total 
Parents’ home 324 378 337 318 1357 
Another person’s home 37 47 40 38 162 
Your own place 116 279 372 487 1254 
Group quarters 58 60 49 25 192 
Other 5 2 3 9 19 
Total 540 766 801 877 2984 


Our null hypothesis is Ho: There is no association between age and living ar- 
rangement in the population of young adults. The table below shows the expected 
counts assuming Hp is true. We can see that two of the expected counts (circled in 
red) are less than 5. This violates the Large Counts condition. 


Age (years) 
Living arrangement 19 20 21 22 Total 
Parents’ home 245.57 348.35 364.26 398.82 1357 
Another person’s home 29.32 41.59 43.49 47.61 162 
Your own place 226.93 321.90 336.61 368.55 1254 
Group quarters 34.75 49.29 51.54 56.43 192 


Other 5.10 5.58 19 


Total 540 766 801 877 2984 
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A clever strategy is to “collapse” the table by combining two or more rows or col- 
umns. In this case, it might make sense to combine the Group quarters and Other 
living arrangements. Doing so and then running a chi-square test in Minitab gives 
the following output. Notice that the Large Counts condition is now met. 


Chi-Square Test: 19, 20, 21, 22 


Expected counts are printed below observed counts 
Chi-Square contributions are printed below expected counts 


19 20 21 a2 Total 
1 324 378 337 318 1357 
245.57 348.35 364.26 398.82 
25.049 2.525 2.040 16.379 
2 37 47 40 38 162 
29.32 41.59 43.49 47.61 
2.014 0.705 0.279 1.940 
3 116 279 372 487 1254 
226.93 321.90 336.61 368.55 
54.226 5.719 3.720 38.068 
4 63 62 52 34 211 
38.18 54.16 56.64 62.01 
16.129 1.134 0.380 12.654 
Total 540 766 801 877 2984 
Chi-Sq = 182.961, DF = 9, P-Value = 0.000 
=> 
Do Dogs Resemble Their Owners? 
g 
¥ In the chapter-opening Case Study (page 677), we described a 


study that investigated whether or not dogs resemble their own- 
ers. The researchers who conducted the experiment believe that 
resemblance between dog and owner might differ for purebred 
and mixed-breed dogs. Here is a two-way table summarizing the 
results of the experiment: 


Breed status 


Resemblance? Purebred dogs 


16 
9 


Mixed-breed dogs 


7 
13 


Resemble owner 
Don’t resemble 
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1. Why did researchers photograph a random sample of dogs and 
their owners in this study? 


Do the data from this study provide convincing evidence of an association be- 
tween dogs’ breed status and whether or not they resemble their owners? Ques- 
tions 2 through 5 address this issue. 


2. Which type of chi-square test should be used to help answer the 
question of interest? State an appropriate pair of hypotheses for 
the test you choose. 

3. The table shows the expected counts for the appropriate chi- 
square test in Question 2. 


Breed status 


Resemblance? Purebred dogs Mixed-breed dogs 


Resemble owner 12.78 10.22 
Don’t resemble 12.22 9.78 


(a) Show how the expected count for the cell “purebred dogs, 
resemble owner” was computed. 
(bp) Explain why the Large Counts condition is met. 


Find the test statistic and P-value. Be sure to state the degrees of 
freedom you are using. 
5. What conclusion would you draw? 


Summary 


e We can use a two-way table to summarize data involving two categorical vari- 
ables. To analyze the data, we compare the conditional distributions of one 
variable for each value of the other variable. Then we turn to formal infer- 
ence. Two different ways of producing data for two-way tables lead to two 
different types of chi-square tests. 


e Some studies aim to compare the distribution of a single categorical vari- 
able for each of several populations or treatments. In such cases, researchers 
should take independent random samples from the populations of interest or 
use the groups in a randomized experiment. The null hypothesis is that there 
is no difference in the distribution of the categorical variable for each of the 
populations or treatments. We use the chi-square test for homogeneity to 
test this hypothesis. 
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e The conditions for performing a chi-square test for homogeneity are: 
e Random: The data come from independent random samples or the 
groups in a randomized experiment. 
© 10%: When sampling without replacement, check that the popula- 
tion is at least 10 times as large as the sample. 
e Large Counts: All expected counts must be at least 5. 


e Other studies are designed to investigate the relationship between two cate- 
gorical variables. In such cases, researchers take a random sample from the 
population of interest and classify each individual based on the two categorical 
variables. The chi-square test for independence tests the null hypothesis that 
there is no association between the two categorical variables in the population 
of interest. Another way to state the null hypothesis is Ho: The two categorical 
variables are independent in the population of interest. 


e The conditions for performing a chi-square test for independence are: 


e Random: The data come from a well-designed random sample or ran- 
domized experiment. 


© 10%: When sampling without replacement, check that the popula- 
tion is at least 10 times as large as the sample. 


e Large Counts: All expected counts must be at least 5. 
e The expected count in any cell of a two-way table when Hp is true is 


row total - column total 
table total 


expected count = 


e The chi-square statistic is 


(Observed — Expected)? 
Expected 


hi 


xX 


where the sum is over all cells in the two-way table. 


Both types of chi-square tests for two-way tables compare the value of the statistic 
x? with critical values from the chi-square distribution with df = (number of 
rows — 1)(number of columns — 1). Large values of x7 are evidence against Ho 
and in favor of H,, so the P-value is the area under the chi-square density curve 
to the right of 7. 


e If the test finds a statistically significant result, consider doing a follow-up 
analysis that compares the observed and expected counts and that looks for 
the largest components of the chi-square statistic. 


TECHNOLOGY 
CORNER 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


27. Chi-square tests for two-way tables on the calculator 
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Exercises 


27. Why men and women play sports Do men and (a) Calculate the conditional distribution (in propor- 
poly women participate in sports for the same reasons? tions) of responses for each group of parents. 


& One goal for sports participants is social comparison — 
the desire to win or to do better than other people. 
Another is mastery —the desire to improve one’s 


(b) Make an appropriate graph for comparing the condi- 
tional distributions in part (a). 


skills or to try one’s best. A study on why students (c) Write a few sentences comparing the distributions of 
participate in sports collected data from independent responses for the three groups of parents. 
amples of aller femal - 
Sua ol ees og es 29. Why women and men play sports Refer to Exercise 
graduates at a large university.’ Each student was ee 
Speen : : 704 27. Do the data provide convincing evidence of a 

classified into one of four categories based on his or : ' Seen 

; : difference in the distributions of sports goals for male 
her responses to a questionnaire about sports goals. ee 

and female undergraduates at the university? 

The four categories were high social comparison—high 
mastery (HSC-HM), high social comparison—low mas- (a) State appropriate null and alternative hypotheses for 
tery (HSC-LM), low social comparison—high mastery a significance test to help answer this question. 


(LSC-HM), and low social comparison—low mastery 

(LSC-LM). One purpose of the study was to compare 
the goals of male and female students. Here are the (c) Calculate the chi-square statistic. Show your work. 
data displayed in a two-way table: 


(b) Calculate the expected counts. Show your work. 


30. How are schools doing? Refer to Exercise 28. Do 
Gender the data provide convincing evidence of a differ- 
ence in the distributions of opinions about how 


| Femal Mal 
ae alee Be high schools are doing among black, Hispanic, and 
HSC-HM 14 31 white parents? 
HSC-LM 1 18 ' ; 
LSC-HM 94 5 (a) State appropriate null and alternative hypotheses for 
LSC-LM 95 13 a significance test to help answer this question. 
(a) Calculate the conditional distribution (in propor- (b) Calculate the expected counts. Show your work. 
tions) of the reported sports goals for each gender. (c) Calculate the chi-square statistic. Show your work. 
(b) Make an appropriate graph for comparing the condi- Al, Whesnsnen wall nem alin aeons [ete to thee 
tional distributions in part (a). me 704 See Teun) 


(c) Write a few sentences comparing the distributions of & a) 


sports goals for male and female undergraduates. Cee Rete cond oer one 


chi-square test are met. 


(b) Use ‘Table C to find the P-value. Then use your 
calculator’s y7cad£ command. 


28. How are schools doing? ‘The nonprofit group 
Public Agenda conducted telephone interviews with 
three randomly selected groups of parents of high 
school children. ‘There were 202 black parents, 202 (c) Interpret the P-value from the calculator in context. 
Hispanic parents, and 201] white parents. One ques- 


tion asked was “Are the high schools in your state do- UY ng Some Ison Or Vea SuSE yOu 


: ‘ answer. 
ing an excellent, good, fair, or poor job, or don’t you 
know enough to say?” Here are the survey results:!* 32. How are schools doing? Refer to Exercises 28 and 30. 
Black Hispanic — White (a) Check that the conditions for performing the 
parents parents _parents chi-square test are met. 

Pecan, Ie a ee (b) Use ‘Table C to find the P-value. Then use your 

Good 69 a9 a calculator’s y7cd£ command. 

Fair 1 61 60 

Poor 24 24 24 (c) Interpret the P-value from the calculator in context. 

Don't know 22 28 le (d) What conclusion would you draw? Justify your 


Total 202 202 201 answer. 


23. 
708 
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30: 


Python eggs How is the hatching of water python 
eggs influenced by the temperature of the snake’s 
nest? Researchers randomly assigned newly laid eggs 
to one of three water temperatures: hot, neutral, or 
cold. Hot duplicates the extra warmth provided by 
the mother python, and cold duplicates the absence 
of the mother. Here are the data on the number of 


eggs that hatched and didn’t hatch: 


Water Temperature 
Hatched? Cold Neutral Hot 
Yes 16 38 15) 
No 11 18 29 


Compare the distributions of hatching status for the 
three treatments. 


Are the differences between the three groups statisti- 
cally significant? Give appropriate evidence to sup- 
port your answer. 


Don’t do drugs! Cocaine addicts need cocaine to 
feel any pleasure, so perhaps giving them an antide- 
pressant drug will help. A three-year study with 72 
chronic cocaine users compared an antidepressant 
drug called desipramine with lithium (a standard 
drug to treat cocaine addiction) and a placebo. 
One-third of the subjects were randomly assigned to 
receive each treatment. Here are the results:'° 


Drug administered 


Relapsed? Desipramine Lithium Placebo 
Yes 10 18 20 
No 14 6 4 


Compare the distributions of relapse status for the 
three treatments. 


Are the differences among the three groups statisti- 
cally significant? Give appropriate evidence to sup- 
port your answer. 


Sorry, no chi-square How do USS. residents who 
travel overseas for leisure differ from those who travel 
for business? The following is the breakdown by oc- 
cupation:!” 


Leisure Business 
Occupation travelers (%) travelers (%) 
Professional/technical 36 39 
Manager/executive 23 48 
Retired 14 3 
Student 7 3 
Other 20 i 
Total 100 100 


Explain why we can’t use a chi-square test to learn 
whether these two distributions differ significantly. 


Section 11.2 Inference for Two-Way Tables 


36. Going Nuts The UR Nuts Company sells Deluxe 


37. 


and Premium nut mixes, both of which contain only 
cashews, brazil nuts, almonds, and peanuts. ‘The 
Premium nuts are much more expensive than the 
Deluxe nuts. A consumer group suspects that the two 
nut mixes are really the same. To find out, the group 
took separate random samples of 20 pounds of each 
nut mix and recorded the weights of each type of nut 
in the sample. Here are the data:!® 


Type of mix 
Type of nut Premium Deluxe 
Cashew 6 Ib 5 Ib 
Brazil nut 3 Ib 4 lb 
Almond 5 Ib 6 Ib 
Peanut 6 |b 5 Ib 


Explain why we can’t use a chi-square test to determine 
whether these two distributions differ significantly. 


How to quit smoking It’s hard for smokers to quit. 
Perhaps prescribing a drug to fight depression will work 
as well as the usual nicotine patch. Perhaps combining 
the patch and the drug will work better than either 
treatment alone. Here are data from a randomized, 
double-blind trial that compared four treatments.!? A 
“success” means that the subject did not smoke for a 
year following the beginning of the study. 


Group ‘Treatment Subjects Successes 
1 Nicotine patch 244 40 
2 Drug 244 74 
3 Patch plus drug 245 87 
4 Placebo 160 25 


Summarize these data in a two-way table. Then 
compare the success rates for the four treatments. 


Explain in words what the null hypothesis Ho: pj 
p2 = p3 = p4 says about subjects’ smoking habits. 


Do the data provide convincing evidence of a differ- 
ence in the effectiveness of the four treatments at the 
a = 0.05 significance level? 


Preventing strokes Aspirin prevents blood from 
clotting and so helps prevent strokes. The Second 
European Stroke Prevention Study asked whether 
adding another anticlotting drug named dipyridamole 
would be more effective for patients who had already 
had a stroke. Here are the data on strokes during the 
two years of the study:”° 


Group ‘Treatment 


1 


2 
3 
4 


Number of patients Number of strokes 


Placebo 1649 250 
Aspirin 1649 206 
Dipyridamole 1654 211 
Both 1650 Wey 
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Summarize these data in a two-way table. Then 
compare the stroke rates for the four treatments. 


Explain in words what the null hypothesis 
Ho: p) = p2 = p3 = p4 says about the incidence of 
strokes. 


Do the data provide convincing evidence of a differ- 
ence in the effectiveness of the four treatments at the 
a = 0.05 significance level? 


How to quit smoking Perform a follow-up analysis of 
the test in Exercise 37 by finding the individual compo- 
nents of the chi-square statistic. Which cell(s) contrib- 
uted most to the final result and in what direction? 


Preventing strokes Perform a follow-up analysis of the 
test in Exercise 38 by finding the individual compo- 
nents of the chi-square statistic. Which cell(s) contrib- 
uted most to the final result and in what direction? 


Attitudes toward recycled products Some people be- 
lieve recycled products are lower in quality than other 
products, a fact that makes recycling less practical. 


Here are data on attitudes toward coffee filters made of 


recycled paper from a random sample of adults:”! 


Recycled coffee filter status 


Quality rating Buyers Nonbuyers 
Higher 20 29 
Same It 25) 
Lower 9 43 


Make a well-labeled bar graph that compares buy- 
ers’ and nonbuyers’ opinions about recycled filters. 
Describe what you see. 


Is astrology scientific? ‘The General Social Survey 
asked a random sample of adults their opinion about 
whether astrology is very scientific, sort of scientific, 
or not at all scientific. Here is a two-way table of 
counts for people in the sample who had three levels 
of higher education: 


Degree Held 


Associate’s _ Bachelor’s Master’s 


Not at all scientific 169 256 114 
Very or sort of scientific 65 65 18 


13) 


Make a well-labeled bar graph that compares opin- 
ions about astrology for the three education catego- 
ties. Describe what you see. 


Attitudes toward recycled products Refer to 
Exercise 41. 


State appropriate hypotheses for performing a chi- 
square test of independence in this setting. 


INFERENCE FOR DISTRIBUTIONS OF CATEGORICAL DATA 


(b) Compute the expected counts assuming that Hp is 
true. Show your work. 


(c) Calculate the chi-square statistic, df, and P-value. 
(d) What conclusion would you draw? 
44. Is astrology scientific? Refer to Exercise 42. 


(a) State appropriate hypotheses for performing a chi- 
square test of independence in this setting. 


(b) Compute the expected counts assuming that Hy is 
true. Show your work. 


(c) Calculate the chi-square statistic, df, and P-value. 
(d) What conclusion would you draw? 


Regulating guns ‘The National Gun Policy Survey 


a9) 
715 asked a random sample of adults, “Do you think 
& there should be a law that would ban possession of 


handguns except for the police and other authorized 
persons?” Here are the responses, broken down by 
the respondent's level of education:”* 


Education 
Less than Highschool Some College Postgrad 
high school grad college grad degree 
Yes 58 84 169 98 77 
No 58 129 294 eo 99 


Does the sample provide convincing evidence 
of an association between education level and 
opinion about a handgun ban in the adult 
population? 


46. Market research Before bringing a new product to 


market, firms carry out extensive studies to learn how 
consumers react to the product and how best to ad- 
vertise its advantages. Here are data from a study of a 
new laundry detergent.”* The participants are a ran- 
dom sample of people who don’t currently use the 
established brand that the new product will compete 
with. Give subjects free samples of both detergents. 
After they have tried both for a while, ask which they 
prefer. The answers may depend on other facts about 
how people do laundry. 


Laundry Practices 


Soft water, Soft water, Hard water, Hard water, 


warm wash hotwash warmwash hot wash 
Prefer 
standard 
product 8) 27 42 30 
Prefer new 
product 63 29 68 42 


Does the sample provide convincing evidence of an 
association between laundry practices and product 
preference in the population of interest? 


47. Where do young adults live? A survey by the 
me] 718 National Institutes of Health asked a random sam- 
& ple of young adults (aged 19 to 25 years), “Where 
do you live now? That is, where do you stay most 
often?” Here is the full two-way table (omitting a 
few who refused to answer and one who claimed to 


be homeless): 


Female Male 
Parents’ home 923 986 
Another person’s home 144 eZ 
Own place 1294 1129 
Group quarters 127 119 


(a) Should we use a chi-square test for homogeneity or 
a chi-square test for independence in this setting? 
Justify your answer. 


(b) State appropriate hypotheses for performing the type 
of test you chose in part (a). 


Minitab output from a chi-square test is shown below. 


Chi-Square Test: Female, Male 

Expected counts are printed below observed 
counts 

Chi-Square contributions are printed below 
expected counts 


Section 11.2 Inference for Two-Way Tables 


The answer may differ for different groups of stu- 
dents. Here are results for separate random samples 
of American and Asian students at a large mid- 
western university:7° 


American Asian 


Save time 29 10 
Easy 28 11 
Low price 17 34 
Live far from stores 11 4 
No pressure to buy 10 3 


(a) Should we use a chi-square test for homogeneity or 
a chi-square test for independence in this setting? 
Justify your answer. 


(b) State appropriate hypotheses for performing the type 
of test you chose in part (a). 


Minitab output from a chi-square test is shown below. 


Chi-Square Test: American, Asian 

Expected counts are printed below observed 
counts 

Chi-Square contributions are printed below 
expected counts 


Female Male Total 

al 923 986 1909 
978.49 eye}(0) Sil 
3.147 22309 

2 144 2} 276 
141.47 alc y-Susyc) 
0.045 0.048 

3 1294 Jala) 2423 
2 AROS) 1181.05 
2.181 2.294 

4 ey is 246 
126.09 Doo 
0.007 0.007 

Total 2488 2366 4854 

Chi Sq. —sii0s'8)) DEH 3, P-Value = 0.012 


(c) Check that the conditions for carrying out the test 


are met. 


(d) Interpret the P-value in context. What conclusion 


would you draw? 


48. Students and catalog shopping What is the most 
important reason that students buy from catalogs? 


American Asian Total 

al 29 10 39 
23.60 15.40 
i2o6 1.894 

2 28 ae a9 
23.60 15.40 
OR 2all L258 

3 ley 34 Bal, 
30.86 20.14 
5,225 9.5318 

4 aL al. 4 a5) 
9.08 Sel! 
0.408 0.625 

iS) aL{e) 3 13 
V.67 5.3 
C2579 0.887 

Total 95 62 ibis; 7/ 


Chi-Sq = 23.470, DF = 4, P-Value = 0.0001 


(c) Check that the conditions for carrying out the test 
are met. 

(d) Interpret the P-value in context. What conclusion 
would you draw? 

49. Treating ulcers Gastric freezing was once a 


recommended treatment for ulcers in the upper 
intestine. Use of gastric freezing stopped after 
experiments showed it had no effect. One random- 
ized comparative experiment found that 28 of the 
82 gastric-freezing patients improved, while 30 of 
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the 78 patients in the placebo group improved.”’ 
We can test the hypothesis of “no difference” in the 
effectiveness of the treatments in two ways: with a 
two-sample z test or with a chi-square test. 


Minitab output for a chi-square test is shown below. 
State appropriate hypotheses and interpret the P- 
value in context. What conclusion would you draw? 


Placebo 


Expected counts are printed below observed 
counts 

Chi-Square contributions are printed below 
expected counts 


Gastric freezing Placebo Total 
al 28 30 58 
2.13 Als) o Ad 
0.100 4 LOS 
2 54 48 102 
BA 6a 7 49.73 
O.057/ 0.060 
Total 82 78 160 
Chi-Sq = 0.322, DF = 1, P-Value = 0.570 


(b) Minitab output for a two-sample z test is shown 


50. 


below. Explain how these results are consistent with 
the test in part (a). 


Test for Two Proportions 


Sample xX N Sample p 

ob 28 82 0.341463 

2 30 73 0.384615 
Difference = p (1) — p (2) 

Estimate for difference: —0.0431520 
Test for difference = 0 (vs not = 0): 
Z = —0.57 P-Value = 0.570 


Opinions about the death penalty ‘The General 
Social Survey asked separate random samples of 
people with only a high school degree and people 
with a bachelor’s degree, “Do you favor or oppose 
the death penalty for persons convicted of murder?” 
The following table gives the responses of people 
whose highest education was a high school degree 
and of people with a bachelor’s degree: 


Highest education level 


High school Bachelor’s degree 
Favor 1010 319 
Oppose 369 185 


We can test the hypothesis of “no difference” in 
support for the death penalty among people in these 
educational categories in two ways: with a two- 
sample z test or with a chi-square test. 

Minitab output for a chi-square test is shown below. 
State appropriate hypotheses and interpret the P- 
value in context. What conclusion would you draw? 
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Chi-Square Test: Cl, C2 

Expected counts are printed below 
observed counts 

Chi-Square contributions are printed be- 
low expected counts 


il €2 Total 

il 1010 319 1329 
O73 28 SES. 72 
IL, Vas 3 7S 

2 369 185 554 
405.72 148.28 
ce | 9092 

Total 1379 504 1883 


Chi-Sq = 17.590, DF = 1, P-Value = 0.000 
Minitab output for a two-sample z test is shown 
below. Explain how these results are consistent with 
the test in part (a). 


Test for Two Proportions 


Sample xX N Sample p 

ilk 1010 i379 0.732415 

2 SS) 504 Cries. o3aa 
Difference = p (1) — p (2) 

Estimate for difference: 0.0994783 

Test for difference = 0 (vs not = QO): 


Z — 4,19 P-Value = 0.1000 


Multiple choice: Select the best answer for Exercises 51 
to 56. 

Exercises 51 to 55 refer to the following setting. The 
National Longitudinal Study of Adolescent Health inter- 
viewed a random sample of 4877 teens (grades 7 to 12). 
One question asked was “What do you think are the 
chances you will be married in the next ten years?” Here 
is a two-way table of the responses by gender:** 


oul. 


Female Male 
Almost no chance 119 103 
Some chance, but probably not 150 171 
A 50-50 chance 447 512 
A good chance 735 710 
Almost certain 1174 756 


Which of the following would be the most appropri- 
ate type of graph for these data? 


A bar chart showing the marginal distribution of 
opinion about marriage 


A bar chart showing the marginal distribution of gender 


A bar chart showing the conditional distribution of 
gender for each opinion about marriage 


A bar chart showing the conditional distribution of 
opinion about marriage for each gender 


Dotplots that display the number in each opinion 
category for each gender 


52. 


The appropriate null hypothesis for performing a 
chi-square test is that 


equal proportions of female and male teenagers are 
almost certain they will be married in 10 years. 


there is no difference between the distributions of 
female and male teenagers’ opinions about mar- 
riage in this sample. 


there is no difference between the distributions of 
female and male teenagers’ opinions about mar- 
riage in the population. 


there is no association between gender and opinion 
about marriage in the sample. 


there is no association between gender and opinion 
about marriage in the population. 


. The expected count of females who respond “al- 


most certain” is 
AASa le (c) 965. (e) 
DAD. (d) 1038.8. 


The degrees of freedom for the chi-square test for 
this two-way table are 


4, (c) 10. (e) 
8) (d) 20. 


IA. 


4876. 


For these data, x” = 69.8 with a P-value of approxi- 
mately 0. Assuming that the researchers used 

a significance level of 0.05, which of the following 
is true? 


A Type I error is possible. 
A Type I] error is possible. 
Both a Type I and a ‘Type II error are possible. 


There is no chance of making a Type I or Type I 
error because the P-value is approximately 0. 


There is no chance of making a Type I or Type I 
error because the calculations are correct. 


When analyzing survey results from a two-way 
table, the main distinction between a test for inde- 
pendence and a test for homogeneity is 


how the degrees of freedom are calculated. 
how the expected counts are calculated. 
the number of samples obtained. 

the number of rows in the two-way table. 


the number of columns in the two-way table. 
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For Exercises 57 and 58, you may find the inference sum- 
mary chart inside the back cover helpful. 


57. Inference recap (8.1 to 11.2) In each of the follow- 

»> ing settings, state which inference procedure from 

€ Chapter 8, 9, 10, or 11 you would use. Be specific. 
For example, you might say “two-sample < test for 
the difference between two proportions.” You do not 
need to carry out any procedures.”” 


(a) What is the average voter turnout during an election? 
A random sample of 38 cities was asked to report the 
percent of registered voters who actually voted in the 
most recent election. 


(b) Are blondes more likely to have a boyfriend than 
the rest of the single world? Independent random 
samples of 300 blondes and 300 nonblondes were 
asked whether they have a boyfriend. 


58. Inference recap (8.1 to 11.2) In each of the follow- 

»> ing settings, state which inference procedure from 

a Chapter 8, 9, 10, or 11 you would use. Be specific. 
For example, you might say “two-sample < test for 
the difference between two proportions.” You do not 
need to carry out any procedures.” 


(a) Is there a relationship between attendance at religious 
services and alcohol consumption? A random sample 
of 1000 adults was asked whether they regularly attend 
religious services and whether they drink alcohol daily. 


(b) Separate random samples of 75 college students 
and 75 high school students were asked how much 
time, on average, they spend watching television 
each week. We want to estimate the difference in the 
average amount of l'V watched by high school and 
college students. 


Exercises 59 to 60 refer to the following setting. For their fi- 
nal project, a group of AP® Statistics students investigated 
the following question: “Will changing the rating scale on 
a survey affect how people answer the question?” ‘To find 
out, the group took an SRS of 50 students from an alpha- 
betical roster of the school’s just over 1000 students. ‘The 
first 22 students chosen were asked to rate the cafeteria 
food ona scale of | (terrible) to 5 (excellent). ‘The remain- 
ing 28 students were asked to rate the cafeteria food on a 
scale of () (terrible) to 4 (excellent). Here are the data: 


1 to 5 scale 
Rating 1 2 8 4 5 
Frequency 2 e8 1 8 € 

0 to 4 scale 
Rating Oo | 2. 3 al 


Frequency 0 0 2 is 
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59. Design and analysis (4.2) 


J (a) Was this an observational study or an experiment? 
Justify your answer. 

(b) Explain why it would not be appropriate to perform a 

chi-square test in this setting. 

60. Average ratings (1.3, 10.2) The students decided to 
a, compare the average ratings of the cafeteria food on 
. the two scales. 

(a) Find the mean and standard deviation of the ratings 

for the students who were given the 1-to-5 scale. 


(b) For the students who were given the 0-to- scale, the 
ratings have a mean of 3.2] and a standard deviation of 
0.568. Since the scales differ by one point, the group 
decided to add | to each of these ratings. What are the 


mean and standard deviation of the adjusted ratings? 

(c) Would it be appropriate to compare the means from 
parts (a) and (b) using a two-sample ¢ test? Justify 
your answer. 


Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


‘Two statistics students wanted to know if including ad- 
ditional information in a survey question would change 
the distribution of responses. To find out, they randomly 
selected 30 teenagers and asked them one of the follow- 
ing two questions. Fifteen of the teenagers were randomly 
assigned to answer Question A, and the other 15 students 
were assigned to answer Question B. 


A: When choosing a college, how important is a good 
athletic program: very important, important, somewhat 
important, not that important, or not important at all? 


B: It’s sad that some people choose a college based on 
its athletic program. When choosing a college, how im- 
portant is a good athletic program: very important, im- 
portant, somewhat important, not that important, or not 
important at all? 


The table below summarizes the responses to both ques- 
tions. For these data, the chi-square test statistic is x? = 6.12. 


QuestionA QuestionB Total 
Very important 
Important 
Somewhat important 
Not that important 
Not important at all 
Total 


(a) State the hypotheses that the students are interest- 
ed in testing. 


(b) Describe a Type I error and a Type II error in the 
context of the hypotheses stated in part (a). 


(c) For these data, explain why it would not be appro- 
priate to use a chi-square distribution to calculate 
the P-value. 


(d) ‘To estimate the P-value, 100 trials of a simulation 
were conducted, assuming that the additional in- 
formation didn’t have an effect on the response to 
the question. In each trial of the simulation, the 
value of the chi-square statistic was calculated. 
These simulated chi-square statistics are displayed 
in the dotplot below. 


Neaccccnea vers 


Simulated chi-square statistic 


Based on the results of the simulation, what 
conclusion would you make about the hypotheses 
stated in part (a)? 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 
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Section 11.1: Chi-Square Tests for Goodness of Fit 


In this section, you learned the details for performing a chi- 
square test for goodness of fit. The null hypothesis is that 
a single categorical variable follows a specified distribution. 
The alternative hypothesis is that the variable does not follow 
the specified distribution. 

The chi-square statistic measures the difference between 
the observed distribution of a categorical variable and its hy- 
pothesized distribution. ‘To calculate the chi-square statistic, 
use the following formula that involves the observed and ex- 
pected counts for each value of the categorical variable: 


(Observed — Expected)? 
Expected 


= > 


To calculate the expected counts, multiply the total 
sample size by the hypothesized proportion for each cat- 
egory. Larger values of the chi-square statistic provide more 
convincing evidence that the categorical variable does not 
have the hypothesized distribution. 

When the Random, 10%, and Large Counts conditions 
are satisfied, we can accurately model the sampling distribu- 
tion of a chi-square statistic using a chi-square distribution 
(density curve). The Random condition says that the data 
are from a well-designed random sample or a randomized 
experiment. The 10% condition says that the sample size 
should be at most 10% of the population size when sampling 
without replacement. The Large Counts condition says that 
the expected counts for each category must be at least 5. In 
a test for goodness of fit, use a chi-square distribution with 
degrees of freedom = number of categories — 1. 

When the results of a test for goodness of fit are signifi- 
cant, consider doing a follow-up analysis. Identify which 
categories of the variable had the largest contributions to 
the chi-square statistic and whether the observed values in 
those categories were larger or smaller than expected. 


Section 11.2: Inference for Two-Way Tables 


In this section, you learned how to perform inference for cat- 
egorical data that are summarized in a two-way table. To be- 
gin the analysis, compare the conditional distributions of the 
response variable for each value of the explanatory variable. 
Displaying these distributions with a bar graph will help you 


make an effective comparison. 


There are two types of chi-square tests that could apply 
when data are summarized in a two-way table. A test for 
homogeneity compares the distribution of a single categori- 
cal variable for two or more populations or treatments. A 
test for independence looks for an association between two 
categorical variables in a single population. 

In a chi-square test for homogeneity, the null hypoth- 
esis is that there is no difference between the true distribu- 
tions of a categorical variable for two or more populations 
or treatments. ‘The alternative hypothesis is that there is a 
difference in the distributions. ‘The Random condition is 
that the data come from independent random samples or 
groups in a randomized experiment. The 10% condition 
applies when sampling without replacement, but not in 
experiments. Finally, the Large Counts condition remains 
the same—the expected counts must be at least 5 in each 
cell of the two-way table. 

To calculate the expected counts for a test for homoge- 
neity, use the following formula: 


row total - column total 
table total 


expected count = 


To calculate the P-value, compute the chi-square statistic 
and use a chi-square distribution with degrees of freedom = 
(number of rows — 1)(number of columns — 1). 

In a chi-square test for independence, the null hypoth- 
esis is that there is no association between two categorical 
variables in one population. The alternative hypothesis 
is that there is an association between the two variables. 
For this test, the Random condition says that the data must 
come from a single random sample. The 10% condition 
applies when sampling without replacement. The Large 
Counts condition is still the same—the expected counts 
must all be at least 5. The method for calculating expected 
counts, the chi-square statistic, the degrees of freedom, and 
the P-value are exactly the same in a test for independence 
and a test for homogeneity. 

As with tests for goodness of fit, when the results of a test 
for homogeneity or independence are significant, consider 
doing a follow-up analysis. Identify which cells in the two- 
way table had the largest contributions to the chi-square 
statistic and whether the observed values in those cells were 
larger or smaller than expected. 
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chi-square test based on data in a two-way table. We 701, 713 R11.4,R11.5 
Calculate the chi-square statistic, degrees of freedom, and P-value 

for a chi-square test based on data in a two-way table. WZ 704 Rie oeRdales 
Perform a chi-square test for homogeneity. We 708 R11.3 
Perform a chi-square test for independence. lee als R11.5 
Choose the appropriate chi-square test. WZ 718 R11.4 


Chapter 11 Chapter Review Exercises 


These exercises are designed to help you review the impor- 
tant ideas and methods of the chapter. 


R11.1 Testing a genetic model Biologists wish to cross 


pairs of tobacco plants having genetic makeup Gg, 
indicating that each plant has one dominant gene 


(G) and one recessive gene (g) for color. Each 


offspring plant will receive one gene for color from 
each parent. ‘The Punnett square below shows the 


possible combinations of genes received by the 


offspring: 
Parent 2 passes on: 
G g 
Parent 1 passes on: G GG Gg 
g Gg gg 


The Punnett square suggests that the expected 


ratio of green (GG) to yellow-green (Gg) to albino 


(gg) tobacco plants should be 1:2:1. In other 
words, the biologists predict that 25% of the off- 


R11.2 


spring will be green, 50% will be yellow-green, and 
25% will be albino. To test their hypothesis about 
the distribution of offspring, the biologists mate 

84 randomly selected pairs of yellow-green parent 
plants. Of 84 offspring, 23 plants were green, 50 
were yellow-green, and 11 were albino. Do the 
data provide convincing evidence at the a = 0.01 
level that the true distribution of offspring is differ- 
ent from what the biologists predict? 


Sorry, no chi-square We would prefer to learn 
from teachers who know their subject. Perhaps 
even preschool children are affected by how 
knowledgeable they think teachers are. Assign 48 
three- and four-year-olds at random to be taught 
the name of a new toy by either an adult who 
claims to know about the toy or an adult who 
claims not to know about it. Then ask the children 
to pick out a picture of the new toy in a set of pic- 
tures of other toys and say its name. ‘he response 
variable is the count of right answers in four tries. 
Here are the data:*! 


Correct Answers 


Knowledgeable teacher | 6 3 9 
Ignorant teacher 20 O 3 0 il 


The researchers report that children taught by the 
teacher who claimed to be knowledgeable did sig- 
nificantly better (v7 = 20.24, P < 0.05). Explain 
why this result isn’t valid. 


R11.3 Stress and heart attacks You read a newspaper 


article that describes a study of whether stress man- 
agement can help reduce heart attacks. The 107 
subjects all had reduced blood flow to the heart 
and so were at risk of a heart attack. They were as- 
signed at random to three groups. The article goes 
on to say: 


One group took a four-month stress manage- 
ment program, another underwent a four- 
month exercise program, and the third received 
usual heart care from their personal physicians. 
In the next three years, only three of the 33 
people in the stress management group suffered 
“cardiac events,” defined as a fatal or non-fatal 
heart attack or a surgical procedure such as a 
bypass or angioplasty. In the same period, 7 of 
the 34 people in the exercise group and 12 out 
of the 40 patients in usual care suffered such 
events. 7 


(a) Use the information in the news article to make a 
two-way table that describes the study results. 

(b) Compare the success rates of the three treatments 
in preventing cardiac events. 

(c) Do the data provide convincing evidence that the 
true success rates are not the same for the three 
treatments? 


R11.4 Sexy magazine ads? Researchers looked at 1509 


full-page ads that show a model. The two-way table 
below shows the main audience of the magazines 
in which the ads were found (young men, young 
women, or young adults in general) and whether 
or not the ad was “sexual.” This was determined 


based on how the model was dressed (or not 
dressed).*? 


Readers 
Men Women General 
Sexual 105 225 66 
Not sexual 514 351 248 


The following figure displays Minitab output for a 
chi-square test using these data. 


Sex 105 225 66 396 
16.96 39.06 21.02 26.24 

162.4 151.2 82.4 396.0 

20.312 36.074 3.265 be 

notsexy 514 351 248 1113 
83.04 60.94 78.98 73.76 

456.6 424.8 231.6 1113.0 

7.227 12.835 1.162 » 

All 619 576 314 1509 
100.00 100.00 100.00 100.00 

619.0 576.0 314.0 1509.0 


Cell Contents: Count 


% of Column 
Expected count 
Contribution to Chi-square 


Chi-Square = 80.874, DP = 2, P-Value = 0.00 


«) 


] sa 


(a) Describe how these data could have been col- 
lected so that a test for homogeneity is appropriate. 

(b) Describe how these data could have been collect- 
ed so that a test for independence is appropriate. 

(c) Show how each of the numbers 60.94, 424.8, and 
12.835 was obtained for the “notsexy, Women” cell. 

(d) Which cell contributes most to the chi-square sta- 
tistic? How do the observed and expected counts 
compare for this cell? 


R11.5 Popular kids Who were the popular kids at your 


elementary school? Did they get good grades or 
have good looks? Were they good at sports? A study 
was performed to examine the factors that deter- 
mine social status for children in grades 4, 5, and 
6. Researchers administered a questionnaire to a 
random sample of +78 students in these grades. 
One of the questions they asked was “What would 
you most like to do at school: make good grades, 
be good at sports, or be popular?” ‘The two-way 


table below summarizes the students’ responses. * 


Gender 
Goal Female Male 
Grades 130 117 
Popular 91 50 
Sports 30 60 


(a) Construct an appropriate graph to compare male 
and female responses. Write a few sentences 
describing the relationship between gender and 
goals. 

(b) Is there convincing evidence at the a = 0.05 level 
of an association between gender and goals for 
elementary school students? 
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Chapter 11 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


T11.1 A chi-square test is used to test whether a 0 to 9 spinner 
is “fair” (that is, the outcomes are all equally likely). 
The spinner is spun 100 times, and the results are 
recorded. ‘The degrees of freedom for the test will be 


(a) 8. (b) 9. (c) 10. (d) 99. (e) None of these. 


Exercises T11.2 and T11.3 refer to the following setting. Re- 
cent revenue shortfalls in a midwestern state led to a reduc- 
tion in the state budget for higher education. To offset the 
reduction, the largest state university proposed a 25% tuition 
increase. It was determined that such an increase was need- 
ed simply to compensate for the lost support from the state. 
Separate random samples of 50 freshmen, 50 sophomores, 
50 juniors, and 50 seniors from the university were asked 
whether they were strongly opposed to the increase, given 
that it was the minimum increase necessary to maintain the 
university’s budget at current levels. Here are the results. 


Year 
Strongly 
Opposed? Freshman Sophomore Junior Senior 
Yes 39 36 29 18 
No i 14 21 oy 


T11.2 Which hypotheses would be appropriate for per- 
forming a chi-square test? 

(a) The null hypothesis is that the closer students get to 
graduation, the less likely they are to be opposed to 
tuition increases. The alternative is that how close 
students are to graduation makes no difference in 
their opinion. 

(b) ‘The null hypothesis is that the mean number of stu- 
dents who are strongly opposed is the same for each 
of the 4 years. The alternative is that the mean is dif 
ferent for at least 2 of the 4 years. 

(c) The null hypothesis is that the distribution of student 
opinion about the proposed tuition increase is the 
same for each of the + years at this university. ‘The 
alternative is that the distribution is different for at 
least 2 of the 4 years. 

(d) ‘The null hypothesis is that year in school and student 
opinion about the tuition increase in the sample are 
independent. The alternative is that these variables 
are dependent. 

(e) The null hypothesis is that there is an association be- 
tween year in school and opinion about the tuition 
increase at this university. The alternative hypothesis 
is that these variables are not associated. 


T11.3 The conditions for carrying out the chi-square test 
in exercise I'11.2 are 
I. Independent random samples from the populations 
of interest. 


Il. All expected counts are at least 5. 
Il. The population sizes are at least 10 times the sample 
sizes. 
Which of the conditions is (are) satisfied in this case? 
(c) Land II only (e) I, Il, and Ul 
(d) Il and III only 


(a) I only 
(b) II only 


Exercises T11.4 to T11.6 refer to the following setting. A ran- 
dom sample of traffic tickets given to motorists in a large city 
is examined. The tickets are classified according to the race of 
the driver. The results are summarized in the following table. 


Race: White Black Hispanic Other 
Number of tickets: 69 52 18 9 


The proportion of this city’s population in each of the racial 
categories listed above is as follows: 


Race: White Black Hispanic Other 
Proportion: 0.55 0.30 0.08 0.07 


We wish to test Ho: The racial distribution of traffic tickets in the 

city is the same as the racial distribution of the city’s population. 

T11.4 Assuming H) is true, the expected number of His- 
panic drivers who would receive a ticket is 


(a) 8. (b) 10.36. (c) ll. (d) 11.84. (e) 12. 


T11.5 We compute the value of the y’ statistic to be 6.58. 
Assuming that the conditions for inference are met, 
the P-value of our test is 


(a) greater than 0.20. 
(b) between 0.10 and 0.20. (e) less than 0.01. 
) between 0.05 and 0.10. 


( 
T11.6 The category that contributes the largest compo- 
nent to the y’ statistic is 
(a) White. (c) Hispanic. 
(b) Black. (d) Other. 
(e) ‘The answer cannot be determined because this is 
only a sample. 


Exercises T11.7 to T 11.10 refer to the following setting. All current- 
carrying wires produce electromagnetic (EM) radiation, includ- 
ing the electrical wiring running into, through, and out of our 
homes. High-frequency EM radiation is thought to be a cause 
of cancer. he lower frequencies associated with household cur- 
rent are generally assumed to be harmless. To investigate the 
relationship between current configuration and type of cancer, 
researchers visited the addresses of a random sample of children 
who had died of some form of cancer (leukemia, lymphoma, or 
some other type) and classified the wiring configuration outside 
the dwelling as either a high-current configuration (HCC) or a 
low-current configuration (LCC). Here are the data: 


(d) between 0.01 and 0.05. 


Cc 


Leukemia Lymphoma Other cancers 
HCC 52 10 17 
LCC 84 21 31 


Computer software was used to analyze the data. The output 
included the value x7 = 0.435. 
T11.7 The appropriate degrees of freedom for the y’ statistic is 

(a) 1. (b) 2. (c)i 3: (d) 4. (e) 5. 

T11.8 The expected count of cases with lymphoma in 
homes with an HCC is 
TSK | 10321 903i 
ia Oa arg 

(e) None of these. 

T11.9 Which of the following may we conclude, based on 
the test results? 

(a) There is convincing evidence of an association be- 
tween wiring configuration and the chance that a 
child will develop some form of cancer. 

(b) HCC either causes cancer directly or is a major contrib- 
uting factor to the development of cancer in children. 


1346) = 31 
2b 


(d) 
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(c) Leukemia is the most common type of cancer 
among children. 

(d) There is not convincing evidence of an association 
between wiring configuration and the type of can- 
cer that caused the deaths of children in the study. 

(e) There is convincing evidence that HCC does not 
cause cancer in children. 


T11.10 A Type I error would occur if we found convincing 
evidence that 

(a) HCC wiring caused cancer when it actually didn’t. 

(b) HCC wiring didn’t cause cancer when it actually 
did. 

(c) there is no association between the type of wiring 
and the form of cancer when there actually is an 
association. 

(d) there is an association between the type of wiring 
and the form of cancer when there actually is no 
association. 

(e) the type of wiring and the form of cancer have a 
positive correlation when they actually don’t. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T11.11 A large distributor of gasoline claims that 60% of 
all cars stopping at their service stations choose 
regular unleaded gas and that premium and 
supreme are each selected 20% of the time. ‘To 
investigate this claim, researchers collected data 
from a random sample of drivers who put gas in 
their vehicles at the distributor’s service stations in 
a large city. The results were as follows: 


Gasoline Selected 


Regular Premium Supreme 
261 51 88 


Carry out a test of the distributor’s claim at the 5% 
significance level. 


T11.12 A study conducted in Charlotte, North Carolina, 
tested the effectiveness of three police responses 
to spouse abuse: (1) advise and possibly separate 
the couple, (2) issue a citation to the offender, 
and (3) arrest the offender. Police officers were 
trained to recognize eligible cases. When 
presented with an eligible case, a police officer 
called the dispatcher, who would randomly as- 
sign one of the three available treatments to be 
administered. ‘There were a total of 650 cases in 
the study. Each case was classified according to 


whether the abuser was subsequently arrested 
within six months of the original incident.” 


(a) Explain the purpose of the random assignment in 
the design of this study. 

(b) Construct a well-labeled graph that is suitable for 
comparing the effectiveness of the three treatments. 

(c) State an appropriate pair of hypotheses for perform- 
ing a chi-square test in this setting. 

(d) Assume that all the conditions for performing the test 
in part (b) are met. The test yields x7 = 5.063 anda 
P-value of 0.0796. Interpret this P-value in context. 
What conclusion should we draw from the study? 


T11.13 In the United States, there is a strong relationship 
between education and smoking: well-educated 
people are less likely to smoke. Does a similar re- 
lationship hold in France? To find out, research- 
ers recorded the level of education and smoking 
status of a random sample of 459 French men 
aged 20 to 60 years.*° The two-way table below 
displays the data. 


Treatment 
Advise and 
Subsequent arrest? separate Citation Arrest 
No 187 181 175 
Yes 25 43 39 


Education 


Smoking Status Primary School Secondary School University 


Nonsmoker 56 On 53 
Former 54 43 28 
Moderate 41 Pf 36 
Heavy 36 32 16 


Is there convincing evidence of an association be- 
tween smoking status and educational level among 
French men aged 20 to 60 years? 
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the PGA Tour? 


Do Longer Drives Mean Lower Scores on 


Recent advances in technology have led to golf balls that fly farther, clubs that generate more speed at 
impact, and swings that have been perfected through computer video analysis. Moreover, today’s profes- 
sional golfers are fitter than ever. The net result is many more players who routinely hit drives traveling 
300 yards or more. Does greater distance off the tee translate to better (lower) scores? 


We collected data on mean drive distance (in 
yards) and mean score per round from an SRS of 
19 of the 197 players on the Professional Golfers As- 
sociation (PGA) Tour in a recent year. Figure 12.1 
is a scatterplot of the data with results from a least- 
squares regression analysis added. The graph shows 
that there is a moderately weak negative linear re- 
lationship between mean drive distance and mean 
score for the 19 players in the sample.! 


Predicted mean score = 76.90 — 0.02016(mean distance) 
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FIGURE 12.1 Scatterplot and least-squares regression line 
of mean score versus mean drive distance for a random 
sample of 19 players on the PGA Tour. 
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It is conventional to refer to a 
scatterplot of the points (x, y) as a 
graph of y versus x. So a scatterplot 
of life span versus temperature uses 
life span as the response variable 
and temperature as the explanatory 
variable. 


ACTIVITY 


MATERIALS: 


50 copies of the helicopter 
template from the Teacher’s 
Resource Materials, scissors, 
tape measures, stopwatches 


Introduction 


When a scatterplot shows a linear relationship between a quantitative explanatory 
variable x and a quantitative response variable y, we can use the least-squares line 
calculated from the data to predict y for a given value of x. If the data are a random 
sample from a larger population, we need statistical inference to answer questions 
like these: 


e Is there really a linear relationship between x and y in the population, or could 
the pattern we see in the scatterplot plausibly happen just by chance? 


e In the population, how much will the predicted value of y change for each 
increase of | unit in x? What’s the margin of error for this estimate? 


If the data come from a randomized experiment, the values of the explanatory 
variable correspond to the levels of some factor that is being manipulated by the 
researchers. For instance, researchers might want to investigate how temperature 
affects the life span of mosquitoes. They could set up several tanks at each of several 
different temperatures and then randomly assign hundreds of mosquitoes to each of 
the tanks. The response variable of interest is the average time (in days) from hatch- 
ing to death. Suppose that a scatterplot of average life span versus temperature has 
a linear form. We need statistical inference to decide if it’s plausible that there is no 
linear relationship between the variables, and that the pattern observed in the graph 
is due simply to the chance involved in the random assignment. 

In Section 12.1, we will show you how to estimate and test claims about the 
slope of the population (true) regression line that describes the relationship be- 
tween two quantitative variables. The following Activity gives you a preview of 
inference for linear regression. 


The Helicopter Experiment 


Is there a linear relationship between the height from which a paper helicopter 
is released and the time it takes to hit the ground? In this Activity, your class will 
perform an experiment to investigate this question.’ 


1. Follow the directions provided with the template to construct 50 long-rotor 
helicopters. 


2. Randomly assign 10 helicopters to each of five different drop heights. (‘The 
experiment works best for drop heights of 5 feet or more.) 


3. Work in teams to release the helicopters from their assigned drop heights and 
record the descent times. 


4. Make a scatterplot of the data in Step 3. Find the least-squares regression line 
for predicting descent time from drop height. 


5. Interpret the slope of the regression line from Step 4 in context. What is your 
best guess for the increase in descent time for each additional foot of drop height? 


6. Does it seem plausible that there is really no linear relationship between de- 
scent time and drop height and that the observed slope happened just by chance 
due to the random assignment? Discuss this as a class. 


WHAT YOU WILL LEARN 
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Sometimes a scatterplot reveals that the relationship between two quantitative 
variables has a strong curved form. One strategy is to transform one or both vari- 
ables so that the graph shows a linear pattern. Then we can use least-squares 
regression to fit a linear model to the data. Section 12.2 examines methods of 
transforming data to achieve linearity. 


Inference for Linear 
Regression 


By the end of the section, you should be able to: 


e Check the conditions for performing inference about e Construct and interpret a confidence interval for the 


the slope 3 of the population (true) regression line. slope G of the population (true) regression line. 


e Interpret the values of a, b, s, SE,, and r? in context, e Perform a significance test about the slope 7 of the 
and determine these values from computer output. population (true) regression line. 


The sample regression line is 
sometimes called the estimated 
regression line. 


In Chapter 3, we examined data on eruptions of the Old Faithful geyser. Figure 12.2 
is a scatterplot of the duration and interval of time until the next eruption for all 
222 recorded eruptions in a single month. The least-squares regression line for 
this population of data has been added to the graph. Its equation is 


predicted interval = 33.97 + 10.36 (duration) 


We call this the population regression line (or true regression line) because it uses 
all the observations that month. 


100 + 


90 + 


Interval (minutes) 
I 
Oo 
| 


FIGURE 12.2 Scatterplot of the 

duration and interval between 

eruptions of Old Faithful for all 

222 eruptions in a single month. 

1 2 3 4 5 The population least-squares line 
Duration (minutes) is shown in blue. 


Suppose we take an SRS of 20 eruptions from the population and calculate 
the least-squares regression line f = a + bx for the sample data. How does the 
slope b of the sample regression line relate to the slope of the population regres- 
sion line? Figure 12.3 on the next page shows the results of taking three different 
SRSs of 20 Old Faithful eruptions in this month. Each graph displays the selected 
points and the least-squares regression line for that sample. Notice that the slopes 
of the sample regression lines (10.2, 7.7, and 9.5) vary quite a bit from the slope 
of the population regression line, 10.36. The pattern of variation in the slope b is 
described by its sampling distribution. 
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(a) Sample 1: j = 32.8 + 10.2x (b) Sample 2: p = 44.0 + 7.7x (c) Sample 3: } = 36.0 + 9.5x 


Sampling Distribution of b 


Confidence intervals and significance tests about the slope of 
the population regression line are based on the sampling dis- 
tribution of b, the slope of the sample regression line. We used 
Fathom software to simulate choosing 1000 SRSs of n = 20 
from the Old Faithful data, each time calculating the equation 
§ =a + bx of the least-squares regression line for the sample. 
Figure 12.4 displays the values of the slope b for the 1000 sam- 
ple regression lines. We have added a vertical line at 10.36 cor- 
responding to the slope of the population regression line. Let’s 
describe this approximate sampling distribution of b. 


Approximate sampling distribution 
of b (n = 20) 


tie 
vou or, 
Bs are rs Shape: We can see that the distribution of b-values is roughly 
re a  Symimenie and unimodal. Figure 12.5(a) is a Normal proba- 
bility plot of these sample regression line slopes. The strong 
linear pattern in the graph tells us that the approximate sam- 


pling distribution of b is close to Normal. 


Center: The mean of the 1000 b-values is 10.35. This value is 
quite close to the slope of the population (true) regression line, 10.36. 


° 4 wae 
6 


Slope 


FIGURE 12.4 Dotplot of the slope b of the least-squares 
regression line in 1000 simulated SRSs by Fathom software. 


Spread: The standard deviation of the 1000 b-values is 1.29. Soon, we will see 
that the standard deviation of the sampling distribution of b is actually 1.27. 

Figure 12.5(b) is a histogram of the b-values from the 1000 simulated SRSs. 
We have superimposed the density curve for a Normal distribution with mean 
10.36 and standard deviation 1.27. This curve models the approximate sampling 
distribution of the slope quite well. 
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FIGURE 12.5 (a) Normal probability plot and (b) histogram of the 1000 sample regression line 
slopes from Figure 12.4. The red density curve in Figure 12.5(b) is for a Normal distribution with 
mean 10.36 (marked by the yellow line) and standard deviation 1.27. 


Section 12.1 Inference for Linear Regression 24741 


al —e ’ Let’s do a quick recap. For all 222 eruptions of Old Faithful 

5 as 07 ease ries a4 in a single month, the population regression line is: predict- 

= ed interval = 33.97 + 10.36 (duration). We use the symbols 

5 80> a = 33.97 and @ = 10.36 to represent the y intercept and 

2 slope parameters. The standard deviation of the residuals for 
E this line is the parameter o = 6.131. 

2 Figure 12.5(b) shows the approximate sampling distribu- 

- aie tion of the slope b of the sample regression line for samples of 

: 20 eruptions. If we take all possible SRSs of size n = 20 from 

a the population, we get the sampling distribution of b. Can you 


guess its shape, center, and spread? 


1 } 3 4 5 
Duration (minutes) Shape: Approximately Normal 
Center: pj, = 3 = 10.36 (b is an unbiased estimator of /3) 
o 6,13] 


= 1.27 where o, is the standard deviation of 


Spread: o, = = 
Pes hg 1.081520 
the 222 eruption durations. 


We interpret o7, just like any other standard deviation: the slopes of the sample regres- 
sion lines typically differ from the slope of the population regression line by about 1.27. 
Here’s a summary of the important facts about the sampling distribution of b. 


Note that the symbols cand 8 SAMPLING DISTRIBUTION OF A SLOPE 


here refer to the intercept and 
slope, respectively, of the population 
regression line. They are in no way 
related to Type | and Type II error 
probabilities, which are sometimes 
designated by these same symbols. 


We'll say more about the Normal condition in a moment. 


THINK What’s with that formula for Op? There are three factors that affect the 
ABOUT IT standard deviation of the sampling distribution of b: 

e a, the standard deviation of the residuals for the population regression line. 
Because o is in the numerator of the formula, when a is larger, so is o,. When 
the points are more spread out around the population (true) regression line, 
we should expect more variability in the slopes b of sample regression lines 
from repeated random sampling or random assignment. 


742 CHAPTER 12 MORE ABOUT REGRESSION 


THINK 
ABOUT IT 


e oy, the standard deviation of the explanatory variable. Because o, is in the 
denominator of the formula, when 9, is larger, op is smaller. More variability 
in the values of the explanatory variable leads to a more precise estimate of the 
slope of the true regression line. 


en, the sample size. Just like every other formula for the standard deviation of a 
statistic, the variability of the statistic gets smaller as the sample size increases. 
A larger sample size will lead to a more precise estimate of the true slope. 


SS or a 


Conditions for Regression Inference 


We can fit a least-squares line to any data relating two quantitative variables, 
but the results are useful only if the scatterplot shows a linear pattern. Inference 
about regression involves more detailed conditions. Figure 12.6 shows the regres- 
sion model when the conditions are met in picture form. The regression model 
requires that for each possible value of the explanatory variable x: 


1. The mean value of the response variable j1, falls on the population (true) 
regression line jy = a + Gx. 


2. The values of the response variable y follow a Normal distribution with com- 
mon standard deviation a. 


follow a Normal distribution 


For any fixed x, the responses y 
with standard deviation o. 


My = @ + Bx 


a 7 
xy Xy X3 
x 


FIGURE 12.6 The regression model when the conditions for inference are met. The line is the 
population (true) regression line, which shows how the mean response ,., changes as the ex- 
planatory variable x changes. For any fixed value of x, the observed response y varies according 
to a Normal distribution having mean j., and standard deviation o. 


What does the regression model in Figure 12.6 tell us? Consider 
the population of all eruptions of the Old Faithful geyser in a given year. For each 
eruption, let x be the duration (in minutes) and y be the interval of time (in minutes) 
until the next eruption. Suppose that the conditions for regression inference are met 
for this data set, that the population regression line is 44, = 34 + 10.4x, and that the 
spread around the line is given by o = 6. Let’s focus on the eruptions that lasted 
x = 2 minutes. For this “subpopulation”: 


e The average amount of time until the next eruption is p4, = 34 + 10.4(2) = 
54.8 minutes. 
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e The amounts of time until the next eruption follow a Normal distribution 
with mean 54.8 minutes and standard deviation 6 minutes. 

¢ For about 95% of these eruptions, the amount of time y until the next eruption 
is between 54.8 — 2(6) = 42.8 minutes and 54.8 + 2(6) = 66.8 minutes. ‘That 
is, if the previous eruption lasted 2 minutes, 95% of the time the next eruption 
will occur in 42.8 to 66.8 minutes. 


Duration (minutes) 
‘.. —_——_ —_ ——_ — Jom Jo  ————_ —a 


Here are the conditions for performing inference about the linear regression 
model. 


CONDITIONS FOR REGRESSION INFERENCE 


The acronym LINER should help you 
remember the conditions for inference 
about regression. 


Although the conditions for regression inference are a bit complicated, it is not 
hard to check for major violations. Most of the conditions involve the population 
(true) regression line and the deviations of responses from this line. We usually 
can’t observe the population line, but the sample regression line estimates it. The 
residuals from the sample regression line estimate the deviations from the popula- 
tion line. We can check several of the conditions for regression inference by look- 
ing at graphs of the residuals. Start by making a residual plot and a histogram or 
Normal probability plot of the residuals. 

Here’s a summary of how to check the conditions one by one. 
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Linear: Examine the scatterplot to see if the overall pattern is roughly linear. 
Make sure there are no curved patterns in the residual plot. Check to see that the 
residuals center on the “residual = 0” line at each x-value in the residual plot. 


Price (dollars) 


45000 { 
40000 - : 
35000 5 
30000 + 
25000 5 
20000 + 
15000 ; 
10000 ; 


10000 + 


5000 + = 


Residual 
oO 
es 


-5000/ = - 


-10000 -_ 


S S S S S S SS) 

S S S S S S 

sr SF FF SF SC S 
Miles driven 


Good: Scatterplot has a linear form. 


8 


s 


S 


S SS) 
S S 
SS 


SS) S S) S 
s s s se SS 
S S S S 
re F SS RS 


Sp 
Miles driven 


Bad: Residual plot shows a curved pattern. 


Independent: Look at how the data were produced. Random sampling and 
random assignment help ensure the independence of individual observations. 
If sampling is done without replacement, remember to check that the popula- 
tion is at least 10 times as large as the sample (10% condition). But there are 
other issues that can lead to a lack of independence. One example is measur- 
ing the same variable at intervals over time, yielding what is known as time- 
series data. Knowing that a young girl’s height at age 6 is 48 inches would defi- 
nitely give you additional information about her height at age 7. You should 
avoid doing inference about the regression model for time-series data. 


Normal: Make a stemplot, histogram, or Normal probability plot of the residu- 
als and check for clear skewness or other major departures from Normality. 
Ideally, we would check the distribution of residuals for Normality at each pos- 
sible value of x. Because we rarely have enough observations at each x-value, 
however, we make one graph of all the residuals to check for Normality. 

Equal SD: Look at the scatter of the residuals above and below the “residual 


= (” line in the residual plot. The vertical spread of the residuals should be 
roughly the same from the smallest to the largest x-value. 


10000 + 


Residual 
o 
4 
e 


-10000 


Residual 
oO 


Good: Residuals have roughly equal spread at all 
xX-values in the data set. 


ES EEE EEE | ~ 


Miles driven 


x 
Bad: The response variable y has greater spread 
for larger values of the explanatory variable x. 


Note that we do not have to check 
the 10% condition here because 
there was no random sampling. 
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e¢ Random: See if the data came from a well-designed random sample or ran- 
domized experiment. If not, we can’t make inferences about a larger popula- 
tion or about cause and effect. 


Let’s look at an example that illustrates the process of checking conditions. 
fe 


The Helicopter Experiment 
Checking conditions 


Mrs. Barrett’s class did a variation of the helicopter experiment on page 738. Stu- 
dents randomly assigned 14 helicopters to each of five drop heights: 152 centime- 
ters (cm), 203 cm, 254 cm, 307 cm, and 442 cm. Teams of students released the 
70 helicopters in a predetermined random order and measured the flight times in 
seconds. The class used Minitab to carry out a least-squares regression analysis for 
these data. A scatterplot and residual plot, plus a histogram and Normal probabil- 
ity plot of the residuals are shown below. 


3.0 0.507 ‘6 
o 
i . 
. 
= 25 0.254 « ° ® ° 
Fd a" oe ° ° e 
— e . 
~ 20 ° g 3 : : 
g 3 0.00 f 
2 i a I 3 a 
£15 e 4 e 
S 9 8 ® g 
c | ° 0.25 ns * 
1.0 * 
° 3 
r r : r r , = 0.504+_, x r x r r = 
150 200 250 300 350 400 450 150 200 250 300 350 400 450 
Drop height (cm) Drop height (cm) 
20 37 
> 
24 Pa 
15 i Pd 
§ of 
9 % 
x 
$ 10 3 0 
g a -1 
5 i a 
-24 .* 
. 
0 “34, - + + 7 
-0.4 -0.2 0.0 0.2 0.4 -0.50 -0.25 0.00 0.25 0.50 
Residual Residual 


PROBLEM: Check whether the conditions for performing inference about the regression model 
are met. 


SOLUTION: We'lluse our LINER acronym! 


* Linear: The scatterplot shows a clear linear form. The residual plot shows a random scatter about 
the horizontal line. For each drop height used in the experiment, the residuals are centered on the 
horizontal line at O. 


* Independent: Because the helicopters were released in a random order and no helicopter was used 
twice, knowing the result of one observation should not help us predict the value of another observation. 
* Normal: The histogram of the residuals is single-peaked and somewhat bell-shaped. In addition, 
the Normal probability plot is very close to linear. 
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AP® EXAM TIP The AP® 
exam formula sheet gives 

J = bo + b,x for the equation 
of the sample regression line. 
We will stick with our simpler 


notation, 7 = a+ bx, which is 
also used by Tl calculators. Just 
remember: the coefficient of x 
is always the slope, no matter 
what symbol is used. 


Because sis estimated from data, it 
is sometimes called the regression 
Standard error or the root mean 
Squared error. 


* Equal SD: The residual plot shows a similar amount of scatter about the residual = O line for the 
152, 203, 254, and 442 cm drop heights. Flight times (and the corresponding residuals) seem to 
vary alittle more for the helicopters that were dropped from a height of 307 cm. 

* Random: The helicopters were randomly assigned to the five possible drop heights. 

Except for a slight concern about the equal-SD condition, we should be safe performing inference 
about the regression model in this setting. 


For Practice Try Exercise 


You will always see some irregularity when you look for Normality and equal 
standard deviation in the residuals, especially when you have few observations. 
Don’t overreact to minor issues in the graphs when checking these two conditions. 


Estimating the Parameters 


When the conditions are met, we can do inference about the regression model 
[ty = a + Bx. The first step is to estimate the unknown parameters. If we calculate 
the sample regression line $ = a + bx, the slope b is an unbiased estimator of the 
true slope (, and the y intercept a is an unbiased estimator of the true y intercept 
a. The remaining parameter is the standard deviation a, which describes the vari- 
ability of the response y about the population (true) regression line. 

The least-squares regression line computed from the sample data estimates the 
population (true) regression line. So the residuals estimate how much y varies 
about the population line. Because a is the standard deviation of responses about 
the population (true) regression line, we estimate it by the standard deviation of 

residuals 


a ee 


Recall from Chapter 3 that s describes the size of a “typical” prediction error. 

It is possible to do inference about any of the three parameters in the regression 
model: a, 3, or o. However, the slope ( of the population (true) regression line 
is usually the most important parameter in a regression problem. So we'll restrict 
our attention to inference about the slope. 

When the conditions are met, the sampling distribution of the slope b is ap- 
proximately Normal with mean 4, = ( and standard deviation 


o 
o.WVn 


In practice, we don’t know a for the true regression line. So we estimate it with the 
standard deviation of the residuals, s. We also don’t know the standard deviation 
o, for the population of x-values. For reasons beyond the scope of this text, we re- 
place the denominator with s,Vn — 1. So we estimate the spread of the sampling 
distribution of b with the standard error of the slope 


Of. 


s 


a — 
: sVn — | 
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ae 


What happens if we transform the values of b by standardizing? Because the 
sampling distribution of b is approximately Normal, the statistic 


b- 8B 
z= 
Op 


is modeled well by the standard Normal distribution. Replacing the standard de- 
viation oy of the sampling distribution with its standard error gives the statistic 


— 
SE, 


which has a t distribution with n — 2 degrees of freedom. (The explanation of why 
df = n — 2 is beyond the scope of this book.) 

Let’s return to the Old Faithful eruption data. Figure 12.7(a) displays the simu- 
lated sampling distribution of the slope b from 1000 SRSs of n = 20 eruptions. 
Figure 12.7(b) shows the result of standardizing the b-values from these 1000 
samples. The superimposed curve is a ¢ distribution with df = 20 — 2 = 18. 


t distribution 
with 18 
degrees of 
freedom 


10 -4 = 0 2 4 
Slope Standardized slope 


FIGURE 12.7 (a) The approximate sampling distribution of the slope b for samples of size 

n= 20 eruptions. This distribution has a roughly Normal shape with mean about 10.36 and 
standard deviation about 1.27. (b) The sampling distribution of the standardized slope values has 
approximately a f distribution with df = n- 2. 


Constructing a Confidence Interval 
for the Slope 


In a regression setting, we often want to estimate the slope @ of the population 
(true) regression line. The slope b of the sample regression line is our point esti- 
mate for 3. A confidence interval is more useful than the point estimate because 
it gives a set of plausible values for (. 

The confidence interval for has the familiar form 


statistic + (critical value) - (standard deviation of statistic) 
Because we use the statistic b as our point estimate, the confidence interval is 


ba Ski 
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We call this a t interval for the slope. Here are the details. 


t INTERVAL FOR THE SLOPE 


AP® EXAM TIP The AP® exam 
formula sheet gives the formula 


for the standard error of the When the conditions for regression inference are met, a C% confidence 
slope as interval for the slope ( of the population (true) regression line is 
SVMi- WP pier Si 
n-2 
= SS In this formula, the standard error of the slope is 
VdKm- %) 
The numerator is just a fancy way SE, = See 
of writing the standard deviation Sev il 
of the residuals s. Can you show and ¢* is the critical value for the t distribution with df = n — 2 having C% of 


that the denominator of this its area between —t* and t*. 
formula is the same as ours? 


Although we give the formula for the standard error of b, you should rarely 
have to calculate it by hand. Computer output gives the standard error SE, along 
with b itself. However we get it, SE, estimates how much the slope of the sample 
regression line typically varies from the slope of the population (true) regression 
line if we repeat the data production process many times. 


The Helicopter Experiment 
A confidence interval for 3 


Earlier, we used Minitab to perform a least-squares regression analysis on the he- 
licopter data for Mrs. Barrett’s class. Recall that the data came from dropping 
70 paper helicopters from various heights and measuring the flight times. Some 
computer output from this regression is shown below. We checked conditions for 
performing inference earlier. 


Regression Analysis: Flight time versus Drop height 
Coef SE Coef T P 
—0.03761 0.05838 —0.64 0.522 
(cm) 0.0057244 0.0002018 28.37 0.000 


Predictor 


Constant 


Drop height 
S=0.168181 R-Sq = 92.2% R-Sq(adj) = 92.1% 


PROBLEM: 
(a) Give the standard error of the slope, SE,, Interpret this value in context. 


(b) Find the critical value for a 95% confidence interval for the slope of the true 
regression line. Then calculate the confidence interval. Show your work. 


(c) Interpret the interval from part (b) in context. 
(4) Explain the meaning of “95% confident” in context. 


Flight time (sec) 


When we compute the least- 
squares regression line based on 

a random sample of data, we can 
think about doing inference for the 
population regression line. When 
our least-squares regression line is 
based on data from a randomized 
experiment, as in this example, 

the resulting inference is about the 
true regression line relating the 


explanatory and response variables. 


We'll follow this convention from 
now on. 


re 
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SOLUTION: 


(a) We got the value of the standard error of the slope, O.0002016, from the “SE Coef” column in 
the computer output. If we repeated the random assignment many times, the slope of the sample 
regression line would typically vary by about 0.0002 from the slope of the true regression line for 
predicting flight time from drop height. 


(b) Because the conditions are met, we can calculate a t interval for the slope ( based ona tdistri- 


bution with df = n — 2 = 70 — 2 = 68. Using the more conservative df = 60 from Table B gives 
t* = 2.000. The 95% confidence interval is 


b+ t*SE, = 0.0057244 + 2.000(0.0002018) = 0.0057244 + 0.0004036 
= (0.0053208, 0.0061280) 


Using technology: From invT (.025,68),weget t* = 1.995. The resulting 95% confidence 
interval is 


0.0057244 = 1.995(0.0002018) = 0.0057244 + 0.0004026 
= (0.0053218, 0.0061270) 


This interval is slightly narrower due to the more precise t* critical value. 

(c) Weare 95% confident that the interval from 0.00532 18 to 0.0061 270 seconds per cm 
captures the slope of the true regression line relating the flight time yand drop height x of paper 
helicopters. 

(4) Ifwe repeat the experiment many, many times, and use the method in part (b) to construct a 
confidence interval each time, about 95% of the resulting intervals will capture the slope of the true 
regression line relating flight time yand drop height xof paper helicopters. 


For Practice Try Exercise 


The values of t given in the computer regression output are not the 4g 
critical values for a confidence interval. They come from carrying out a 
significance test about the y intercept or slope of the population (true) 
regression line. We'll discuss tests in more detail shortly. 

You can find a confidence interval for the y intercept a of the population (true) 
regression line in the same way, using a and SE, from the “Constant” row of the 
Minitab output. However, we are usually interested only in the point estimate for 
a that’s provided in the computer output. 

Here is an example using a familiar context that illustrates the four-step process 
for calculating and interpreting a confidence interval for the slope. 


How Much Is That Truck Worth? := A 


L 


Everyone knows that cars and trucks lose value the more they are 
driven. Can we predict the price of a used Ford F-150 SuperCrew 
4 Xx 4if we know how many miles it has on the odometer? A random 
sample of 16 used Ford F-150 SuperCrew 4 X 4s was selected from 
among those listed for sale on autotrader.com. The number of miles 
driven and price (in dollars) were recorded for each of the trucks.’ 


Confidence interval for a slope 
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Here are the data: 


Miles driven: 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 
Price (in dollars): 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 


Miles driven: 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
Price (in dollars): 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 


Minitab output from a least-squares regression analysis for these data is shown 


below. 
() 45.000 4 ©) 0.000 4 
40,000 + a : : 
~ 35,000 4 5000 -| : 
zg . 
S 30,000 5 - . . . 
3 = e 
1& 25.000 5 3 0 
— vo 
3 20,000 4 Z : ; 
Fy 
15,000 4 -5000 4 * ‘ : 
10,000 + . ° f 
5000 T T T T T T T T ~ 10,000 -} T T T T T T T T 
0 20,000 40,000 60,000 80,000. 100,000 120,000 140,000 160,000 0 20,000 40,000 60,000 80,000 100,000 120,000 140,000 160,000 
Miles driven Miles driven 
34 
Regression Analysis: Price (dollars) versus Miles driven 
> 2] 
e Predictor Coef SE Coef T Pp 
s 
Zz Constant 38257 2446 15.64 0.000 
a Miles driven -—0.16292 0.03096 =5.26 0.000 
S$=5740.13 R-Sq=66.4%  R-Sq(adj) = 64.0% 
-8000 -4000 0 4000 8000 
Residual 


PROBLEM: Construct and interpret a 90% confidence interval for the slope of the population 
regression line. 


SOLUTION: Wewill follow the familiar four-step process. 


STATE: We want to estimate the slope (3 of the population regression line relating miles driven to 
price with 90% confidence. 

PLAN: Ifthe conditions are met, we will use a t interval for the slope of a regression line. 

* Linear: The scatterplot shows a clear linear pattern. Also, the residual plot shows a random 
scatter of points about the residual = O line. 

* Independent: Because we sampled without replacement to get the data, there have to be at least 
10(16) = 160 used Ford F-150 SuperCrew 4 X 4s listed for sale on autotrader.com. This seems 
reasonable to believe. 

* Normal: The histogram of the residuals is roughly symmetric and single-peaked, so there are no 
obvious departures from Normality. 

* Equal SD: The scatter of points around the residual = O line appears to be about the same at all 
x-values. 

* Random: We randomly selected the 16 pickup trucks in the sample. 
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~~ 


DO: Weuse the tdistribution with 16 —2 = 14 degrees of freedom to find the critical value. For a 
90% confidence level, the critical value is t* = 1.761. So the 90% confidence interval for Gis 


b+ t*SE,= —0.16292 + 1.761(0.03096) = —0.16292 + 0.05452 
= (—0.21744, —0.10840) 


Using technology: Refer to the Technology Corner that follows the example. The calculator’s 
LinRegTInt gives (— 0.2173, —0.1064) using df = 14. 

CONCLUDE: Weare 90% confident that the interval from —0.2173 to —0.1084 captures 
the slope of the population regression line relating price to miles driven for used Ford F-150 
SuperCrew 4 X 46 listed for sale on autotrader.com. 


For Practice Try Exercise 9 | 


The predicted change in price of a used Ford F-150 is quite small for a 1-mile 
increase in miles driven. What if miles driven increased by 1000 miles? We can 
just multiply both endpoints of the confidence interval in the example by 1000 to 
get a 90% confidence interval for the corresponding predicted change in average 
price. The resulting interval is (—217.3, — 108.4). That is, the population regres- 
sion line predicts a decrease in price of between $108.40 and $217.30 for every 
additional 1000 miles driven. 

So far, we have used computer regression output when performing inference 
about the slope of a population (true) regression line. The T1-83/84, TI-89, and TI- 
Nspire can do the calculations for inference when the sample data are provided. 


CONFIDENCE INTERVAL FOR SLOPE 


28. CORNER ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s use the data from the previous example to construct a confidence interval for the slope of a population (true) 
regression line on the T1-83/84 and T1-89. Enter the x-values (miles driven) into LI/listl and the y-values (price) into 
L2Alist2. 


TI-83/84 with recent OS TI-89 
e Press|stat], then choose TESTS and e Press |2nd][F2]([F7]) and choose 
IbpAIRXS NIMC 5 vn 6 LinRegTIint. 
e Inthe LinRegTInt screen, adjust the inputs as shown. ¢ In the LinRegT'Int screen, adjust the inputs as shown 
Then highlight “Calculate” and press [ENTER |. and press [ENTER |. 
ea 
LinRegTInt PY hist: 
Xlist:Li j Frea | fo | 
cao } store RedEan te: wdexh> 
ewes 9 Interual: S1oRe+ 
ResEQ: : eee ae] 
Calculate € Leuel: as 
7 =a 
TYPE # CENTERISOK AND [ESCISCANCEL 
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e The linear regression t interval results are shown below. The TI-84 Plus C fits the results on one screen. The 
TI-83/84 and TI-89 require you to arrow down to see the rest of the output. 


Sarre a 
LinRegT Int 

y=atbx =Fe = 

(-.2173. -.1084) fie 

=-. 1628114837 =,QEKEZY 

df=14 =14, 

5=5737.55499 =EP4O13 

a=38254. 8639 =.080886 

r?=, 664549225 . =3B257.4 

r=~.8151988868 ig 

MAIN Fav AUTO FUN za 


Note that s is the standard deviation of the residuals, not the standard error of the slope. 


AP® EXAM TIP The formula for the t interval for the slope of a population (true) 
regression line often leads to calculation errors by students. As a result, we recommend 
using the calculator’s LinRegTInt feature to compute the confidence interval on the 


AP® Exam. Be sure to name the procedure (t interval for slope) and to give the interval 
(-0.217, —0.108) and df (14) as part of the “Do” step. 


CHECK YOUR UNDERSTANDING 


Does fidgeting keep you slim? Some people don’t gain weight even when they overeat. 
Perhaps fidgeting and other “nonexercise activity” (NEA) explain why—some people 
may spontaneously increase nonexercise activity when fed more. Researchers deliberately 
overfed a random sample of 16 healthy young adults for 8 weeks. They measured fat gain 
(in kilograms) as the response variable and change in energy use (in calories) from activity 
other than deliberate exercise—fidgeting, daily living, and the like—as the explanatory 
variable. Here are the data:* 


NEA change (cal): 94 57 29 135 143 151 245 355 
Fat gain (kg): 4.2 3.0 3.7 2.7 3.2 3.6 2.4 1.3 
NEA change (cal): 392 473 486 535 571 580 620 690 
Fat gain (kg): 3.8 1.7 1.6 2.2 1.0 0.4 2.3 1.1 


Minitab output from a least-squares regression analysis for these data is shown below. 


w 


N 


Residual 


Fat gain (kilograms) 


~ 


100 0 100 200 300 400 500 600 700 “t00 0 100 200 300 400 500 600 700 
NEA change (calories) NEA change (calories) 
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Regression Analysis: Fat gain versus NEA change 


Predictor Coef SE Coef T P 
Constant 3.5051 0.03036 11.54 0.000 
NEA change —0.0034415 0.0007414 —4.64 0.000 


S=0.739853 R-Sq=60.6% R-Sq(adj) =57.8% 


Construct and interpret a 95% confidence interval for the slope of the population (true) 
regression line. 


Performing a Significance Test for the Slope 


When the conditions for inference are met, we can use the slope b of the sample 
regression line to construct a confidence interval for the slope @ of the popula- 
tion (true) regression line. We can also perform a significance test to determine 
whether a specified value of ( is plausible. The null hypothesis has the general 
form Ho: = . To do a test, standardize b to get the test statistic: 


statistic — parameter 


test statistic = ree aaa 
standard deviation of statistic 


bab 
SE; 


To find the P-value, use a ¢ distribution with n — 2 degrees of freedom. Here are 
the details for the ¢ test for the slope. 


t TEST FOR THE SLOPE 


If sample data suggest a linear relationship between two variables, how can 
we determine whether this happened just by chance or whether there is actually 
a linear relationship between x and y in the population? By performing a test of 
Ho: = 0. A regression line with slope 0 is horizontal. ‘That is, the mean of y does 
not change at all when x changes. So Hp: 3 = 0 says that there is no linear rela- 
tionship between x and y in the population. Put another way, Ho says that linear 
regression of y on x is of no value for predicting y. 
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ee ahaa will only do a test with Regression output from statistical software usually gives t and its two-sided 
0: B =U, 


P-value for a test of Hp: G = 0. For a one-sided test in the proper direction, just di- 
vide the P-value in the output by 2. The following example shows what we mean. 


Crying and IQ "A 
Significance test for B LL 


Infants who cry easily may be more easily stimulated than others. This may be a 
sign of higher IQ. Child development researchers explored the relationship be- 
tween the crying of infants 4 to 10 days old and their later IQ test scores. A snap 
of a rubber band on the sole of the foot caused the infants to cry. The researchers 
recorded the crying and measured its intensity by the number of peaks in the most 
active 20 seconds. They later measured the children’s IQ at age three years using 
the Stanford-Binet IQ test. The table below contains data from a random sample 


Crycount 10 Crycount 10 Crycount IQ Crycount 10 

AP® EXAM TIP When you 10 87 20 90 17 94 12 94 
see a list of data values on 12 97 16 100 19 103 12 103 
an exam question, don't just 9 103 23 103 13 104 14 106 
ster Ding ihe ate ne YOur 16 106 27 108 18 109 10 109 
calculator. Read the question 

first. Often, additional 18 109 15 112 18 112 23 113 
information is provided that 15 (114 21 114 16 118 9 119 
makes it unnecessary for you 12 119 12 120 19 120 16 124 
to enter the data at all. This 20 132 15 133 22 135 31 135 
can save you valuable time 16 136 17 144 30 155 22 157 

® 
on the AP™ exam. 33 159 13 162 


Some computer output from a least-squares regression analysis on these data is 
shown below. 


Regression Analysis: IQ versus Crycount 


Predictor Coef SE Coef T P 
Constant 91.268 8.934 10.22 0.000 
Crycount 1.4929 0.4870 3.07 0.004 


S=17.50 R-Sq=20.7%  R-Sq(adj) =18.5% 


o io: 10 6 20 3 30 3 40 
Count of crying peaks 


Because there were no infants who 
recorded fewer than 9 crying peaks 
in their most active 20 seconds, it 
is a risky extrapolation to use this 
line to predict the value of y when 
x=0. 
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(a) What is the equation of the least-squares regression line for predicting IQ at age 3 from the 
number of crying peaks (crycount)? Interpret the slope and yintercept of the regression line in 
context. 


(b) Explain what the value of s means in this setting. 


(c) Do these data provide convincing evidence of a positive linear relationship between crying counts 
and IQ in the population of infants? 


SOLUTION: 
(a) The equation of the least-squares line is 


predicted IQ score = 91.268 + 1.4929 (crycount) 


Slope: For each additional crying peak in the most active 20 seconds, the regression line predicts an 
increase of about 1.5 IQ points. y intercept: The model predicts that an infant who doesn’t cry when 
flicked with a rubber band will have a later IQ score of about 91. 

(b) The size of a typical prediction error when using the regression line in part (a) is 17.50 IQ points. 


(c) We'll follow the four-step process. 
STATE: We want to perform a test of 


Hp: 3 =0 
H,:G>0 


where (3 is the slope of the population regression line relating crying count to IQ score. No signifi- 
cance level was given, so we'lluse a = 0.05. 


PLAN: Ifthe conditions are met, we will doa ttest for the slope (2. 
° Linear: The scatterplot suggests a moderately weak positive linear relationship between crying 
peaks and IQ. The residual plot shows a random scatter of points about the residual = O line. 


* Independent: Due to sampling without replacement, there have to be at least 10(38) = 380 
infants in the population from which these children were selected. 


* Normal: The Normal probability plot of the residuals shows slight curvature, but no strong 
skewness or obvious outliers that would prevent use of t procedures. 


* Equal SD: The residual plot shows a fairly equal amount of scatter around the horizontal line at O 
for all x-values. 


° Random: We are told that these 38 infants were randomly selected. 
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Our usual formula for the test DO: Wecanget the test statistic and P-value from the Minitab output. 

statistic confirms the value in the 

computer output: * Test statistic: t = 3.07 (lookin the “T” column of the computer output across from “Crycount”) 

a bBo _ 1.4929—-0 _ 307  ° Pralue: Figure 12.8 displays the P-value for this one-sided test as an area under the tdistribution curve 
SE» 0.4870 , with 38 — 2 = 36 degrees of freedom. The Minitab output gives P= 0.004 as the P-value for a 


two-sided test. The P-value for the one-sided testis half of this, P= 0.002. 


t distribution, Using technology: Refer to the Technology Corner that follows the example. The 
. ces of calculator’s LinRegTTest gives t = 3.065 and P-value = 0.002 using df = 36. 
reeaom 


CONCLUDE: Because the P-value, 0.002, is less than « = 0.05, we reject Ho. There 
is convincing evidence of a positive linear relationship between intensity of crying and IQ 
score in the population of infants. 


P -Valuc = 0.002 


<—Vvalues of t—> * 
t= 3.0F 


FIGURE 12.8 The P-value for the one-sided test. For Practice Try Exercise 


Based on the results of the crying and IQ study, should we ask doctors and 
parents to make infants cry more so that they'll be smarter later in life? Hardly. 
This observational study gives statistically significant evidence of a positive linear 
relationship between the two variables. However, we can’t conclude that more 
intense crying as an infant causes an increase in IQ. Maybe infants who cry more 
are more alert to begin with and tend to score higher on intelligence tests. 


SIGNIFICANCE TEST FOR SLOPE 


29-| CORNER” ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


Let’s use the data from the crying and IQ study to perform a significance test for the slope of the population regression 
line on the TI-83/84 and T1-89. Enter the x-values (crying count) into L1/listl and the y-values (IQ score) into L2/list2. 


TI-83/84 TI-89 
e Press[stat], then choose TEST'S and e Press |[2nd]/[F1|([F'6]) and choose 
LinRegTTest. .. . LinRegTTest. 
¢ Inthe LinRegTTest screen, adjust the inputs asshown. ¢ Inthe LinRegTTest screen, adjust the inputs as 
Then highlight “Calculate” and press [ENTER]. shown and press [ENTER | 
NORMAL FLOAT AUTO REAL RADIAN CL 


o 


LinResTTest = List: : 
a vit 
Ylist:L2 
ae <O M@ternate Hye: Ba POF 
RegEQ: Stove ReSEan ter vitx> 
Calculate 
1s< = 
USE © @NO > TO OPEN CHOICES 
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e The linear regression ¢ test results take two screens to present. We show only the first screen. 


NORMAL FLOAT AUTO REAL RADIAN CL fl 


LinResTTest 
y=atbx 
B>@ and 9>d 
t=3. 065489379 
P=, 0020526501 
df=36 
a=91. 26829865 
b=1.492896598 
+s=17.49872122 


AP® EXAM TIP The formula for the test statistic in a f test for the slope of a population (true) 
regression line often leads to calculation errors by students. As a result, we recommend using the 
calculators LinRegTTest feature to perform calculations on the AP® exam. Be sure to name the 
procedure (ttest for slope) and to report the test statistic (t = 3.065), P-value (0.002), and df (36) as 
part of the “Do” step. 


What’s with that p > 0 in the LinRegITest screen? The slope b 
THINK Bed ety: 
of the least-squares regression line is closely related to the correlation r between 
ABOUT IT 


8) 
the explanatory and response variables x and y. (Recall that b = re In the same 


way, the slope ( of the population regression line is closely related to the corre- 
lation p (the lowercase Greek letter rho) between x and y in the population. In 
particular, the slope is 0 when the correlation is 0. 

Testing the null hypothesis Ho: @ = 0 is, therefore, exactly the same as testing 
that there is no correlation between x and y in the population from which we drew 
our data. You can use the test for zero slope to test the hypothesis Hp: p = 0 of zero 
correlation between any two quantitative variables. That’s a useful trick. Because 
correlation also makes sense when there is no explanatory-response distinction, it 
is handy to be able to test correlation without doing regression. 


OO 


CHECK YOUR UNDERSTANDING 


The previous Check Your Understanding (page 752) described some results from a study 
of nonexercise activity (NEA) and fat gain. Here, again, is the Minitab output from a least- 
squares regression analysis for these data. 


Regression Analysis: Fat gain versus NEA change 
Predictor Coef SE Coef T P 
Constant 3.5051 0.3036 11.54 0.000 
NEA change —0.0034415 0.0007414 —4.64 0.000 
S=0.739853 R-Sq = 60.6% R-Sq(adj) =57.8% 


Do these data provide convincing evidence at the a = 0.05 significance level of a negative 
linear relationship between fat gain and NEA change in the population of healthy young 
adults? Assume that the conditions for regression inference are met. 
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Summary 


e Least-squares regression fits a straight line of the form = a + bx to data to 
predict a response variable y from an explanatory variable x. Inference in this 
setting uses the sample regression line to estimate or test a claim about the 
population (true) regression line. 
e The conditions for regression inference are 
e Linear: The actual relationship between x and y is linear. For any fixed 
value of x, the mean response /, falls on the population (true) regression 
lime eer a, 

e Independent: Individual observations are independent. When sam- 
pling is done without replacement, check the 10% condition. 

e¢ Normal: For any fixed value of x, the response y varies according to a 
Normal distribution. 

e Equal SD: The standard deviation of y (call it a) is the same for all 
values of x. 

e Random: The data are produced from a well-designed random sample 
or randomized experiment. 

e The slope b and intercept a of the sample regression line estimate the slope 
@ and intercept a of the population (true) regression line. Use the standard 
deviation of the residuals, s, to estimate o. 

¢ Confidence intervals and significance tests for the slope @ of the population 
regression line are based on a t distribution with n — 2 degrees of freedom. 


e The t interval for the slope ( has the form b + t*SE,, where the standard 


error of the slope is SE, = : 


s.Vn — 1 
e To test the null hypothesis Ho: = (, carry out a t test for the slope. This 
ae 
test uses the statistic t = = The most common null hypothesis is 


Hy: =0, which says that there is no linear relationship between x and y in 
the population. 


TECHNOLOGY 
CORNERS 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


28. Confidence interval for slope on the calculator 


29. Significance test for slope on the calculator 
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Exercises 


Oil and residuals Exercise 53 on page 194 
(Chapter 3) examined data on the depth of small 
defects in the Trans-Alaska Oil Pipeline. Research- 
ers compared the results of measurements on 100 
defects made in the field with measurements of the 
same defects made in the laboratory.° The figure 
below shows a residual plot for the least-squares 
regression line based on these data. Explain why the 
conditions for performing inference about the slope 
@ of the population regression line are not met. 


BOR] 
e 
oy AS 
e 
) 
5 
z . 
5 ORS » e 
Ss ® % e 
Z ‘cae oe ® ye e 
° ~~ es 
a=) OK, ak YZ 
2 e ee. oo e DF > *. 
= *. % © @ e 
‘ ‘0° ° e 
E -10 4 * 
7 a 
Pd 
-20 4 
-30 4 
T T T T T 
0 20 40 60 80 


Laboratory measurement 


SAT Math scores In Chapter 3, we examined data 
on the percent of high school graduates in each 
state who took the SAT and the state’s mean SAT 
Math score ina recent year. The figure below shows 
a residual plot for the least-squares regression line 
based on these data. Explain why the conditions for 
performing inference about the slope @ of the popu- 
lation regression line are not met. 
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Beer and BAC How well does the number of beers 
a person drinks predict his or her blood alcohol 
content (BAC)? Sixteen volunteers aged 21 or older 
with an initial BAC of 0 took part in a study to find 
out. Each volunteer drank a randomly assigned 
number of cans of beer. ‘Thirty minutes later, a police 
officer measured their BAC. Least-squares regression 
was performed on the data. A residual plot and a 
histogram of the residuals are shown below. Check 
whether the conditions for performing inference 
about the regression model are met. 


004 ‘ 
0.034 . 

|g omy ° 

\4 om; Ce “,. 

& 0.001 . 
0.01; ° . 
0.02} : . : 
———————————— 


-0.03 -0.02 -0.01 0.00 0.01 0.02 0.03 0.04 
Residual 


Prey attracts predators Here is one way in which 
nature regulates the size of animal populations: high 
population density attracts predators, which remove 
a higher proportion of the population than when 

the density of the prey is low. One study looked at 
kelp perch and their common predator, the kelp 
bass. The researcher set up four large circular pens 
on sandy ocean bottoms off the coast of southern 
California. He chose young perch at random from a 
large group and placed 10, 20, 40, and 60 perch in 
the four pens. Then he dropped the nets protecting 
the pens, allowing bass to swarm in, and counted the 
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perch left after two hours. Here are data on the 
proportions of perch eaten in four repetitions of this 


setup: 
Number of Perch Proportion Killed 
10 0.0 0.1 0.3 0.3 
20 0.2 0.3 Os 0.6 
40 0.075 0.3 0.6 0.725 
60 0.517 0'55 ONG 0.817 


The explanatory variable is the number of perch (the 
prey) in a confined area. The response variable is the 
proportion of perch killed by bass (the predator) in two 
hours when the bass are allowed access to the perch. A 
scatterplot of the data shows a linear relationship. 


Predictor 
Constant 
Perch 


> (0). An 


7. 


We used Minitab software to carry out a least-squares Pole (a) 


regression analysis for these data. A residual plot and 
a histogram of the residuals are shown below. Check 
whether the conditions for performing inference 
about the regression model are met. 


Residual 


10 20 30 40 50 60 
Number of perch 


-0.4 -03 -02 -01 00 O1 O2 O03 
Residual 


5. Beer and BAC Refer to Exercise 3. Computer out- 
put from the least-squares regression analysis on the 
beer and blood alcohol data is shown below. 


Dependent variable is: BAC 


No Selector 
R squared = 80.0% R squared (adjusted) = 78.6% 


s= 0.0204 with 16-2=14 degrees of freedom 


Variable Coefficient s.e. of Coeff t-ratio prob 
Constant. —0.012701 0.0126 — ail; (00) Om 33210 
Beexs 0.017964 0.0024 7.84 =0F 000m: 


(b) 
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The model for regression inference has three parameters: 
a, 3, and o. Explain what each parameter represents in 
context. ‘l'hen provide an estimate for each. 

Prey attracts predators Refer to Exercise +. 
Computer output from the least-squares regression 
analysis on the perch data is shown below. 

Coef 
0.12049 
0.008569 


Stdev. 
0.09269 
0.002456 


t-ratio p 
IL 5 230) 
3.49 


R-Sq = 46.5% R-Sq(adj) = 42.7% 


The model for regression inference has three param- 
eters: a, 3, and o. Explain what each parameter repre- 
sents in context. Then provide an estimate for each. 


Beer and BAC Refer to Exercise 5. 


Give the standard error of the slope, SE;. Interpret 
this value in context. 


Find the critical value for a 99% confidence interval 
for the slope of the true regression line. Then calcu- 
late the confidence interval. Show your work. 


Interpret the interval from part (b) in context. 
Explain the meaning of “99% confident” in context. 
Prey attracts predators Refer to Exercise 6. 


Give the standard error of the slope, SE». Interpret 
this value in context. 


Find the critical value for a 90% confidence interval 
for the slope of the true regression line. Then calcu- 
late the confidence interval. Show your work. 


Interpret the interval from part (b) in context. 
Explain the meaning of “90% confident” in context. 


Beavers and beetles Do beavers benefit beetles? 
Researchers laid out 23 circular plots, each 4 meters 
in diameter, at random in an area where beavers 
were cutting down cottonwood trees. In each plot, 
they counted the number of stumps from trees cut by 
beavers and the number of clusters of beetle larvae. 
Ecologists think that the new sprouts from stumps 
are more tender than other cottonwood growth, so 
that beetles prefer them. If so, more stumps should 
produce more beetle larvae.® 


Minitab output for a regression analysis on these data 
is shown below. Construct and interpret a 99% confi- 
dence interval for the slope of the population regres- 

sion line. Assume that the conditions for performing 

inference are met. 


Regression Analysis: Beetle larvae versus Stumps 


Predictor Coef SEECock son Pp 
Constant —1.286 Pd {i153} —-0.45 ORG Si 
Stumps 11.894 1,136 LOA VO OieG 
S=6.41939 R-Sq= 83.9% R-Sq(adj) = 83.1% 


10. 


Section 12.1 Inference for Linear Regression 


Ideal proportions ‘The students in Mr. Shenk’s class 
measured the arm spans and heights (in inches) of a 
random sample of 18 students from their large high 
school. Some computer output from a least-squares 
regression analysis on these data is shown below. Con- 
struct and interpret a 90% confidence interval for the 
slope of the population regression line. Assume that 
the conditions for performing inference are met. 


Predictor Coef Stdev t-ratio p 
Constant 11.547 5.600 2.06 0-056 
Armspan 0.84042 0.08091 10.39 0.000 
S— 16a s)Resq— ey) 1s Roca (ad)) —se oe 

11. Beavers and beetles Refer to Exercise 9. 

(a) How many clusters of beetle larvae would you 
predict in a circular plot with 5 tree stumps cut by 
beavers? Show your work. 

(b) About how far off do you expect the prediction in 
part (a) to be from the actual number of clusters of 
beetle larvae? Justify your answer. 

12. Ideal proportions Refer to Exercise 10. 

(a) What height would you predict for a student with an 
arm span of 76 inches? Show your work. 

(b) About how far off do you expect the prediction in 


part (a) to be from the student’s actual height? Justify 
your answer. 


Weeds among the com Lamb’s-quarter is a com- 
mon weed that interferes with the growth of corn. 
An agriculture researcher planted corn at the same 
rate in 16 small plots of ground and then weeded 
the plots by hand to allow a fixed number of lamb’s- 
quarter plants to grow in each meter of corn row. 
The decision of how many of these plants to leave in 
each plot was made at random. No other weeds were 
allowed to grow. Here are the yields of corn (bushels 
per acre) in each of the plots: 


Some computer output from a least-squares regres- 
sion analysis on these data is shown below. 


Predictor Coef SH Cocke. Pp 
Constant 166.483 Pa tied oi le 0.000 
Weeds per -—1.0987 0.5712 =1.92 0.075 
meter 


S=7.97665 R-Sq=20.9% R-Sq(adj) =15.3% 


(a) What is the equation of the least-squares regression 
line for predicting corn yield from the number 
of lamb’s quarter plants per meter? Interpret the 
slope and y intercept of the regression line in 
context. 


(b) Explain what the value of s means in this settting. 


= 
le) 
WK 


Do these data provide convincing evidence 

at the a = 0.05 level that more weeds reduce corn 
yield? Assume that the conditions for performing 
inference are met. 


14. Time at the table Does how long young children 
remain at the lunch table help predict how much 
they eat? Here are data on a random sample of 
20 toddlers observed over several months.!° “Time” 
is the average number of minutes a child spent at the 
table when lunch was served. “Calories” is the 
average number of calories the child consumed 
during lunch, calculated from careful observation of 
what the child ate each day. 


Some computer output from a least-squares regres- 
sion analysis on these data is shown below. 


Time 
Predictor Coef Sis} (Cleysue Ur P 
Constant 560.65 29,37 19°09 0.000 
Time —SeOn mele 0.8498 Oe 0.002 


SS95.5900 RS =SAA.1h ReSo(eck]) = 20.98 


(a) What is the equation of the least-squares regression 
line for predicting calories consumed from time at 
the table? Interpret the slope of the regression line in 
context. Does it make sense to interpret the y inter- 
cept in this case? Why or why not? 


Ss 


Explain what the value of s means in this setting. 


= 
le) 
WK 


Do these data provide convincing evidence at the 
a = 0.01 level of a linear relationship between time 
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at the table and calories consumed in the population 
of toddlers? Assume that the conditions for perform- 
ing inference are met. 


15. Is wine good for your heart? A researcher from the 
University of California, San Diego, collected data 
on average per capita wine consumption and heart 
disease death rate in a random sample of 19 coun- 
tries for which data were available. The following 
table displays the data.!! 


Alcohol from Heart disease Alcohol from Heart disease 


wine death rate wine death rate 

(liters/year) (per 100,000) (liters/year) (per 100,000) 
De 211 7.9 107 
3.9 167 1.8 167 
2.9 131 1.9 266 
2.4 191 0.8 227 
2.9 220 6.5 86 
0.8 297 1.6 207 
9.1 71 5.8 ie 
Dall We 1¢ 285 
0.8 211 12 199 
0.7 300 


Is there statistically significant evidence of a negative 
linear relationship between wine consumption and 
heart disease deaths in the population of countries? 
Carry out an appropriate significance test at the 


a = 0.05 level. 


16. The professor swims Here are data on the time (in 
minutes) Professor Moore takes to swim 2000 yards 
and his pulse rate (beats per minute) after swimming 
on a random sample of 23 days: 


Time: 34.12 35.72 34.72 34.05 3413 35.72 
Pulse: lo2 124 140 152 146 128 
Time: 36.17 35.57 35.37 35.57 = 35.43 36.05 
Pulse: 136 144 148 144 136 124 
Time: 34.85 34.70 34.75 33.93 34.60 34.00 
Pulse: 148 144 140 156 136 148 
Time: 34.35 35.62 35.68 35.28 35.97 


Pulse: 148 132 124 132 139 


Is there statistically significant evidence of a negative 
linear relationship between Professor Moore’s swim 
time and his pulse rate in the population of days on 
which he swims 2000 yards? Carry out an appropri- 
ate significance test at the a = 0.05 level. 


17. Stats teachers’ cars A random sample of AP® Sta- 
tistics teachers was asked to report the age (in years) 
and mileage of their primary vehicles. A scatterplot 
of the data is shown at top right. 


Variable Coef 
Constant 
Car age 


Si g280 


(a) 


18. 
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120,000 4 
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Car age 


Computer output from a least-squares regression 
analysis of these data is shown below (df = 19). 
Assume that the conditions for regression inference 
are met. 


SE Coef t-ratio prob 
0.2826 
<0.0001 


7288.54 6591 
11630.6 1249 
R-Sq = 82.0% 


aeeesiralt 
S) 4 aha 
RSq(adj) = 81.1% 


Verify that the 95% confidence interval for the 
slope of the population regression line is (9016.4, 
14,244,8). 


A national automotive group claims that the typical 
driver puts 15,000 miles per year on his or her main 
vehicle. We want to test whether AP® Statistics teach- 
ers are typical drivers. Explain why an appropriate 
pair of hypotheses for this test is Ho: 8 = 15,000 
versus H,: 3 # 15,000. 


Compute the test statistic and P-value for the test in 
part (b). What conclusion would you draw at the 
a = 0).05 significance level? 


Does the confidence interval in part (a) lead to the 
same conclusion as the test in part (c)? Explain. 


Paired tires Exercise 71 in Chapter 8 (page 529) 
compared two methods for estimating tire wear. The 
first method used the amount of weight lost by a tire. 
The second method used the amount of wear in the 
grooves of the tire. A random sample of 16 tires was 
obtained. Both methods were used to estimate the 
total distance traveled by each tire. The following 
scatterplot displays the two estimates (in thousands of 
miles) for each tire.! 
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10 20 30 40 50 
Weight 


Computer output from a least-squares regression 
analysis of these data is shown below. Assume that 
the conditions for regression inference are met. 


Predictor Coef SE Coeff T P 
Constant ab, Shsjall >, AMOS) 0.64 (0) , 53a. 
Weight One SIO 2st 0.07104 akaby al 0.000 


S=2.62078 R-Sq= 89.8% R-Sq(adj) = 89.1% 


(a) Verify that the 99% confidence interval for the slope 
of the population regression line is (0.5787, 1.0017). 


(b) Researchers want to test whether there is a difference 
in the two methods of estimating tire wear. Explain 
why the researchers might think that an appropriate 
pair of hypotheses for this test is Ho: 3 = 1 versus 
H,:6# 1. 


(c) Compute the test statistic and P-value for the test in 
part (b). What conclusion would you draw at the 
a = 0.01 significance level? 


(d) Does the confidence interval in part (a) lead to the 
same conclusion as the test in part (c)? Explain. 


Multiple choice: Select the best answer for Exercises 19 
to 24, which are based on the following information. 
To determine property taxes, Florida reappraises real 
estate every year, and the county appraiser’s Web site lists 
the current “fair market value” of each piece of property. 
Property usually sells for somewhat more than the ap- 
praised market value. We collected data on the appraised 
market values x and actual selling prices y (in thousands 
of dollars) of a random sample of 16 condominium units 
in Florida. We checked that the conditions for inference 
about the slope of the population regression line are met. 
Here is part of the Minitab output from a least-squares 
regression analysis using these data.!? 


Predictor Coef SE Goch aaar Pp 
Constant LAY Ay 79.49 iL, 0) 0) 54132 
Appraisal 1.0466 0.1126 O29 0.000 


S=69.7299 R-Sq= 86.1% R-Sq(adj) = 85.1% 


19. ‘The equation of the least-squares regression line for 
predicting selling price from appraised value is 
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pricé = 79.49 + 0.1126 (appraised value). 
pricé = 0.1126 + 1.0466 (appraised value). 
price = 127.27 + 1.0466 (appraised value). 
price = 1.0466 + 127.27 (appraised value). 
price = 1.0466 + 69.7299 (appraised value). 


The slope £ of the population regression line describes 


the exact increase in the selling price of an individu- 
al unit when its appraised value increases by $1000. 


the average increase in the appraised value in a popu- 
lation of units when selling price increases by $1000. 


the average increase in selling price in a population 
of units when appraised value increases by $1000. 


the average increase in the appraised value in the 
sample of units when selling price increases by $1000. 


the average increase in selling price in the sample of 
units when the appraised value increases by $1000. 


Is there convincing evidence that selling price 
increases as appraised value increases? ‘To answer 
this question, test the hypotheses 


Ho: 8 = 0 versus H,: 3 > 0. 
Ho: 8 = 0 versus H,: 3 < 0. 
Ho: 2 = 0 versus H,:G # 0. 
Ho: > 0 versus H,: 3 = 0. 
Ho: 6 = 1 versus H,:6> 1. 


. Which of the following is the best interpretation for 


the value 0.1126 in the computer output? 


For each increase of $1000 in appraised value, the 
average selling price increases by about 0.1126. 


When using this model to predict selling price, the 
predictions will typically be off by about 0.1126. 


11.26% of the variation in selling price is accounted 
for by the linear relationship between selling price 
and appraised value. 


There is a weak, positive linear relationship between 
selling price and appraised value. 


In repeated samples of size 16, the sample slope will 
typically vary from the population slope by about 
O26: 


A 95% confidence interval for the population slope 3 is 
1.0466 + 1.046. (d) 1.0466 + 0.2207. 
1.0466 + 0.2415. (ce) 1.0466 + 0.2400. 
L0406 2 0:2387. 
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24. Which of the following would have resulted in a 
violation of the conditions for inference? 


(a) Ifthe entire sample was selected from one neighborhood 
(b) Ifthe sample size was cut in half 


(c) Ifthe scatterplot of x = appraised value and y = selling 
price did not show a perfect linear relationship 


(d) Ifthe histogram of selling prices had an outlier 


(e) If the standard deviation of appraised values was dif- 
ferent from the standard deviation of selling prices 


Exercises 25 to 28 refer to the following setting. Does the 
color in which words are printed affect your ability to read 
them? Do the words themselves affect your ability to name 
the color in which they are printed? Mr. Starnes designed a 
study to investigate these questions using the 16 students in 
his AP® Statistics class as subjects. Each student performed 
two tasks in a random order while a partner timed: (1) read 
32 words aloud as quickly as possible, and (2) say the color 
in which each of 32 words is printed as quickly as possible. 
‘Try both tasks for yourself using the word list below. 


YELLOW RED BLUE GREEN 
RED GREEN YELLOW YELLOW 
GREEN RED BLUE BLUE 
YELLOW BLUE GREEN RED 
BLUE YELLOW RED RED 
RED BLUE YELLOW GREEN 
BLUE GREEN GREEN BLUE 
GREEN YELLOW RED YELLOW 


25. Color words (4.2) Let’s review the design of the study. 


€ (a) Explain why this was an experiment and not an 


observational study. 


(b) Did Mr. Starnes use a completely randomized design 
or a randomized block design? Why do you think he 
chose this experimental design? 


(c) Explain the purpose of the random assignment in 
the context of the study. 


The data from Mr. Starnes’s experiment are shown below. 
For each subject, the time to perform the two tasks is 
given to the nearest second. 


Subject Words Colors Subject Words Colors 
1 18 20 9 10 16 
2 10 21 10 9 13 
3 15 22 11 11 11 
4 2 2) 12 17 26 
5 13 17 13 15 20 
6 11 13 14 15 15 
i 14 e2 15 12 18 
8 16 21 16 10 18 


26. Color words (1.3) Do the data provide evidence of 
*) adifference in the average time required to perform 
© the two tasks? Include an appropriate graph and 


numerical summaries in your answer. 


27. Color words (9.3) Explain why it is not safe to use 
paired t procedures to do inference about the difter- 
ence in the mean time to complete the two tasks. 


28. Color words (3.1, 3.2, 12.1) Can we use a stu- 
=> dent's word task time to predict his or her color 
© task time? 


(a) Make an appropriate scatterplot to help answer this 
question. Describe what you see. 


(b) Use your calculator to find the equation of the least- 
squares regression line. Define any symbols 
you use. 


(c) Find and interpret the residual for the student who 
completed the word task in 9 seconds. 


(d) Assume that the conditions for performing inference 
about the slope of the true regression line are met. 
The P-value for a test of Hy: 6 = 0 versus H,: 6 > 0 
is 0.0215. Explain what this value means in context. 


Note: John Ridley Stroop is often credited with the 
discovery in 1935 of the fact that the color in which 
“color words” are printed interferes with people’s 
ability to identify the color. The so-called Stroop 
Effect, though, was originally published by German 
researchers in 1929. 


Exercises 29 and 30 refer to the following setting. 
Yellowstone National Park surveyed a random sample of 
1526 winter visitors to the park. They asked each person 
whether he or she owned, rented, or had never used a 
snowmobile. Respondents were also asked whether they 
belonged to an environmental organization (like the Sierra 
Club). The two-way table summarizes the survey responses. 


Environmental Clubs 


No Yes Total 
Never used 445 Ne 657 
Snowmobile renter 497 Ui 574 
Snowmobile owner 279 16 295 
Total 1221 305 1526 


29. Snowmobiles (5.2, 5.3) 


€ (a) If we choose a survey respondent at random, what’s 


the probability that this individual 


(i) isa snowmobile owner? 


(ii) belongs to an environmental organization or 
owns a snowmobile? 


(iii) has never used a snowmobile given that the 
person belongs to an environmental organization? 
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(b) Are the events “is a snowmobile owner” and “belongs 30. Snowmobiles (11.2) Do these data provide con- 
to an environmental organization” independent for vincing evidence at the 5% significance level of an 
the members of the sample? Justify your answer. ~ association between environmental club member- 


ship and snowmobile use for the population of 
visitors to Yellowstone National Park? Justify your 
answer. 


(c) If we choose two survey respondents at random, 
what’s the probability that 


(i) both are snowmobile owners? 


(ii) at least one of the two belongs to an environ- 
mental organization? 


Transforming to Achieve 
Linearity 


WHAT YOU WILL LEARN __ By the end of the section, you should be able to: 


e Use transformations involving powers and roots to find the relationship between two variables, and use the 
a power model that describes the relationship between model to make predictions. 
two variables, and use the model to make predictions. ° Determine which of several transformations does a 
Use transformations involving logarithms to find a better job of producing a linear relationship. 
power model or an exponential model that describes 


In Chapter 3, we learned how to analyze relationships between two quantitative 
variables that showed a linear pattern. When two-variable data show a curved 
relationship, we must develop new techniques for finding an appropriate model. 
This section describes several simple transformations of data that can straighten 
a nonlinear pattern. Once the data have been transformed to achieve linearity, 
we can use least-squares regression to generate a useful model for making pre- 
dictions. And if the conditions for regression inference are met, we can estimate 
or test a claim about the slope of the population (true) regression line using the 
transformed data. 


Health and Wealth 


Straightening out a curved pattern 


The Gapminder Web site, www.gapminder.org, provides loads of data on the 
health and well-being of the world’s inhabitants. Figure 12.9 on the next page is a 
scatterplot of data from Gapminder.'* The individuals are all the world’s nations 
for which data are available. The explanatory variable is a measure of how rich a 
country is: income per person. The response variable is life expectancy at birth. 
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We expect people in richer countries to live longer because they have better ac- 
cess to medical care and typically lead healthier lives. The overall pattern of the 
scatterplot does show this, but the relationship is not linear. Life expectancy rises 
very quickly as income per person increases and then levels off. People in very rich 
countries such as the United States live no longer than people in poorer but not 
extremely poor nations. In some less wealthy countries, people live longer than in 
the United States. 


Four African nations are outliers. Their life expectancies are similar to those of 
their neighbors, but their income per person is higher. Gabon and Equatorial 
Guinea produce oil, and South Africa and Botswana produce diamonds. It may 
be that income from mineral exports goes mainly to a few people and so pulls up 
income per person without much effect on either the income or the life expec- 
tancy of ordinary citizens. That is, income per person is a mean, and we know that 
mean income can be much higher than median income. 


Life expectancy 


80 4 


70 4 


60 


50 4 


eo “es 8 & 
@ 2 %.e ° . ; 


| | East Asia & Pacific 


ia Europe & Central Asia 


= Gabon | | Middle East & North Africa 
( ) L] South Asia 
P| South Africa 
ea Sub-Saharan Africa 
° <—— Equatorial Guinea 
° <——_ Botswana 


Income per person in 2012 


FIGURE 12.9 Scatterplot of the life expectancy of people in many nations against each nation’s 
income per person. The color of each circle indicates the geographic region in which that country 
is located. The size of each circle is based on the population of the country—bigger circles 
indicate larger populations. 


The scatterplot in Figure 12.9 shows a curved pattern. We can straighten things 
out using logarithms. Figure 12.10 (on the facing page) plots the logarithm of 
income per person against life expectancy for these same countries. The effect is 
almost magical. This graph has a clear, linear pattern. 
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FIGURE 12.10 Scatterplot of life expectancy against income per person (on a logarithm scale) for 
many nations. 


Applying a function such as the logarithm or square root to a quantitative vari- 
able is called transforming the data. We will see in this section that understanding 
how simple functions work helps us choose and use transformations to straighten 
nonlinear patterns. 

Transforming data amounts to changing the scale of measurement that was 
used when the data were collected. We can choose to measure temperature in de- 
grees Fahrenheit or in degrees Celsius, distance in miles or in kilometers. ‘These 
changes of units are linear transformations, discussed in Chapter 2. 

Linear transformations cannot straighten a curved relationship between two 
variables. To do that, we resort to functions that are not linear. The logarithm 
function, applied in the “Health and Wealth” example, is a nonlinear function. 
We'll return to transformations involving logarithms later. 


Transforming with Powers and Roots 


When you visit a pizza parlor, you order a pizza by its diameter—say, 10 inches, 
12 inches, or 14 inches. But the amount you get to eat depends on the area of 
the pizza. The area of a circle is 7 times the square of its radius r. So the area of 
a round pizza with diameter x is 


ee (:) - (5) = 
area Tr T 5 T 4 4” 


This is a power model of the form y = ax? with a = m/4 and p = 2. 
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When we are dealing with things of the same general form, whether circles or 
fish or people, we expect area to go up with the square of a dimension such as 
diameter or height. Volume should go up with the cube of a linear dimension. 
That is, geometry tells us to expect power models in some settings. There are other 
physical relationships between two variables that are described by power models. 
Here are some examples from science. 


e The distance that an object dropped from a given height falls is related to time 
since release by the model 


distance = a(time)? 


e The time it takes a pendulum to complete one back-and-forth swing (its 
period) is related to its length by the model 


period = aV length = a(length)!”” 


e The intensity of a light bulb is related to distance from the bulb by the model 


: ‘ d : = 
intensity = designee” = a(distance) : 


Although a power model of the form y = ax? describes the relationship be- 
tween x and y in each of these settings, there is a linear relationship between x? and 
y. If we transform the values of the explanatory variable x by raising them to the p 
power, and graph the points (x, y), the scatterplot should have a linear form. The 
following example shows what we mean. 


Go Fish! 


Transforming with powers 


Imagine that you have been put in charge of organizing a fishing tournament in 
which prizes will be given for the heaviest Atlantic Ocean rockfish caught. You 
know that many of the fish caught during the tournament will be measured and 
released. You are also aware that using delicate scales to try to weigh a fish that is 
flopping around in a moving boat will probably not yield very accurate results. It 
would be much easier to measure the length of the fish while on the boat. What 
you need is a way to convert the length of the fish to its weight. 


You contact the nearby marine research laboratory, and they provide reference 
data on the length (in centimeters) and weight (in grams) for Atlantic Ocean rock- 
fish of several sizes.” 


Length: 5.2 : 143 168 192 213 23.3 
Weight: 2 8 21 38 69 117. 148 = 190) 264) 298 


Length: 82.0 33.0 340 349 36.4 
Weight: 318 «6371 455 504 518) 587) D1 719) 726 ~—S 810 


Weight (g) 
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Figure 12.11] is a scatterplot of the data. Note the clear curved shape. 


Because length is one-dimensional and weight (like volume) is three-dimensional, 
a power model of the form weight = a (length)’ should describe the relationship. 
What happens if we cube the lengths in the data table and then graph weight ver- 
sus length’? Figure 12.12 gives us the answer. This transformation of the explana- 
tory variable helps us produce a graph that is quite linear. 


800 4 < 
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400 4 
300 5 Ye 
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100 5 ¢ 
04 °° 
T T T T T T 
0 10,000 20,000 30,000 40,000 ~—_50,000 


Length (cm) Length* 


Weight (g) 


FIGURE 12.11 Scatterplot of Atlantic Ocean rockfish weight FIGURE 12.12 The scatterplot of weight versus length’ is 


versus length. 


FIGURE 12.13 The scatterplot of 
‘weight versus length is linear. 


linear. 


There’s another way to transform the data in the example to achieve linearity. 


We can take the cube root of the weight values and graph Wweight versus length. 
Figure 12.13 shows that the resulting scatterplot has a linear form. Why does this 
transformation work? Start with weight = a(length)’ and take the cube root of 
both sides of the equation: 


Vv weight = Wa(length)’ 
Vv weight = Va(length) 
That is, there is a linear relationship between length and Wweight. 
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; Weight 
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Length (cm) 


Once we straighten out the curved pattern in the original scatterplot, we fit a 
least-squares line to the transformed data. This linear model can be used to pre- 
dict values of the response variable y. As in Chapter 3, a residual plot tells us if the 
linear model is appropriate. The values of s and r’ tell us how well the regression 
line fits the data. 
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Go Fish! 


Transforming with Powers and Roots 


Here is Minitab output from separate regression analyses of the two sets of trans- 
formed Atlantic Ocean rockfish data. 


Transformation 1: (length’, weight) 
Predictor Coef SE Coef T P 


Constant 4.066 6.902 0.59 0.563 
Length*3 0.0146774 0.0002404 61.07 0.000 


S=18.8412 R-Sq= 99.5% R-Sq(adj) =99.5% 


Transformation 2: (length, “/weight) 


Predictor Coef SE Coef T Pp 
Constant —-0.02204 0.07762 =0:28 0.780 


Length 0.246616 0.002868 86.00 0.000 
S=0.124161 R-Sq=99.8%  R-Sq(adj) =99.7% 


Residual 


PROBLEM: Doeach of the following for both transformations. 
(a) Give the equation of the least-squares regression line. Define any variables you use. 


(b) Suppose a contestant in the fishing tournament catches an Atlantic Ocean rockfish that’s 36 
centimeters long. Use the model from part (a) to predict the fish’s weight. Show your work. 


SOLUTION: 
(a) Transformation 1; weight = 4.066 + 0.0146774 (length*) 


a 
Transformation 2: \/weight = — 0.02204 + 0.246616 (length) 


(b) Transformation 1: weight = 4.066 + 0.0146774(36°) = 688.9 grams 
a 
Transformation 2: ‘/weight = — 0.02204 + 0.246616(36) = 8.856 
weight = 8.856° = 694.6 grams 


For Practice Try Exercise 


Health and weaith of nations 
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When experience or theory suggests that the relationship between two variables 
is described by a power model of the form y = ax’, you now have two strategies for 
transforming the data to achieve linearity. 


1. Raise the values of the explanatory variable x to the p power and plot the 
points (x?, y). 


2. Take the pth root of the values of the response variable y and plot the points 
(x, Wy) 


What if you have no idea what power to choose? You could guess and test until 
you find a transformation that works. Some technology comes with built-in sliders 
that allow you to dynamically adjust the power and watch the scatterplot change 
shape as you do. 

It turns out that there isa much more efficient method for linearizing a curved 
pattern in a scatterplot. Instead of transforming with powers and roots, we use 
logarithms. This more general method works when the data follow an unknown 
power model or any of several other common mathematical models. 


Transforming with Logarithms 


Not all curved relationships are described by power models. For in- 


stance, in the “Health and Wealth” example (page 765), a graph of 
life expectancy versus the logarithm (base 10) of income per person 
showed a linear pattern. We used Fathom software to fit a least-squares 
regression line to the transformed data and to make a residual plot. 
Figure 12.14 shows the results. 

The regression line is 


log_income 


20 ° 
20° 22, 
ere So tagthe fat d RiPRGERE bE 4 8 as 
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log_income 
—Life_expectancy = 19.5 + 13.2iog_income: r* = 0.61 
FIGURE 12.14 Scatterplot with least-squares 
line added and residual plot from Fathom for the 
transformed data about the health and wealth of 
nations. 


predicted life expectancy = 19.5 + 13.2 log(income) 


How well does this model fit the data? The residual plot shows a random 
scatter of prediction errors about the residual = 0 line. Also, because 
? = 0.61, about 61% of the variation in life expectancy is accounted 
for by the linear model using log(income) as the explanatory variable. 
The relationship between life expectancy and income per person is 


described by a logarithmic model of the form y = a + b logx. We can use 
this model to predict how long a country’s citizens will live from how much money 
they make. For the United States, which has income per person of $42,296.20, 


predicted life expectancy = 19.5 + 13.2 log(42,296.20) = 80.567 years 


The actual U.S. life expectancy in 2012 was 78.80 years. 

Taking the logarithm of the income per person values straightened out the 
curved pattern in the original scatterplot. The logarithm transformation can also 
help achieve linearity when the relationship between two variables is described by 
a power model or an exponential model. 


Power Models Biologists have found that many characteristics of living 
things are described quite closely by power models. There are more mice than 
elephants, and more flies than mice—the abundance of species follows a power 
model with body weight as the explanatory variable. So do pulse rate, length of 
life, the number of eggs a bird lays, and so on. 
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Sometimes the powers can be predicted from geometry, but sometimes they 
are mysterious. Why, for example, does the rate at which animals use energy go 
up as the 3/4 power of their body weight? Biologists call this relationship Kleiber’s 
law. It has been found to work all the way from bacteria to whales. The search goes 
on for some physical or geometrical explanation for why life follows power laws. 

To achieve linearity from a power model, we apply the logarithm transforma- 
tion to both variables. Here are the details: 


1. A power model has the form y = ax?, where a and p are constants. 
2. ‘Take the logarithm of both sides of this equation. Using properties of log- 
arithms, we get 
log y = log(ax?) = log a + log(x’) = log a + p log x 
The equation log y = log a + p log x shows that taking the logarithm of both 
variables results in a linear relationship between log x and log y. 
3. Look carefully: the power p in the power model becomes the slope of the 


straight line that links log y to log x. 


If a power model describes the relationship between two variables, a scatterplot 
of the logarithms of both variables should produce a linear pattern. Then we can 
fit a least-squares regression line to the transformed data and use the linear model 
to make predictions. Here’s an example. 


Go Fish! 


Transforming with logarithms 


Let’s return to the fishing tournament from the previous 
800 . example. Our goal remains the same: to find a model for 
ee .— predicting the weight of an Atlantic Ocean rockfish from 
B sag oe its length. We still expect a power model of the form 
= 400 ms weight = a(length)’ based on geometry. Here once again is 
= as a scatterplot of the data from the local marine research lab. 
100 a 
0 Awe oe Earlier, we transformed the data in two ways to try to 
achieve linearity: (1) cubing the length values and (2) tak- 
8 16 24 32 40 “me ; 
ing the cube root of the weight values. This time we’ll use 
Length (cm) : 
logarithms. 


We took the logarithm (base 10) of the values for both variables. Some computer 
output from a linear regression analysis on the transformed data is shown below. 


3.0 
Pa 
i” 0.0504 
25) 
= 2.0 0.0254 
2 = 
) EH 
215 = 
= 0.000 
3 z° 
1} 
0.025 4 
0.5 
0.0 -0.050+ 
0.6 0.8 0 12) 14 1.6 0.6 0.8 0 12) 1.4 1.6 
log(Length) log(Length) 
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Regression Analysis: log(Weight) versus log(Length) 


Predictor Coef SE Coef T Pp 
Constant —1.89940 0.03799 —49.99 0.000 
log (Length) 3.04942 0.02764 PTO. 3 0.000 


S=0.0281823 R-Sq= 99.9% R-Sq(adj) =99.8% 


PROBLEM: 


(a) Based on the output, explain why it would be reasonable to use a power model to describe the 
relationship between weight and length for Atlantic Ocean rockfish. 


(b) Give the equation of the least-squares regression line. Be sure to define any variables you use. 


SOLUTION: 


(a) Ifa power model describes the relationship between two variables xand y, then a linear model 
should describe the relationship between log xand log y. The scatterplot of log(weight) versus 
log(length) has a linear form, and the residual plot shows a fairly random scatter of points about the 
residual = O line. So a power model seems reasonable here. 


Sa 
(b) log(weight) = —1.89940 + 3.04942 log(length) 


For Practice Try Exercise 


If we fit a least-squares regression line to the transformed data, we can find the 


On the T1-83/84, you can “undo” the predicted value of the logarithm of y for any value of the explanatory variable x by 


logarithm using the [2nq] function substituting our x-value into the equation of the line. To obtain the corresponding 
keys. To solve log y = 2, press [2nd prediction for the response variable y, we have to “undo” the logarithm transfor- 
LoG][2|[ENTER | To solveIny=2, mation to return to the original units of measurement. One way of doing this is to 
press | 2nd]/tn][2|[ENTER |, use the definition of a logarithm as an exponent: 


log,a=x=> b* =a 


For instance, if we have log y = 2, then 


logy =2=> logy y=2>10?=y=> 100=y 


If instead we have In y = 2, then 


Iny=2=> log, y=2> e =y => 7.389 =y 


Go Fish! 


Making predictions 


PROBLEM: Suppose a contestant in the fishing tournament catches an Atlantic Ocean rockfish 
that’s 36 centimeters long. Use the model from part (b) of the previous example to predict the fish's 
weight. Show your work. 


SOLUTION: Fora length of 36 centimeters, we have 
log(weight) = —1.89940 + 3.04942 log(36) = 2.8464 
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To find the predicted weight, we use the definition of a logarithm as an exponent: 


SS 
log (weight) = 2.6464 
weight = 107546 = 702.1 


This model predicts that a 36-centimeter-long rockfish will weigh about 702 grams. 


For Practice Try Exercise 


Your calculator and most statistical software will calculate the logarithms of all 
the values of a variable with a single command. The important thing to remember 
is this: if the relationship between two variables is described by a power model, 
then we can linearize the relationship by taking the logarithm of both the explana- 
tory and response variables. 


How do we find the power model for predicting y from x? The 
THINK) | | | 
east-squares line for the transformed rockfish data is 
ABOUT IT —_—— 
log(weight) = — 1.89940 + 3.04942 log(length) 
If we use the definition of the logarithm as an exponent, we can rewrite this equation as 


OE tar = 3 
weight = 10 1.89940 + 3.04942log(length) 
Using properties of exponents, we can simplify this as follows: 


weight = 197189940 ' 193-04942log(length) using the fact that b™h” = pmtn 


eioht = 3.04942 
weight = 10 1.89940 , 1 Qlostlensth) 


weight = 0.0126(length)*?? using the fact that 10!°8* = x 


using the fact that p log x = log x? 


This equation is now in the familiar form of a power model 


7 SN 3.04942 
wetene = reeenet) y = ax? with a = 0.0126 and b = 3.04942. Notice how close the 


power is to 3, as expected from geometry. 
We could use the power model to predict the weight of a 
36-centimeter-long Atlantic Ocean rockfish: 


weight = 0.0126(36)*49? ~ 701.76 grams 


This is the same prediction we got earlier (up to rounding). 
The scatterplot of the original rockfish data with the power 
sO SD 35 4-~—Cs Model added appears in Figure 12.15. Note how well this 
Length (cm) model fits the data! 


FIGURE 12.15 Rockfish data with power model. 


 —__________________________ty 


Exponential Models A linear model has the form y = a + bx. The value 
of y increases (or decreases) at a constant rate as x increases. The slope b describes 
the constant rate of change of a linear model. That is, for each | unit increase in 
x, the model predicts an increase of b units in y. You can think of a linear model 
as describing the repeated addition of a constant amount. Sometimes the relation- 
ship between y and x is based on repeated multiplication by a constant factor. That 
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is, each time x increases by | unit, the value of y is multiplied by b. An exponen- 
tial model of the form y = ab* describes such multiplicative growth. 

Populations of living things tend to grow exponentially if not restrained by out- 
side limits such as lack of food or space. More pleasantly (unless we’re talking 
about credit card debt!), money also displays exponential growth when interest is 
compounded each time period. Compounding means that last period’s income 
earns income in the next period. 


Money, Money, Money 


Understanding exponential growth 


Suppose that you invest $100 in a savings account that pays 6% interest 
compounded annually. After a year, you will have earned $100(0.06) = $6.00 
in interest. Your new account balance is the initial deposit plus the interest 

earned: $100 + ($100)(0.06), or $106. We can rewrite this as $100(1 + 0.06), or 

more simply as $100(1.06). That is, 6% annual interest means that any amount 
on deposit for the entire year is multiplied by 1.06. 


If you leave the money invested for a second year, your new balance will be 
[$100(1.06)|(1.06) = $100(1.06)? = $112.36. Notice that you earn $6.36 in in- 
terest during the second year. That’s another $6 in interest from your initial $100 
deposit plus the interest on your $6 interest earned for Year 1. After x years, your 
account balance y is given by the exponential model y = 100(1.06)*. 


The table below shows the balance in your savings account at the end of each of 


the first six years. Figure 12.16 shows the growth in your investment over 100 years. 
It is characteristic of exponential growth that the increase appears slow for a long 
period and then seems to explode. 


Time x (years) Account 
balance y 

$100.00 

$106.00 

$112.36 

$119.10 
This (years) $126.25 


$133.82 
FIGURE 12.16 Scatterplot of the growth of a $100 $141.85 
investment in a savings account paying 6% interest, 
compounded annually. 


If an exponential model of the form y = ab* describes the relationship between 
x and y, we can use logarithms to transform the data to produce a linear relation- 
ship. Start by taking the logarithm (we'll use base 10, but the natural logarithm In 
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using base e would work just as well). Then use algebraic properties of logarithms 
to simplify the resulting expressions. Here are the details: 


log y = log (ab*) taking the logarithm of both sides 
log y = log a + log (b*) using the property log(mn) = log m + log n 
logy = loga + x logb using the property log m? = p log m 


We can then rearrange the final equation as log y = log a + (log b)x. Notice 
that log a and log b are constants because a and b are constants. So the equation 
gives a linear model relating the explanatory variable x to the transformed vari- 
able log y. Thus, if the relationship between two variables follows an exponential 
model, and we plot the logarithm (base 10 or base e) of y against x, we should 
observe a straight-line pattern in the transformed data. 


© 
Moore’s Law and Computer Chips 
Logarithm transformations and exponential models 


Gordon Moore, one of the founders of Intel Corporation, predicted in 
1965 that the number of transistors on an integrated circuit chip would 
double every 18 months. This is Moore’s law, one way to measure the 
revolution in computing. Here are data on the dates and number of tran- 
sistors for Intel microprocessors:'© 


Processor Date _‘ Transistors 
4004 1971 2,250 
8008 1972 2,500 
8080 1974 5,000 
8086 1978 29,000 
286 1982 120,000 
386 1985 275,000 
zs00000008 486 DX 1989 1,180,000 
ee a Pentium 1993 3,100,000 
leon . Pentium Il 1997 7,500,000 
Pentium Il 1999 24,000,000 
Eee Pentium 4 2000 42,000,000 
so0000000 ° Itanium 2 2003 220,000,000 
fe Itanium 2 w/9MB cache 2004 — 592,000,000 
: 1 ae ~ Dual-core Itanium 2 2006 —_1,700,000,000 
FIGURE 12.17 Scatterplot of the number of SCRE OTA: . we eee 
transistors on a computer chip from 1971 to 2010. 8-core Xeon Nehalem-EX 2010 — 2,300,000,000 


Figure 12.17 shows the growth in the number of transistors on a computer chip 
from 1971 to 2010. Notice that we used “years since 1970” as the explanatory vari- 
able. We'll explain this later. If Moore’s law is correct, then an exponential model 
should describe the relationship between the variables. 
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In(transistors) 


R777 


Residual 


re PROBLEM: 
20 ie (a) Ascatterplot of the natural logarithm (log base ¢ or In) of the number of 
18 ° transistors on a computer chip versus years since 1970 is shown. Based on 
ss a this graph, explain why it would be reasonable to use an exponential model to 
is me describe the relationship between number of transistors and years since 1970. 
10 i (b) Minitab output froma linear regression analysis on the transformed data 
; x is shown below. Give the equation of the least-squares regression line. Be sure 
0 10 20 30 40 to define any variables you use. 
Years since 1970 
Predictor Coef SE Coef T P 
Constant 7.0647 0.2672 26.44 0.000 
Years since 1970 0.36583 0.01048 34.91 0.000 
S=0.544467 R-Sq=98.9% R-Sq(adj) =98.8% 
(c) Use your model from part (b) to predict the number of transistors on an 
ed e Intel computer chip in 2020. Show your work. 
0.5 F (4) Aresidual plot for the linear regression in part (b) is shown at left. Discuss 
wey ° what this graph tells you about the appropriateness of the model. 
ae ' ‘ SOLUTION: 
A a (a) lfan exponential model describes the relationship between two variables x 
“li : : o ; and y, then we expect a scatterplot of (x, In y) to be roughly linear. The scatterplot 
0 10 20 30 40 


Years since 1970 


of In(transistors) versus years since 1970 has a fairly linear pattern, especially 
through the year 2000. So an exponential model seems reasonable here. 


(b) In(transistors) = 7.0647 + 0.36583(years since 1970) 
(c) Because 2020 is 50 years since 1970, we have 


(ee 
In(transistors) = 7.0647 + 0.36583(50) = 25.3562 
To find the predicted number of transistors, we use the definition of a logarithm as an exponent: 


— ——————— 
In(transistors) = 25.3562 = log, (transistors) = 25.3562 


“transistors = 629552 ~ 1.028-10" 


This model predicts that an Intel chip made in 2020 will have about 100 billion transistors. 


(d) The residual plot shows a distinct pattern, with the residuals going from positive to negative to 
positive as we move from left to right. But the residuals are small in size relative to the transformed 
yvalues. Also, the scatterplot of the transformed data is much more linear than the original scat- 
terplot. We feel reasonably comfortable using this model to make predictions about the number of 


transistors ona computer chip. 


For Practice Try Exercise 


Make sure that you understand the big idea here. The necessary transforma- 
tion is carried out by taking the logarithm of the response variable. The crucial 
property of the logarithm for our purposes is that if a variable grows exponentially, 


its logarithm grows linearly. 
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THINK 
ABOUT IT 


How do we find the exponential model for predicting y from x? 
The least-squares line for the transformed data in the computer chip example is 


In (transistors) = 7.0647 + 0.36583 (years since 1970) 


If we use the definition of the logarithm as an exponent, we can rewrite this 
equation as 


Farieistors = 9/.0647 +0.36583(years since 1970) 


Using properties of exponents, we can simplify this as follows: 


ee. 4 eae ah f 
transistors = e! 0647 . 036583 years since 1970) using the fact that b™p" = pm +n 
transistors = e! 0647 F (greene? eas since 1970) using the fact that (b™)” = pm 


transistors = 1169.93 - (1.4417 1)%85ince 1979) simplifying 


This equation is now in the familiar form of an exponential model y = ab* with 
a = 1169.93 and b = 1.44171. 

We could use the exponential model to predict the number of transistors on 
an Intel chip in 2020: transistors = 1169.93(1.44171)° ~ 1.0281 - 10!!. This is 
the same prediction we got earlier. How does this compare with the prediction 
from Moore’s law? Suppose the number of transistors on an Intel computer chip 
doubles every 18 months (1.5 years). Then in the 49 years from 1971 to 2020, the 
number of transistors would double 49/1.5 = 32.67 times. So the predicted num- 
ber of transistors on an Intel chip in 2020 would be 


fransistors = 2250(2)>2° = 1.54- 103 


Moore’s law predicts more rapid exponential growth than our model does. 
.—_—_—_— 1 


The calculation at the end of the Think about It feature might give you some 
idea of why we used years since 1970 as the explanatory variable in the example. 
To make a prediction, we substituted the value x = 50 into the equation for the 
exponential model. This value is the exponent in our calculation. If we had used 
years as the explanatory variable, our exponent would have been 2020. Such a 
large exponent can lead to overflow errors on a calculator. 


Putting It All Together: Which Transformation 
Should We Choose? 


Suppose that a scatterplot shows a curved relationship between two quantitative 
variables x and y. How can we decide whether a power model or an exponential 
model better describes the relationship? The following example shows the strat- 
egy we should use. 
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What’s a Planet, Anyway? 


Power models and logarithm transformations 


On July 31, 2005, a team of astronomers announced that they had discoy- 
ered what appeared to be a new planet in our solar system. They had first 
observed this object almost two years earlier using a telescope at Caltech’s 
Palomar Observatory in California. Originally named UB313, the potential 
planet is bigger than Pluto and has an average distance of about 9.5 billion 
miles from the sun. (For reference, Earth is about 93 million miles from the 
sun.) Could this new astronomical body, now called Eris, be a new planet? 


At the time of the discovery, there were nine known planets in our solar 
system. Here are data on the distance from the sun and period of revo- 
lution of those planets. Note that distance is measured in astronomical 
units (AU), the number of Earth distances the object is from the sun.'” 


Distance from sun Period of revolution 
250 + e Planet (astronomical units) (Earth years) 
2005 Mercury 0.387 0.241 
= 150 4 : Venus 0.723 0.615 
= 100 - : Earth 1.000 1.000 
2! Mars 1.524 1.881 
Cs Jupiter 5.203 11.862 
0-_# 
! Saturn 9.539 29.456 
0 10 20 30 40 
Distance (AU) Uranus 19.191 84.070 
Neptune 30.061 164.810 
FIGURE 12.18 Scatterplot of planetary distance from the sun Pluto 39.529 248.530 


and period of revolution. 


Figure 12.18 is a scatterplot of the planetary data. There appears to be a strong 
curved relationship between distance from the sun and period of revolution. 


In August 2006, the International 


; PROBLEM: The graphs below show the results of two different transformations of the data. 
Astronomical Union agreed on a new 


definition of “planet.” Both Pluto Figure 12.19(a) plots the natural logarithm of period against distance from the sun for all nine plan- 
and Eris were classified as “dwarf ets. Figure 12.19(b) plots the natural logarithm of period against the natural logarithm of distance 
planets.” from the sun for the nine planets. 
6 
5 
= 4 a 
BS BS 
8 3 s 
y ~~ 
s 1 s 
0 
-1 
-2 
0 10 20 30 40 
Distance (AV) In (distance) 
(a) (6) 


FIGURE 12.19 (a) A scatterplot of In(period) versus distance. (b) A scatterplot of In(period) versus In(distance). 
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(a) Explain why a power model would provide a more appropriate description of the relationship 
between period of revolution and distance from the sun than an exponential model. 


(b) Minitab output from a linear regression analysis on the transformed data in Figure 12.19(b) is 


shown below. Give the equation of the least-squares regression line. Be sure to define any variables 
you use. 


Predictor Coef 


SE Coef T Pp 
Constant 0.0002544 0.0001759 1.45 0.191 
In(distance) 1.49986 0.00008 18598 .27 0.000 


S=0.000393364 R-Sq=100.0% R-Sq(adj) =100.0% 


Residual 


In (distance) 


(c) Use your model from part (b) to predict the period of revolution for 
Eris, which is 9,500,000,000/93,000,000 = 102.15 AU from the 
sun. Show your work. 


(4) Aresidual plot for the linear regression in part (b) is shown at left. 
Do you expect your prediction in part (c) to be too high, too low, or about 
right? Justify your answer. 


SOLUTION: 


(a) The scatterplot of In(period) versus distance is clearly curved, so 

an exponential model would not be appropriate. However, the graph of 
In(period) versus In(distance) has a strong linear pattern, indicating that 
a power model would be more appropriate. 


a 
(b) In(period) = 0.0002544 + 1.49986 In(distance) 


(c) Eris’s average distance from the sun is 102.15 AU. Using this value for distance in our model 
from part (b) gives 


— 
In(period) = 0.0002544 + 1.49986 In(102.15) = 6.939 
To predict the period, we have to undo the logarithm transformation: 
period = ¢°9°9 ~ 1032 years 


We wouldn't want to wait for Eris to make a full revolution to see if our prediction is accurate! 

(d) Eris’s value for In(distance) is In(102.15) = 4.626, which would fall at the far right of the 
residual plot, where all the residuals are positive. Because residual = actual y — predicted yseems 
likely to be positive, we would expect our prediction to be too low. 


For Practice Try Exercise 


Period (yr) 


The scatterplot of the original data with the power model 
added appears in Figure 12.20. It seems remarkable that peri- 
od of revolution is closely related to the 1.5 power of distance 
from the sun. Johannes Kepler made this fascinating discovery 
about 400 years ago without the aid of modern technology—a 
result known as Kepler’s third law. 


What if the scatterplots of (log x, log y) and (x, log y) both 


20 25 
Distance (AU) 


FIGURE 12.20 Planetary data with power model. 


look linear? Fit a least-squares regression line to both sets of 
transformed data. Then compare residual plots and look for 
the one with the most random scatter. If the residual plots look 
roughly the same, use the values of s and r’ to decide whether 
a power model or an exponential model is a better choice. 
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oo, 


We have used statistical software to do all the transformations and linear regres- 
sion analysis in this section so far. Now let’s look at how the process works on a 
graphing calculator. 


e 


TRANSFORMING TO ACHIEVE 
TECHNOLOGY | WE ARITY ON THE CALCULATOR 


TI-Nspire instructions in Appendix B; HP Prime instructions on the book’s Web site. 


We'll use the planet data to illustrate a general strategy for performing transformations with logarithms on the TI-83/84 
and T1-89. A similar approach could be used for transforming data with powers and roots. 


TI-83/84 TI-89 
e Enter the values of the explanatory variable in L1/listl and the values of the response variable in L2/list2. Make a 


scatterplot of y versus x and confirm that there is a curved pattern. 
NORMAL FLOAT AUTO REAL RADIAN CL Fis am 
fesiletan|reccdnebrarnlaanlorsulred= 
Fl 


Ploti:LasL2 
o 
2 Oo 
S Oo 
i) . in| 
+ +- + + + 
oe 39: . 241 

R=.387 Y=.244 MAIN RAD AUTO FUNC 


¢ Define L3/list3 to be the natural logarithm (In) of LI/list] and L4/list4 to be the natural logarithm of L2/list2. ‘To 
see whether a power model fits the original data, make a plot of In y (L4/list+) versus In x (L3/list3) and look for 
linearity. To see whether an exponential model fits the original data, make a plot of In y (L4/list+) versus x (L1/list1) 
and look for linearity. 


x -.949331 uct -1.42296 = yet 71.42296 


pace. ee Op He EP EE Sl 
eee yarns ease? yarns USE €398 OR TYPE * CESCI=CANCEL USE €3¢4 OR TYPE * (ESCI=CANCEL 


e Ifa linear pattern is present, calculate the equation of the least-squares regression line and store it in Y1. For the 
planet data, we executed the command LinReg (a+bx) L3,L4, Y1. 


NORMAL FLOAT AUTO REAL RADIAN CL 
Fi Fer [Feri rcheecleen lee 


y=atbx 


a=2.5444423e -4 
b=1.499860986 
r2=, 9999999798 
r=. 9999999899 


oe FI ee Ie 
list4 [1 J=-1. 4229583454915 
MAIN RAD AUTO FUNC 478 


Construct a residual plot to look for any departures from the linear pattern. For Xlist, enter the list you used as the 
explanatory variable in the linear regression calculation. For Ylist, use the RESID list stored in the calculator. For the 
planet data, we used L3/list3 as the Xlist. 
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Ploti:LosRESID 


NORMAL FLOAT AUTO REAL RADTAN CL 
Ta 
bat 


# 


Po 
woe. 949531 tyct 0006S) 


X=".9493306 Y=6.511ZE74 USE #4 OR TYPE + CESCJSCANCEL 


e ‘To make a prediction for a specific value of the explanatory variable, compute log x or In x, if appropriate. Then use 
Y1(k) to obtain the predicted value of log y or In y. To get the predicted value of y, use 10" Ans or e* Ans to undo the 
logarithm transformation. Here’s our prediction of the period of revolution for Eris, which is at a distance of 102.15 


AU from the sun: 
FELIS Talal 
Teelshaldeb oth 
1n(102.15) : rl = a 
weqaney 4.626442321. elnc¢i92.15) 4.62644232126 
6. 939274784 wyi¢4.6264423212636) 
ayy Be PRATT, 8 S392 7478401 
| seseposnssaveneacseseieveseetOBen OL OOS: a 2. 9592747840111 
1032, 02150501 
MAIN Rat AUTO FUNC 3/30 


CHECK YOUR UNDERSTANDING 


One sad fact about life is that we'll all die someday. Many adults plan ahead for their even- 
tual passing by purchasing life insurance. Many different types of life insurance policies 
are available. Some provide coverage throughout an individual’s life (whole life), while 
others last only for a specified number of years (term life). The policyholder makes regular 
payments (premiums) to the insurance company in return for the coverage. When the in- 
sured person dies, a payment is made to designated family members or other beneficiaries. 

How do insurance companies decide how much to charge for life insurance? They rely 
on a staff of highly trained actuaries—people with expertise in probability, statistics, and 
advanced mathematics—to establish premiums. For an individual who wants to buy life 
insurance, the premium will depend on the type and amount of the policy as well as on 
personal characteristics like age, sex, and health status. 

The table shows monthly premiums for a 10-year term-life insurance policy worth 


$1,000,000.'8 
Age (years) Monthly premium 
40 $29 
45 $46 
50 $68 
55 $106 
60 $157 


65 $257 


7 
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The Fathom screen shots below show three possible models for predicting monthly 
premium from age. Option | is based on the original data, while Options 2 and 3 involve 
transformations of the original data. Each screen shot includes a scatterplot with a least- 
squares regression line added and a residual plot. 


insurance 


37 J 3.9 
Age : Inage 


[= Premium = -343 - §.63Age. >= 0.90 — hnpremium = -12 98 + 4.416inage; +2 = 0.99 — inpremium « -0.063 + 0.08S94g¢: #2 = 1.00 


OPTION 1 OPTION 2 OPTION 3 
1. Use each model to predict how much a 58-year-old would pay for such a policy. 
Show your work. 


2. What type of function—linear, power, or exponential —best describes the 
relationship between age and monthly premium? Explain. 


e 


Do Longer Drives Mean Lower 
Scores on the PGA Tour? 


In the chapter-opening Case Study (page 737), we examined data on the mean 
drive distance (in yards) and mean score per round for an SRS of 19 of the 197 
players on the PGA Tour in a recent year. Here is some Minitab output from 
a least-squares regression analysis on these data: 


Coef SE Coef T Pp 
76.904 3.808 20.20 0.000 
Avg. distance —0.02016 O.01379 =], 53 0.145 


Predictor 


Constant 


S=0.618396 R-Sq=12.1% R-Sq(adj) =6.9% 70.5 


280 290 


Mean distance (yards) 


300 
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Frequency 


157 
. 
1.04 
. 
. . 
3 0.54 e 
.—lcra 
0.0 7 . - = 
° o 5 @ 
0.54 > 
. 
———EEeEeE—————————————Es 
270 280 290 300 310 
Avg. distance 


1. Calculate the residual for the player with a mean drive distance of 
275.4 yards and a mean score per round of 72.1. Show your work. 

2. Interpret the value of s in this setting and explain what parameter 
s 1s estimating. 

3. Do these data give convincing evidence at the a = 0.05 level that 
the slope of the population regression line is negative? 

4. Which kind of mistake—a Type I error or a Type II error—could 
you have made in Question 3? Justify your answer. 


Summary 


Curved relationships between two quantitative variables can sometimes be 
changed into linear relationships by transforming one or both of the vari- 
ables. Once we transform the data to achieve linearity, we can fit a least- 
squares regression line to the transformed data and use this linear model to 
make predictions. 


When theory or experience suggests that the relationship between two vari- 
ables follows a power model of the form y = ax?, there are two transforma- 
tions involving powers and roots that can linearize a curved pattern in a scat- 
terplot. Option |: Raise the values of the explanatory variable x to the power p, 
then look at a graph of (x?, y). Option 2: Take the pth root of the values of the 
response variable y, then look at a graph of (x,Wy) 


Another useful strategy for straightening a curved pattern in a scatterplot is to 
take the logarithm of one or both variables. When a power model describes 
the relationship between two variables, a plot of log y (In y) versus log x (In x) 
should be linear. 


In a linear model of the form y = a + bx, the values of the response variable 
are predicted to increase by a constant amount b for each increase of | unit 
in the explanatory variable. For an exponential model of the form y = ab*, 
the predicted values of the response variable are multiplied by a factor of b 
for each increase of | unit in the explanatory variable. When an exponential 
model describes the relationship between two variables, a plot of log y (In y) 
versus x should be linear. 


31. 


(a) 
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TECHNOLOGY 
CORNER 


TI-Nspire Instructions in Appendix B; HP Prime instructions on the book’s Web site. 


30. Transforming to achieve linearity on the calculator 


Exercises 


The swinging pendulum Mrs. Hanrahan’s 
precalculus class collected data on the length (in 
centimeters) of a pendulum and the time (in sec- 
onds) the pendulum took to complete one back-and- 
forth swing (called its period). Here are their data: 


Length (cm) Period (s) 
16.5 0.777 
WS 0.839 
196 0.912 
22.5 0.878 
28.5 1.004 (©) 
hss) 1.087 
34.5 1.129 
SiO Tal 
43.5 1.290 
46.5 EO wal 
106.5 ZA 
Make a reasonably accurate scatterplot of the data 
by hand, using length as the explanatory variable. 
Describe what you see. 
The theoretical relationship between a pendulum’s 
length and its period is 
Be 


Z 
period = ve V length 


where g is a constant representing the acceleration 
due to gravity (in this case, g = 980 cm/s”). Use the 
following graph to identify the transformation that 

was used to linearize the curved pattern in part (a). 


page 781 


Use the following graph to identify the transforma- 
tion that was used to linearize the curved pattern in 


part (a). 


20 30 40 50 6 70 80 9% 100 110 
Length (cm) 


Boyle’s law If you have taken a chemistry or physics 
class, then you are probably familiar with Boyle’s 
law: for gas in a confined space kept at a constant 
temperature, pressure times volume is a constant (in 
symbols, PV = k). Students collected the following 
data on pressure and volume using a syringe and a 
pressure probe. 
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Volume (cubic centimeters) | Pressure (atmospheres) = 
6 2.9589 oc) ° ‘ 
8 2.4073 . m . 
5 0.00 7 
10 1.9905 3 ne. ° 
12 1.7249 -0.05 
14 1.5288 : 
16 1.3490 12 a le eee 
18 1.2223 ee 
20 1.1201 


Transformation 2: (length, period’) 
Predictor  Coeft SHECoct P 
Constant —(,1S465 O.05802 —2.57 0.026 


(a) Make a reasonably accurate scatterplot of the data 
by hand using volume as the explanatory variable. 
Describe what you see. 


(b) Ifthe true relationship between the pressure and Bee SL eee leer toe de 


volume of the gas is PV = k, we can divide both sides S=0.105469 R-Sq=99.2% R-Sq(adj) =99.1% 
of this equation by V to obtain the theoretical model 
P=RkVW, orP = K(1/V). Use the graph below to 


identity the transformation that was used to linearize . A 
the curved pattern in part (a). ie 
3 5 ‘ : 
i 0.0 . 
04 
0.2 


20 30 40 50 60 70 80 90 100 110 
Length (cm) 


Pressure (atm) 


Do each of the following for both transformations. 


0.050 0.075 0.100 0125 0.150 0.175 (a) 


av Give the equation of the least-squares regression line. 


Define any variables you use. 


(c) Use the graph below to identify the transformation that 


was used to linearize the curved pattern in part (a). (b) Use the model from part (a) to predict the period of a 


pendulum with length 80 centimeters. Show your work. 


09! - 34. Boyle’s law Refer to Exercise 32. Here is Minitab 
on] . output from separate regression analyses of the two 
63 . sets of transformed pressure data: 
$ 06 ° 
oS 7 Transformation 1: Se pressure 
044 ° volume 
co? Predictor Coeft SE Coef T P 
5.0 75 10.0 12.5 15.0 17.5 20.0 
Volaine (cube centiaceters) | Constant 0.36774 0.04055 2,07 0.000) 
oo 1/V 15.8994 0.4190 SH._95)  0).000 
33. The swinging pendulum Refer to Exercise 31. Here S=0.044205 R-Sq=99.6% R-Sq(adj) =99.5% 


a] 770 is Minitab output from separate regression analyses 
& of the two sets of transformed pendulum data: 


0.050 e 


0.025 ° 


Transformation 1: (Vlength, period) 
Predictor Cock SH Cock P 


0.000 


Residual 
. 


0.025) , 
Constante — O08 59 40m 5046s Ol Onda 3) ‘ 
-0.050 
sqrt OR 2099993 0S 008322) 252-23) 001010) e 
-0.0754__ zs 2 = = 
(length) 0.050 0.075 0.100 0.125 0.150 0.175 


iv 


S=0.0464223 R-Sq=98.6% R-Sq(adj) =98.5% 
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Transformation 2: (volume, 1) 
pressure 
Predictor Coef SE Coef aL P 
Constant 0.100170 O,0C3 779 26.51 0.000 
Volume OMOSISAskoM O00 27 4 eras 223 0210100 


S=0.003553 


R-Sq=100.0% R-Sq(adj) =100.0% 


Residual 
> Oo 9 
38 


-0,003 ‘ ‘ 


a 


50 75 100 125 150 175 200 
Volume (cubic centimeters) 


Do each of the following for both transformations. 


Give the equation of the least-squares regression line. 
Define any variables you use. 


Use the model from part (a) to predict the pressure 
in the syringe when the volume is 17 cubic centime- 
ters. Show your work. 


The swinging pendulum Refer to Exercise 31. We 
took the logarithm (base 10) of the values for both 
variables. Some computer output from a linear regres- 
sion analysis on the transformed data is shown below. 


Regression Analysis: log(Period) versus log(Length) 


Predictor Coef SE Gocki 1 Pp 
Constant =(372575 O.03808 =19,55 ©. 00d 
log (Length) OQ Sakyoal O02 'sabal 2O<'59) 0) OO) 
S= 00185568 R-Sq=97 .9% R-Sq(adj) =97.7% 
0.3 
0.2 
2 
é 0.1 
3 
0.0 
-0.1 
12! a3 1.4 15) 1.6 He/ 1.8 1.9 2.0 Pal 
log(Length) 
0.03 
0.02 
0.01 
3 0.00 
3 
-0.01 
-0.02 
-0.03 


SL elec ee bot ed) Os = 7d od Oe ed 
log(Length) 


36. 


Predictor 
Constant 
log(Volume) —0.81344 


S=0.00486926 


BO: 
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Based on the output, explain why it would be reason- 
able to use a power model to describe the relation- 
ship between the length and period of a pendulum. 


Give the equation of the least-squares regression line. 
Be sure to define any variables you use. 


Boyle’s law Refer to Exercise 32. We took the loga- 
rithm (base 10) of the values for both variables. Some 
computer output from a linear regression analysis on 
the transformed data is shown below. 


Regression Analysis: log(Pressure) versus log(Volume) 
Close SPECOC EE P 

se Bt I 29 OO Ls 99.39 (0,000 
O,01020 =—79.73 0.000 


R-Sq=99.9% R-Sq(adj) =99.9% 


log(Pressure) 


0.8 0.9 1.0 leak a2, ils} 
log(Volume) 


Residual 


0.8 0.9 1.0 ast a2 ag 
log(Volume) 


Based on the output, explain why it would be reason- 
able to use a power model to describe the relation- 
ship between pressure and volume. 


Give the equation of the least-squares regression line. 
Be sure to define any variables you use. 


The swinging pendulum Use your model from 
Exercise 35 to predict the period of a pendulum with 
length 80 centimeters. Show your work. 


Boyle’s law Use your model from Exercise 36 to 
predict the pressure in the syringe when the volume 
is 17 cubic centimeters. Show your work. 


Brawn versus brain How is the weight of an 
animal’s brain related to the weight of its body? 
Researchers collected data on the brain weight (in 
grams) and body weight (in kilograms) for 96 species 
of mammals.!” The following figure is a scatterplot of 


788 


40. 


41, 
776 


CHAPTER 12 MORE ABOUT REGRESSION 


the logarithm of brain weight against the logarithm 
of body weight for all 96 species. ‘The least-squares 
regression line for the transformed data is 


eS 
log y = 1.01 + 0.72 log x 


e 
oe 
e id x 
a Seal Ze 
= e* e 
= o i * 
wo eee 
= © 0% eo 
& 2- oe & ~ 
i Pee ee fe 
2 Bo es 
S PI i ted 
g ya 
E y 
ale 2 a 
x ae le 
e eee ‘ * 
oj «°° 
rede 
T T T T T 
=i 0 1 2} 3 


Logarithm of body weight 


Based on footprints and some other sketchy evi- 
dence, some people believe that a large apelike ani- 
mal, called Sasquatch or Bigfoot, lives in the Pacific 
Northwest. His weight is estimated to be about 280 
pounds, or 127 kilograms. How big is Bigfoot’s brain? 
Show your method clearly. 


Determining tree biomass It is easy to measure 

the “diameter at breast height” of a tree. It’s hard to 
measure the total “aboveground biomass” of a tree, 
because to do this you must cut and weigh the tree. 
The biomass is important for studies of ecology, so 
ecologists commonly estimate it using a power model. 
Combining data on 378 trees in tropical rain forests 
gives this relationship between biomass y measured in 
kilograms and diameter x measured in centimeters:”” 


In y = —2.00 + 2.42 Inx 


Use this model to estimate the biomass of a tropical 
tree 30 centimeters in diameter. Show your work. 


Killing bacteria Expose marine bacteria to X-rays 
for time periods from | to 15 minutes. Here are the 
number of surviving bacteria (in hundreds) on a 
culture plate after each exposure time:7! 


Time t County Time t County 

1 305 9 56 
2 211 10 38 
3 197 11 36 
4 166 12 32 
5 142 13 21 
6 106 14 19 
i 104 15 18) 
8 60 


(a) Make a reasonably accurate scatterplot of the data 
by hand, using time as the explanatory variable. 
Describe what you see. 


(b) Ascatterplot of the natural logarithm of the number 
of surviving bacteria versus time is shown below. 
Based on this graph, explain why it would be reason- 
able to use an exponential model to describe the 
relationship between count of bacteria and time. 


Time 


(c) Minitab output from a linear regression analysis on 
the transformed data is shown below. 


Predictor Coef SE Coef T P 
Constant 5.97316 0.05978 99.92 0.000 
Time —0 AIA As 0 0057/5 —S3.4a2 0, WOW 


S=0.110016 R-Sq=98.8% R-Sq(adj) =98.7% 
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Give the equation of the least-squares regression line. 
Be sure to define any variables you use. 


(d) Use your model to predict the number of surviving 
bacteria after 17 minutes. Show your work. 


42. Light through the water Some college students 
collected data on the intensity of light at various 
depths in a lake. Here are their data: 


Depth (m) Light intensity (lumens) 
5 168.00 
6 120.42 
i 86.31 
8 61.87 
9 44.34 
10 31.78 
11 22.18 


(a) 


(b) 


(c) 
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Make a reasonably accurate scatterplot of the data 
by hand, using depth as the explanatory variable. 
Describe what you see. 


A scatterplot of the natural logarithm of light in- 
tensity versus depth is shown below. Based on this 
graph, explain why it would be reasonable to use 
an exponential model to describe the relationship 
between light intensity and depth. 


a ow 
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Minitab output from a linear regression analysis on 
the transformed data is shown below. 


Predictor Coef SE Coef T P 
Constant 6.78910 0.00009 78575.46 0.000 
Depth (m) —0.333021 0.000010 —31783.44 0.000 


S=0.000055 


R-Sq=100.0% R-Sq(adj) =100.0% 
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Give the equation of the least-squares regression line. 


Be sure to define any variables you use. 


Use your model to predict the light intensity at a 
depth of 12 meters. Show your work. 


Follow the bouncing ball Students in Mr. Handford’s 
class dropped a kickball beneath a motion detector. 
The detector recorded the height of the ball as it 


bounced up and down several times. Here are the 
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Here is a scatterplot of the data: 
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(a) The following graphs show the results of two 
different transformations of the data. Would an 
exponential model or a power model provide a better 
description of the relationship between bounce 
number and height? Justify your answer. 
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(b) Minitab output from a linear regression analysis on 
the transformed data of log(height) versus bounce 
number is shown below. Give the equation of the 
least-squares regression line. Be sure to define any 
variables you use. 


heights of the ball at the highest point on the first five 


bounces: 


Bounce number 


Height (ft) 


Predictor Coef SE Coef T P 
Constant 0.45374 0) Om Beis) 3A 7S OOO@ 
Bounce =O) L750) 0 OWING —AE.08 0 OOO) 


S=0.0132043 


R-Sq=99.6% R-Sq(adj) =99.5% 


2.240 


(c) 


Use your model from part (b) to predict the highest 


1 
2 
3 
4 
5 


1.620 
1.235 
0.958 
0.756 


point the ball reaches on its seventh bounce. Show 
your work. 


(d) Aresidual plot for the linear regression in part (b) is 
shown on the next page. Do you expect your prediction 
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in part (c) to be too high, too low, or about right? Justify 
your answer. 


Residual 


44, Counting carnivores Ecologists look at data to 
learn about nature’s patterns. One pattern they have 
found relates the size of a carnivore (body mass in 
kilograms) to how many of those carnivores there 
are in an area. A good measure of “how many” is to 
count carnivores per 10,000 kilograms (kg) of their 
prey in the area. The table below gives data for 25 


6 F 2 
Carnivore species.”” 


Body mass Abundance 
Carnivore species (kg) (per 10,000 kg of prey) 
Least weasel 0.14 1656.49 
Ermine 0.16 406.66 
Small Indian mongoose 0.55 514.84 
Pine marten ils 31.84 
Kit fox 2.02 15.96 
Channel Islands fox 2.16 145.94 
Arctic fox 3.19 21.63 
Red fox 4.6 CP 
Bobcat 10.0 9.75 
Canadian lynx V2 4.79 
European badger 13.0 3.5) 
Coyote 180) Iles 
Ethiopian wolf 14.5 2.70 
Eurasian lynx 20.0 0.46 
Wild dog 25.0 1.61 
Dhole 25.0 0.81 
Snow leopard 40.0 1.89 
Wolf 46.0 0.62 
Leopard 46.5 6.17 
Cheetah 50.0 2.29 
Puma 51.9 0.94 
Spotted hyena 58.6 0.68 
Lion 142.0 3.40 
Tiger 181.0 0.33 
Polar bear 310.0 0.60 


Here is a scatterplot of the data. 


prey) 
$8 


Abundance (per 10,000 kg prey’ 


(a) The following graphs show the results of two 
different transformations of the data. Would an 
exponential model or a power model provide a better 
description of the relationship between body mass 
and abundance? Justify your answer. 
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log(abundance) 
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log(body mass) 


(b) Minitab output from a linear regression analysis 
on the transformed data of log(abundance) versus 
log(body mass) is shown below. Give the equation 
of the least-squares regression line. Be sure to define 
any variables you use. 


Predictor Coef SHICoek Vr P 
Constant eS 503 0.1342 14253 O.000 
log(body —1.04811 0.09802 -—10.69 0.000 
mass) 


S=0.423352 R-Sq=83.3% R-Sq(adj) =82.5% 


(c) Use your model from part (b) to predict the 
abundance of black bears, which have a body mass 
of 92.5 kilograms. Show your work. 


(d) A residual plot for the linear regression in part (b) is 
shown at top right. Explain what this graph tells you 
about the appropriateness of the model. 


Section 12.2 Transforming to Achieve Linearity de, ; aot 


a (a) Make an appropriate scatterplot for predicting hori- 
: zontal distance traveled from ramp height. Describe 
> CR, ° what you see. 
: 0.0 ——*? _— (b) Use transformations to linearize the relationship. 
@ , a Does the relationship between distance and height 
ee , : seem to follow an exponential model or a power 
x — model? Justify your answer. 
10 -05° 060 O05 10 15 20 25 y 
log(body mass) (c) Perform least-squares regression on the transformed 
data. Give the equation of your regression line. De- 
45. Heart weights of mammals Here are some data on fine any variables you use. 


the h f vari 1s : 
ope oe (d) Use your model from part (c) to predict the 


Length of cavity of left horizontal distance a ball would travel if the ramp 
Mammal ventricle (cm) Heart weight (g) height was 700. Show your work. 
Mouse 0.55 0.13 Multiple Choice: Select the best answer for Exercises +7 
Rat 1.0 0.64 to 50. 
ba ae ae 47. Suppose that the relationship between a response 
Dog 4.0 102 : : : 
a BG oe variable y and an explanatory variable x is modeled 
ae by y = 2.7(0.316)". Which of the following scatter- 
Ox 12.0 2030 1 : . . 
plots would approximately follow a straight line? 
Horse 16.0 3900 
(a) A plot of y against x 
(a) Make an appropriate scatterplot for predicting heart (b) A plot of y against log x 
weight from length. Describe what you see. (c) A plot of log y against x 
(b) Use transformations to linearize the relationship. (d) A plot of log y against log x 
Does the relationship between heart weight and ; 
length seem to follow an exponential model or a (c) A plot of Vy against x. 


power model? Justify your answer. 48. Some high school physics students dropped a ball 
and measured the distance fallen (in centimeters) at 
various times (in seconds) after its release. If you 
have studied physics, then you probably know that 
the theoretical relationship between the variables is 


(c) Perform least-squares regression on the transformed 
data. Give the equation of your regression line. 
Define any variables you use. 


(d) Use your model from part (c) to predict the heart distance = 490(time)’. A scatterplot of the students’ 
weight of a human who has a left ventricle 6.8 centi- data showed a clear curved pattern. At 0.68 seconds 
meters long. Show your work. after release, the ball had fallen 220.4 centimeters. 


How much mote or less did the ball fall than the 


46. Galileo’s experiment Galileo studied motion by be Ue ee men ee 
eoretical model predicts? 


rolling balls down ramps. He rolled a ball down a 
ramp with a horizontal shelf at the end of it so that (a) More by 226.576 centimeters 

the ball was moving horizontally when it started to ; 

fall off the shelf. The top of the ae was placed at De Toray o iaiccarmnits 

different heights above the floor (that is, the length of (c) No more and no less 

the ramp varied), and Galileo measured the horizon- 5 

tal Re the i traveled before it hit the floor. ep 207 acealc 

Here are Galileo’s data. (We won’t try to describe the ( 

obsolete seventeenth-century units Galileo used to 49. A scatterplot of x = Super Bowl number and 
measure distance and height.)* y = cost of a 30-second advertisement on the Super 
Bowl broadcast (in dollars) shows a strong, posi- 
tive, nonlinear association. A scatterplot of In(cost) 


e) Less by 6.176 centimeters 


Height Distance 


1000 = 1500 versus Super Bow! number is roughly linear. ‘The 
828 1340 least-squares regression line for this association is 
800 1328 In(cost) = 10.97 + 0.0971 (Super Bowl number). 
600 1172 Predict the cost of a 30-second advertisement for 


300 800 Super Bowl 40. 
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(a) $3 (d) $83,132 Do you have a tattoo? 
(b) $15 (e) $2,824,947 

(c) $58,153 


50. A scatterplot of y versus x shows a positive, nonlin- 
ear association. ‘lwo different transformations are 
attempted to try to linearize the association: using 
the logarithm of the y values and using the square 
root of the y values. ‘Two least-squares regression 
lines are calculated, one that uses x to predict 
log(y) and the other that uses x to predict Vy. 
Which of the following would be the best reason 
to prefer the least-squares regression line that uses 
x to predict log(y)? 


Exercises 53 and 54 refer to the following setting. About 1100 


2. 

aC aoa lee high school teachers attended a weeklong summer institute 

(b) ‘The standard deviation of the residuals is smaller. for teaching AP® classes. After hearing about the survey 

(OM Tesons Reale: in Exercise 52, the teachers in the AP® Statistics class 
wondered whether the results of the tattoo survey would 

(d) The residual plot has more random scatter. be similar for teachers. They designed a survey to find out. 

(e) Whe distibution of residualsis more Nomnall The class opted to take a random sample of 100 teachers at 


ad.. 


— 


Shower time (1.3, 2.2, 6.3, 7.3) Marcella takes a 
shower every morning when she gets up. Her time 
in the shower varies according to a Normal distribu- 
tion with mean 4.5 minutes and standard deviation 


the institute. One of the questions on the survey was 


Do you have any tattoos on your body? 


(Circle one) YES NO 


Ne renee 53. ‘Tattoos (8.2, 9.2) Of the 98 teachers who responded, 
e 23.5% said that they had one or more tattoos. 

(rating pao eebil ty Wel Nl eeceT aisles 2355) ve (a) Construct and interpret a 95% confidence interval 
tween 3 and 6 minutes on a randomly selected day. fondle etal proportion oiler leceacthe 
Suoweyormnors AP® institute who would say they had tattoos. 

(b) y fell por EU Mate NOES iI i pote (b) Does the interval in part (a) provide convincing evi- 
shed acon eutie: bythe orale ayo dence that the proportion of teachers at the institute 
cies with tattoos is not 0.14 (the value cited in the Harris 

(c) Suppose we choose 10 days at random and record Poll report)? Justify your answer. 
he eae ee ele oe (c) ‘Two of the selected teachers refused to respond to 
Be jane : ihe eo oheeee the survey. If both of these teachers had responded, 
es ee y could your answer to part (b) have changed? Justify 

your answer. 

@ Peta he He ae) 54. ‘Tattoos (4.1) One of the first decisions the class had 

» ; > to make was what kind of sampling method to use. 
Show your work. € 
(a) ‘They knew that a simple random sample was the 
52. ‘Tattoos (8.2) What percent of U.S. adults have 


one or more tattoos? ‘The Harris Poll conducted an 
online survey of 2302 adults during January 2008. 
According to the published report, “Respondents for 
this survey were selected from among those who have 
agreed to participate in Harris Interactive surveys.””” 
The pie chart at top right summarizes the responses 
from those who were surveyed. Explain why it would 
not be appropriate to use these data to construct a 
95% confidence interval for the proportion of all 
US. adults who have tattoos. 


(b) 


“preferred” method. With 1100 teachers in 

40 different sessions, the class decided not to use an 
SRS. Give at least two reasons why you think they 
made this decision. 


The AP® Statistics class believed that there might 
be systematic differences in the proportions of 
teachers who had tattoos based on the subject areas 
that they taught. What sampling method would you 
recommend to account for this possibility? Explain a 
statistical advantage of this method over an SRS. 
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Free Response AP® Problem, Yay! 


The following problem is modeled after actual AP® Statistics exam 
free response questions. Your task is to generate a complete, con- 
cise response in 15 minutes. 


Directions: Show all your work. Indicate clearly the methods 
you use, because you will be scored on the correctness of your 
methods as well as on the accuracy and completeness of your 
results and explanations. 


A random sample of 14 golfers was selected from the 
147 players on the Ladies Professional Golf Association 
(LPGA) tour in a recent year. The total amount of money 
won during the year (in dollars) and the scoring average 
for each player in the sample was recorded. Lower scoring 
averages are better in golf. 

The scatterplot below displays the relationship between 
money and scoring average for these 14 players. 


1800000 4 
1600000 + 
1400000 5 
1200000 5 
1000000 5 
800000 + 
600000 + 
400000 + 
200000 + A ; A . 
0 e 


2 B 
Scoring average 


Explain why it would not be appropriate to con- 
struct a confidence interval for the slope of the 
least-squares regression line relating money to 
scoring average. 


A scatterplot of the natural logarithm of money versus 
scoring average is shown at top right along with some com- 


puter output for a least-squares regression using the trans- 
formed data. 


In(Money) 


2 B 
Scoring average 


SE Coef 
Constant LI e537 7.035 Li... 02 0.000 
—0.90470 0.09679 =9.,.35 0.000 


Predictor Coef 


Scoring 
average 


S=0.475059 R-Sq=87.9%  R-Sq(adj) =86.9% 

(b) Predict the amount of money won for an LPGA 
golfer with a scoring average of 70. 

(c) Calculate and interpret a 95% confidence interval 
for the slope of the least-squares regression line re- 
lating In(money) to scoring average. Assume that 


the conditions for inference have been met. 


After you finish, you can view two example solutions on the book’s 
Web site (www.whfreeman.com/tps5e). Determine whether you 
think each solution is “complete,” “substantial,” “developing,” or 
“minimal.” If the solution is not complete, what improvements would 
you suggest to the student who wrote it? Finally, your teacher will 
provide you with a scoring rubric. Score your response and note 
what, if anything, you would do differently to improve your own 
score. 
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Chapter Review 


Section 12.1: Inference for Linear Regression 


In this section, you learned how to conduct inference about 
the slope of a population (true) least-squares regression 
line. The sampling distribution of the sample slope b is the 
foundation for doing inference about the population (true) 
slope 3. When the conditions are met, the sampling distri- 
bution of b has an approximately Normal distribution with 


on 

There are five conditions for performing inference 
about a population (true) slope. Remember them with the 
acronym LINER. 


mean jt, = @ and standard deviation o, = 


e ‘The Linear condition says that the mean value of the re- 
sponse variable /1, falls on the population (true) regression 
line 44, = a + Gx. To check the Linear condition, verify 
that there are no leftover patterns in the residual plot. 


e ‘The Independent condition says that individual observa- 
tions are independent of each other. To check the Inde- 
pendent condition, verify that the sample size is less than 
10% of the population size. Also, convince yourself that 
knowing the response for one individual won't help you 
predict the response for another individual. 


e ‘The Normal condition says that the distribution of y val- 
ues is approximately Normal for each value of x. To check 
the Normal condition, graph a dotplot, histogram, or Nor- 
mal probability plot of the residuals and verify that there 
are no outliers or strong skewness. 


e ‘The Equal SD condition says that for each value of x, 
the distribution of y should have the same standard de- 
viation. ‘To check the Equal SD condition, verify that 
the residuals have roughly the same amount of scatter 
around the residual = 0 line for each value of x on the 
residual plot. 


e ‘The Random condition says that the data are from a ran- 
dom sample or a randomized experiment. ‘To check the 
Random condition, verify that randomness was properly 
used in the data collection process. 


To construct and interpret a confidence interval for the 
slope of the population (true) least-squares regression line, 
follow the familiar four-step process. The formula for the 
confidence interval is b + t*SE,, where ¢* is the ¢ critical 
value with df =n — 2. The standard error of the slope SE, 
describes how far the sample slope typically varies from the 


population (true) slope in repeated random samples or ran- 
dom assignments. die formula for the standard error of the 
Swi — || 
typically provided with standard computer output for least- 
squares regression. 

In most cases, when you conduct a significance test for 
the slope of the population (true) least-squares regression 
line, the null hypothesis is Hp:3 = 0. This hypothesis says 
that a straight-line relationship between x and y is of no 
value for predicting y. To do the calculations, use the test 


= Bo 


slope is SE, = . The standard error of the slope is 


with df = n- 2. The value of the test sta- 


statistic t = 


tistic, along with a two-sided P-value, is typically provided 
with standard computer output for least-squares regression. 


Section 12.2: Transforming to Achieve Linearity 


When the association between two variables is nonlinear, 
transforming one or both of the variables can result in a lin- 
ear association. 

If the association between two variables follows a power 
model in the form y = ax?, there are several transforma- 
tions that will result in a linear association. 


e Raise the values of x to the power of p and plot y versus x?. 
¢ Calculate the pth root of the y values and plot Wy versus x. 


e Calculate the logarithms of the x values and the y values 
and plot log(y) versus log(x). You can use base 10 loga- 
rithms (log) or base e logarithms (In). 


If the association between two variables follows an exponen- 
tial model in the form y = ab*, transform the data by com- 
puting the logarithms of the y values and plot log(y) versus x 
(or In(y) versus x). 

Once you have achieved linearity, calculate the equa- 
tion of the least-squares regression line using the trans- 
formed data. Remember to include the transformed 
variables when you are writing the equation of the line. 
Likewise, when using the line to make predictions, make 
sure that the prediction is in the original units of y. If you 
transformed the y variable, you will need to undo the trans- 
formation after using the least-squares regression line. 

‘To decide between two or more transformations, look 
at the residual plots and choose the one with the most ran- 
dom scatter. 


What Did You Learn? 


Learning Objective Section Related Example 


on Page(s) 


Relevant Chapter 
Review Exercise(s) 


Check the conditions for performing inference about the slope ( of 


the population (true) regression line. 12.1 745 R12.2, R12.3, R12.4 
Interpret the values of a, b, s, SE,, and r* in context, and determine 
these values from computer output. 121 748, 754 R12.1 


Construct and interpret a confidence interval for the slope of the 


population (true) regression line. an 749 R12.3 
Perform a significance test about the slope ( of the population 
(true) regression line. 12.1 754 R12.2 


Use transformations involving powers and roots to find a power 
model that describes the relationship between two variables, and 
use the model to make predictions. 12.2 768, 770 R12.5 


Use transformations involving logarithms to find a power model or 
an exponential model that describes the relationship between two 
variables, and use the model to make predictions. 122 WI2, 103, US R12.6 


Determine which of several transformations does a better job of 
producing a linear relationship. 122 719 R12.6 


Chapter 12 Chapter Review Exercises 


These exercises are designed to help you review the impor- skilled workers who do the casting? If there is a clear pat- 
tant ideas and methods of the chapter. tern, it can be used to direct new workers or to automate 


26 
Exercises R12.1 to R12.3 refer to the following setting. In UAE DIG IC IUS IIE ERME SEEN SIR An SO: 


the casting of metal parts, molten metal flows through a 
“gate” into a die that shapes the part. The gate velocity 
(the speed at which metal is forced through the gate) plays 
a critical role in die casting. A firm that casts cylindrical 
aluminum pistons examined a random sample of 12 


A least-squares regression analysis was performed on 
the data. Some computer output and a residual plot are 
shown below. A Normal probability plot of the residuals 
(not shown) is roughly linear. 


pistons formed from the same alloy of metal. What is the Predictor  Coef SE Coef T P 
relationship between the cylinder wall thickness (inches) Constant 70.44 52.90 i.33 O.2i2 
and the gate velocity (feet per second) chosen by the Thickness 274.78 88.18 CESS oe 
S = 56.3641 R-Sq = 49.3% R-Sq(adj) = 44.2% 
350 
ie a  __________ 
3S 300 e 
504 e 
i 250 a ” 
g 5 of = eS % 
= 200 ° ° 
3 : 
8 -50} 
S 150 
> e 
-100 | 
100 ° 
02 03 O04 O5 O06 O07 O08 0.9 0.2 03 O04 O05 06 O07 O08 09 
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R12.1 
(a) 


R12.2 


R12.3 


R124 
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Casting aluminum 


Describe what the scatterplot tells you about the 
relationship between cylinder wall thickness and 
gate velocity. 

What is the equation of the least-squares regression 
line? Define any variables you use. 

One of the cylinders in the sample had a wall 
thickness of 0.4 inches. The gate velocity cho- 

sen for this cylinder was 104.8 feet per second. 
Does the regression line in part (b) overpredict or 
underpredict the gate velocity for this cylinder? By 
how much? Show your work. 

Is a linear model appropriate in this setting? Justify 
your answer with appropriate evidence. 

Interpret each of the following in context: 

(i) The slope 

(ii) s 

(iii) 2 

(iv) The standard error of the slope 

Casting aluminum Do the data provide convincing 
evidence at the a = 0.05 level of a linear relationship 
between thickness and gate velocity in the population 
of pistons formed from this alloy of metal? 


Casting aluminum Construct and interpret a 
95% confidence interval for the slope of the popu- 
lation regression line. Explain how this interval is 
consistent with the results of Exercise R12.2. 


SAT essay —is longer better? Following the debut 
of the new SAT Writing test in March 2005, Dr. 
Les Perelman from the Massachusetts Institute of 
Technology recorded the number of words and 
score for each essay in a sample provided by the 
College Board. A least-squares regression analysis 


Fitted Line Plot 
Predicted Score = 1.173 + 0.01037 Words 
9 
8 
7 
6 © 
£5 
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Ss 
2 s 0.792095 
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R12.5 


Predictor 
Constant 


Distance* (—2) 


S=0.00248369 


(a) 
(b) 


(c) 


was performed on these data. The two graphs at bot- 
tom left display the results of that analysis. Explain 
why the conditions for performing inference are not 
met in this setting. 


Light intensity In a physics class, the intensity 
of a 100-watt lightbulb was measured by a sen- 
sor at various distances from the light source. A 
scatterplot of the data is shown below. Note that 
a candela is a unit of luminous intensity in the 
International System of Units. 
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Physics textbooks suggest that the relationship be- 
tween light intensity y and distance x should follow 
an “inverse square law,” that is, a power law model 


4 1 
of the form y = ax * = a-. We transformed the 
2 


distance measurements by squaring them and then 
taking their reciprocals. Some computer output 
and a residual plot from a least-squares regression 
analysis on the transformed data are shown below. 
Note that the horizontal axis on the residual plot 
displays predicted light intensity. 


Coef SE Coet T P 
=O 000595 ©, O@1E2I, —O.33 ©, 7/'Sal 


ORZS9624 70 003/237 92.516) 2 0F 10,010 


R-Sq=99.9%  R-Sq(adj) =99.9% 


Residual 
° 
[—J 
o 
7 
. 
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Predicted intensity (candelas) 


Did this transformation achieve linearity? Give 
appropriate evidence to justify your answer. 

What is the equation of the least-squares regression 
line? Define any variables you use. 

What would you predict for the intensity of a 
100-watt bulb at a distance of 2.1 meters? Show 
your work. 


R12.6 An experiment was conducted to determine the ef 


fect of practice time (in seconds) on the percent of 
unfamiliar words recalled. Here is a Fathom scat- 
terplot of the results with a least-squares regression 
line superimposed. 


g 
i 


10 15 20 
Time 
— Recall_pet = 40.4 + 1.73Time; 2 = 0.86 


Sketch a residual plot. Be sure to label your axes. 


Explain why a linear model is not appropriate for 
describing the relationship between practice time 
and percent of words recalled. 


We used Fathom to transform the data in hopes of 
achieving linearity. The screen shots on the right 
show the results of two different transformations. 
Would an exponential model or a power model 
describe the relationship better? Justify your 
answer. 
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| Scatter Plot P% 


intime 
— Inrecall = 3.48 + 0.293intime; r2 = 0.98 


Inrecall 


Time 
— inrecall = 3.69 + 0.0304Time; r? = 0.75 


(d) Use each model to predict the word recall for 


25 seconds of practice. Show your work. Which 
prediction do you think will be better? 


Chapter 12 AP® Statistics Practice Test 


Section I: Multiple Choice Select the best answer for each question. 


Tze 


(a) 


Which of the following is not one of the conditions 
that must be satisfied in order to perform inference 
about the slope of a least-squares regression line? 


For each value of x, the population of y-values is 
Normally distributed. 


(b) The standard deviation o of the population of y-values 


corresponding to a particular value of x is always the 
same, regardless of the specific value of x. 


(c) ‘The sample size—that is, the number of paired ob- 


servations (x, y)— exceeds 30. 


(d) There exists a straight line y = a + (x such that, for 


each value of x, the mean ju, of the corresponding 
population of y-values lies on that straight line. 


(e) ‘The data come from a random sample or a random- 


ized experiment. 


We 


Cheerios 


Students in a statistics class drew circles of varying 
diameters and counted how many Cheerios could be 
placed in the circle. The scatterplot shows the results. 


180 - 
160 
140 
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The students want to determine an appropriate equation 
for the relationship between diameter and the number 
of Cheerios. ‘The students decide to transform the data 
to make it appear more linear before computing a least- 
squares regression line. Which of the following transforma- 
tions would be reasonable for them to try? 
I. Plot the square root of the number of Cheerios 
against diameter. 
IL. Plot the cube of the number of Cheerios against 
diameter. 
ILL. Plot the log of the number of Cheerios against the 
log of the diameter. 
IV. Plot the number of Cheerios against the log of the 


diameter. 
(a) Land II (c) IL and III (e) Land IV 
(b) TandIIl  (d) Iland IV 


T12.3 Inference about the slope (3 of a least-squares 
regression line is based on which of the following 
distributions? 

(a) The ¢ distribution with n — | degrees of freedom 
(b) The standard Normal distribution 


(c) The chi-square distribution with n — | degrees of 
freedom 


(d) The ¢ distribution with n — 2 degrees of freedom 


(e) The Normal distribution with mean jz and standard 
deviation 7 


Exercises T12.4 through T12.8 refer to the following setting. 
An old saying in golf is “You drive for show and you putt for 
dough.” The point is that good putting is more important 
than long driving for shooting low scores and hence winning 
money. To see if this is the case, data from a random sample 
of 69 of the nearly 1000 players on the PGA Tour’s world 
money list are examined. ‘The average number of putts per 
hole and the player’s total winnings for the previous season 
are recorded. A least-squares regression line was fitted to the 
data. The following results were obtained from statistical 
software. 


Predictor Coef SE Coef T P 
Constant Pegs iTg S023 182 6.86 0.000 
Avg. Putts —4139198 ISOS} E) TAL KKK Ok RK 
S=281777 R-Sq=8.1% R-Sq(adj) =7.8% 


112.4 The correlation between total winnings and average 
number of putts per hole for these players is 


(a) —0.285. (c) —0.007. — (e) 0.285. 
(b) —0.081. (d) 0.081. 


112.5 Suppose that the researchers test the hypotheses 
Ho: 8 = 0, Hy: 3 < 0. The value of the t statistic for 
this test is 

(ai Z.61, (c) 0.081. 
(by 24 (dl) 2 4. 


T12.6 The P-value for the test in Question T'12.5 is 
0.0087. A correct interpretation of this result is that 


(e) —20.24. 


(a) the probability that there is no linear relationship 
between average number of putts per hole and total 
winnings for these 69 players is 0.0087. 

(b) the probability that there is no linear relationship 
between average number of putts per hole and to- 
tal winnings for all players on the PGA Tour's world 
money list is 0.0087. 

(c) ifthere is no linear relationship between average num- 
ber of putts per hole and total winnings for the play- 
ers in the sample, the probability of getting a random 
sample of 69 players that yields a least-squares regres- 
sion line with a slope of —4139198 or less is 0.0087. 

(d) if there is no linear relationship between average 
number of putts per hole and total winnings for the 
players on the PGA Tour’s world money list, the 
probability of getting a random sample of 69 players 
that yields a least-squares regression line with a slope 


of —4139198 or less is 0.0087. 


(e) the probability of making a Type I error is 0.0087. 
T12.7 A 95% confidence interval for the slope @ of the 
population regression line is 


(a) 7,897,179 + 3,023,782. 
(b) 7,397,179 + 6,047,564. 
) 
) 


Se 


(@) =, IBIS se 193.371. 
(Gl) 4, I2Q IGS 2 3s el0N7. 
(@) =D BOIS SE 3.390, 742. 


T12.8 A residual plot from the least-squares regression is 
shown below. Which of the following statements is 
supported by the graph? 


1,000,000 4 
00,000 4 
600,000 4 ° 
400,000 4 
200,000 4 

0 ee 
~200,000 4 ° oo Me 
~400,000 4 
—600,000 4 
800,000 4 

1,000,000 4 


Residuals 


(00 le /2 ieee SO ie le7i/Smeel SOO ale S25meelessl) 
Average Putts per Round 


(a) ‘The residual plot contains dramatic evidence that 
the standard deviation of the response about the 
population regression line increases as the average 
number of putts per round increases. 

(b) ‘The sum of the residuals is not 0. Obviously, there is 
a major error present. 

(c) Using the regression line to predict a player’s total 
winnings from his average number of putts almost 
always results in errors of less than $200,000. 

(d) For two players, the regression line underpredicts 
their total winnings by more than $800,000. 

(e) The residual plot reveals no correlation between av- 
erage putts per round and prediction errors from the 
least-squares line for these players. 


112.9 Which of the following would provide evidence 
that a power law model of the form y = ax’, where 
b # O and b # 1, describes the relationship between 
a response variable y and an explanatory variable x? 


(a) A scatterplot of y versus x looks approximately 
linear. 


(b) A scatterplot of In y versus x looks approximately 
linear. 

(c) A scatterplot of y versus In x looks approximately 
linear. 

(d) A scatterplot of In y versus In x looks approximately 
linear. 

(e) None of these 
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T12.10 We record data on the population of a particular 
country from 1960 to 2010. A scatterplot reveals 
a clear curved relationship between population 
and year. However, a different scatterplot reveals 
a strong linear relationship between the logarithm 
(base 10) of the population and the year. The least- 
squares regression line for the transformed data is 


—— SSS 
log (population) = —13.5 + 0.01 (year) 
Based on this equation, the population of the 
country in the year 2020 should be about 
(a) 6.7. (c) 5,000,000. (e) 8,120,000. 
(b) 812. (d) 6,700,000. 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


T12.11 Growth hormones are often used to increase the 
weight gain of chickens. In an experiment using 
15 chickens, 3 chickens were randomly assigned 
to each of 5 different doses of growth hormone (0, 
0.2, 0.4, 0.8, and 1.0 milligrams). The subsequent 
weight gain (in ounces) was recorded for each 
chicken. A researcher plots the data and finds that 
a linear relationship appears to hold. Computer 
output from a least-squares regression analysis for 
these data is shown below. Assume that the condi- 
tions for performing inference about the slope (3 of 
the true regression line are met. 


Predictor Coef SE Coef T P 


Constant 4.5459 0.6166 737 <O,.O@OsL 
Dose 4.8323 1.0164 4.75 0.0004 
S$=3.135 R-Sq = 38.4% R= Sei (exch) = 37.75 


(a) What is the equation of the least-squares regression 
line for these data? Define any variables you use. 

(b) Interpret each of the following in context: 

(i) The slope 

(ii) ‘The y intercept 

(iii) s 

(iv) ‘The standard error of the slope 

(v) 

(c) Do the data provide convincing evidence of a linear 
relationship between dose and weight gain? Carry 
outa significance test at the a = 0.05 level. 

(d) Construct and interpret a 95% confidence interval 
for the slope parameter. 


112.12 Foresters are interested in predicting the amount 
of usable lumber they can harvest from various tree 
species. They collect data on the diameter at breast 
height (DBH) in inches and the yield in board 
feet of a random sample of 20 Ponderosa pine trees 
that have been harvested. (Note that a board foot is 


defined as a piece of lumber 12 inches by 12 inches 
by | inch.) A scatterplot of the data is shown below. 


300. 

250: 

i= 
z 

3 
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a 
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(a) Some computer output and a residual plot from a 
least-squares regression on these data appear below. 
Explain why a linear model may not be appropriate 
in this case. 


Predictor Coef SE Coef T P 

Constant dal . ILS 6 QE =I AS 0. MO@ 
DBH (inches) AL O43 O.57/52 ALS} ALS) (0) , ONC) 
S=20.3290 R-Sq= 95.3% R-Sq(adj) =95.1% 


Residual 
= o 


-20 
-30 
-40 
20 25 30 35 40 
DBH (inches) 


The foresters are considering two possible trans- 
formations of the original data: (1) cubing the 
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diameter values or (2) taking the natural logarithm 
of the yield measurements. After transforming 

the data, a least-squares regression analysis is 
performed. Some computer output and a residual 
plot for each of the two possible regression models 
follow. 


Option 1: Cubing the diameter values 


Predictor Coef SE Coef aT P 
Constant 2,078 5.444 O.38 0.707 
DBH*3 OL0042597 0. 000TS49 27.50 0.000 
S— 14.3601 R-Sq = 97.7% R-Sq(adj) =97.5% 

= 

é 0 

3 


Option 2: Taking natural logarithm of yield measurements 
Predictor Coef SEC Oc imma iE 


1 ASLY) OLS 6.86 0.000 
0.113417 0.006081 18.65 0.000 


Constant 
DBH (inches) 
S=0.214894 


R-Sq=95.1% R-Sq(adj) =94.8% 


Residual 


3.0 3.5 4.0 4.5 5.0 5.5 6.0 


(b) Use both models to predict the amount of usable 
lumber from a Ponderosa pine with diameter 30 
inches. Show your work. 

(c) Which of the predictions in part (b) seems more 
reliable? Give appropriate evidence to support your 
choice. 
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Section I: Multiple Choice Choose the best answer. 


AP4.1 A major agricultural company is testing a new 
variety of wheat to determine whether it is more 
resistant to certain insects than is the current wheat 
variety. The proportion of a current wheat crop lost 
to insects is 4%. ‘Thus, the company wishes to test 
the following hypotheses: 


Ho:p = 0.04 
Ho = 
Which of the following significance levels and 


sample sizes would lead to the highest power for this 
test? 


= 200 anda = 0.01 


(a) n 
(b) n = 400 and a = 0.05 
(c) n = 400 and a = 0.01 


(d) n = 500 anda = 0.01 
(e) n = 500 and a = 0.05 


AP4.2 If P(A) = 0.24 and P(B) = 0.52 and events A and B 
are independent, what is P(A or B)? 


(a) 0.1248  (b) 0.28 


(c) 0.6352 (d) 0.76 


(e) The answer cannot be determined from the given 
information. 


AP4.3 As part of a bear population study, data were gathered 
on a sample of black bears in the western United 
States to examine the relationship between the 
bear’s neck girth (distance around the neck) and the 
weight of the bear. A scatterplot of the data reveals a 
straight-line pattern. The r”-value from a least-squares 
regression analysis was determined to be 1? = 0.963. 
Which one of the following is the correct value and 
corresponding interpretation for the correlation? 


The correlation is —0.963, and 96.3% of the varia- 
tion in a bear’s weight can be explained by the 
least-squares regression line using neck girth as the 
explanatory variable. 


— 
~p 
<= 


(b) The correlation is 0.963. There is a strong positive 
linear relationship between a bear’s neck girth and 
its weight. 

(c) The correlation is 0.981, and 98.1% of the variation 
in a bear’s weight can be explained by the least- 


squares regression line using neck girth as the ex- 
planatory variable. 


(d) The correlation is —0.981. There is a strong negative 
linear relationship between a bear’s neck girth and 
its weight. 

(e) The correlation is 0.981. There is a strong positive 
linear relationship between a bear’s neck girth and 
its weight. 

AP4.4 The school board in a certain school district obtained 
a random sample of 200 residents and asked if they 
were in favor of raising property taxes to fund the hir- 
ing of more statistics teachers. The resulting confi- 
dence interval for the true proportion of residents in 
favor of raising taxes was (0.183, 0.257). The margin 
of error for this confidence interval is 


(a) 0.037. (c) 0.220. 
(b) 0.183. (d) 0.257. 


AP4.5 After a name-brand drug has been sold for several 
years, the Food and Drug Administration (FDA) will 
allow other companies to produce a generic equiva- 
lent. The FDA will permit the generic drug to be 
sold as long as there isn’t convincing evidence that it 
is less effective than the name brand drug. For a pro- 
posed generic drug intended to lower blood pressure, 
the following hypotheses will be used: 


(e) 0.740. 


Ho: Hc = by versus Hg: fg < bn 
where 
fig = true mean reduction in blood pressure us- 
ing the generic drug 
Jin = true mean reduction in blood pressure us- 
ing the name-brand drug. 
In the context of this situation, which of the follow- 
ing describes a Type I error? 


(a) The FDA finds convincing evidence that the ge- 
neric drug is less effective, when in reality it is less 
effective. 


S 


The FDA finds convincing evidence that the generic 
drug is less effective, when in reality it is equally 
effective. 

(c) The FDA fails to find convincing evidence that the 
generic drug is less effective, when in reality it is less 
effective. 

(d) The FDA fails to find convincing evidence that the 
generic drug is less effective, when in reality it is 
equally effective. 

(e) The FDA finds convincing evidence that the generic 

drug is equally effective, when in reality it is less 

effective. 


AP4.6 Which of the following sampling plans for estimating 
the proportion of all adults in a medium-sized town 
who favor a tax increase to support the local school 
system does not suffer from undercoverage bias? 
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(a) Arandom sample of 250 names from the local phone 


book 


(b) A random sample of 200 parents whose children at- 
tend one of the local schools 


(c) Asample consisting of 500 people from the city who 
take an online survey about the issue 


(d) A random sample of 300 homeowners in the town 


(e) A random sample of 100 people from an alphabeti- 
cal list of all adults who live in the town 


AP4.7 Which of the following is a categorical variable? 
(a) The weight of automobiles 


(b) ‘The time required to complete the Olympic marathon 
(c) The average gas mileage of a hybrid car 
( 


d) The brand of shampoo purchased by shoppers in a 
grocery store 


(e) The average closing price of a particular stock on the 
New York Stock Exchange 


AP4.8 A large machine is filled with thousands of small 
pieces of candy, 40% of which are orange. When 
money is deposited, the machine dispenses 60 
randomly selected pieces of candy. The machine 
will be recalibrated if a group of 60 candies con- 
tains fewer than 18 that are orange. What is the 
approximate probability that this will happen if the 
machine is working correctly? 


03-04 0.3-0.4 
elas (0.4)(0.6) ee ~*~ 0406) 
60 V6 
03-04 04-03 
he es 1 | yee aE 
0) 2 < Tan | 45 voxo7 
60 60 
03-04 
2S OHO) 
60 


AP4.9 A random sample of 900 students at a very large 
university was asked which social-networking site 
they used most often during a typical week. Their 
responses are shown in the table below. 


Networking site Male Female Total 
Facebook 221 283 504 
Twitter 42 38 80 
LinkedIn 108 87 195 
Pinterest 23 26 49 
MySpace 29 43 72 
Total 423 477 900 
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Assuming that gender and preferred networking observe its color, and replace it in the set. This is 
site are independent, how many females do you done a total of four times. Let X be the number of 
expect to choose LinkedIn? red cards observed in these four trials. The random 
(a) 18.85 (c) 87.00 (e) 103.35 variable X has which of the following probability 
AP4.10 Insurance adjusters are always vigilant about being (a) The Normal distribution with mean 2 and standard 
overcharged for accident repairs. The adjusters deviation | 
suspect that Repair Shop | quotes higher estimates (b) The binomial distribution with n = 10 and p = 0.5 
than Repair Shop 2. To check their suspicion, (c) The binomial distribution with n = 5 and p = 0.5 
the adjusters randomly select 12 cars that were re- (@y auhs Waal a Wanton dine serene ts 
cently involved in an accident and then take each iene ; 
of the cars to both repair shops to obtain separate (e) The geometric distribution with p = 0.5 
ea Upelet of the cost to fix the vehicle. The esti- AP4.13 A study of road rage asked random samples of 
mates are given below in hundreds of dollars. 596 men and 523 women about their behavior 
while driving. Based on their answers, each 
Car: 1 2 3 4 5 6 respondent was assigned a road rage score ona 
Shop1: 21.2 252 390 113 15.0 1841 scale of 0 to 20. The respondents were chosen 
Shop2: 213 241 3968 115 137 176 by random digit dialing of telephone numbers. 
eT Are the conditions for two-sample t inference 
Car: 7 8 9 10 11 12 satisfied? 
Shopi: 25.3 23.2 124 426 27.6 12.9 (a) Maybe. The data came from independent random 
Shop 2: 248 21.3 121 420 267 125 samples, butwe need to examine the data to check for 
Normality. 
Assuming that the conditions for inference are (b) No. Road rage scores in a range between 0 and 20 
reasonably met, which of the following signifi- can’t be Normal. 
cance tests could legitimately be used to determine (c) No. A paired ¢ test should be used in this case. 
nee saint ane 5 
whether the adjusters’ suspicion is comecti (d) Yes. The large sample sizes guarantee that the 
(a) A paired t test corresponding population distributions will be 
(b) A two-sample t test Normal. 
(c) At test to see if the slope of the population regres- (e) Yes. We have two independent random samples 
sionthnens and large sample sizes, and the 10% condition is 


(d) A chi-square test for homogeneity met 


(e) A two-sample z test for comparing two proportions AP4.14 Do hummingbirds prefer store-bought food made 
from concentrate or a simple mixture of sugar 
and water? ‘To find out, a researcher obtains 

10 identical hummingbird feeders and fills 5, 
chosen at random, with store-bought food from 
concentrate and the other 5 with a mixture of 
sugar and water. The feeders are then randomly 
assigned to 10 possible hanging locations in the 
researcher’s yard. Which inference procedure 
should you use to test whether hummingbirds 
show a preference for store-bought food based on 


AP4.11 A survey firm wants to ask a random sample of 
adults in Ohio if they support an increase in the 
state sales tax from 5% to 6%, with the additional 
revenue going to education. Let p denote the 
proportion in the sample who say that they sup- 
port the increase. Suppose that 40% of all adults 
in Ohio support the increase. How large a sample 
would be needed to guarantee that the standard 
deviation of f is no more than 0.01? 


(a) 1500 (c) 2401 (e) 9220 amount consumed? 
(b) 2400 (d) 2500 (a) A one-sample z test for a proportion 
AP4.12 A set of 10 cards consists of 5 red cards and 5 (b) A two-sample z test for a difference in proportions 


black cards. The cards are shuffled thoroughly, ( 
and you choose one at random, observe its color, 

and replace it in the set. The cards are thoroughly (d 
reshuffled, and you again choose a card at random, (e) A paired t test 


) 
) 
c) Achi-square test for independence 
) Atwo-sample t test 

) 


AP4.15 A Harris Poll found that 54% of American adults 


don’t think that human beings developed from 
earlier species. The poll’s margin of error for 95% 
confidence was 3%. ‘This means that 


there is a 95% chance that the interval (51%, 57%) 
contains the true percent of American adults who 
do not think that human beings developed from 
earlier species. 


the poll used a method that provides an estimate 
within 3% of the truth about the population 95% 
of the time. 

if Harris takes another poll using the same method, 
the results of the second poll will lie between 51% 
and 57%. 


there is a 3% chance that the interval is correct. 


) the poll used a method that would result in an 


interval that contains 54% in 95% of all possible 
samples of the same size from this population. 


AP4.16 Two six-sided dice are rolled and the sum of the 


faces showing is recorded after each roll. Let 
X = the number of rolls required until a sum 
greater than 7 is obtained. If 100 trials are con- 
ducted, which of the following is most likely to 
be part of the probability distribution of X? 


(a) (b) 
Number of |= Number of Number of | Number of 

rolls X trials rolls X trials 

1 34 0 34 

2 20 1 20 

3 16 2 16 

4 10 | 10 

5 6 4 6 

6 6 B) 6 

i 3 6 3 

8 2 i 2 

9 1 8 1 

10 0 9 0 

in| 1 10 1 

12 0 11 0 

13 1 12 1 
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(c) (d) 
Number of = Number of Number of |= Number of 
rolls X trials rolls X trials 
i] 18 1 10 
Ds 23 2 9 
3 26 3 10 
4 15 4 a2 
5 9 5 ii 
6 6 6 13 
7 1 7 10 
8 0 8 i 
9 1 9 9 
10 0 10 10 
al) 0 11 2 
12 0 2 1 
13 i 
(e) 
Number of Number of Number of Number of 
rolls X trials rolls X trials 
1 2 8 17 
2 2 9 9 
3 5 10 4 
4 10 lu 2 
5 11 12 0 
6 ik 13 1 
if 22 
AP4.17 Women who are severely overweight suffer eco- 
nomic consequences, a study has shown. They 
have household incomes that are an average of 
$6710 lower. The findings are from an eight-year 
observational study of 10,039 randomly selected 
women who were 16 to 24 years old when the re- 
search began. Does this study give strong evidence 
that being severely overweight causes a woman to 
have a lower income? 
(a) Yes. The study included both women who were se- 
verely overweight and women who were not. 
(b) Yes. The subjects in the study were selected at 
random. 
(c) No. The study showed that there is no connection 
between income and being severely overweight. 
(d) No. The study suggests an association between in- 
come and being severely overweight, but we can’t 
draw a cause-and-effect conclusion. 
(e) There is not enough information to answer this 


question. 
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Questions AP 4.18 and 4.19 refer to the following situation. 
Could mud wrestling be the cause of a rash contracted by 
University of Washington students? ‘Two physicians at the 
University of Washington student health center wondered 
about this when one male and six female students com- 
plained of rashes after participating in a mud-wrestling 
event. Questionnaires were sent to a random sample of stu- 
dents who participated in the event. The results, by gender, 
are summarized in the following table. 


Men Women 
Developed rash 12 12 
No rash 38 Ww 


Some Minitab output for the previous table is given below. 
The output includes the observed counts, the expected 
counts, and the chi-square statistic. 


Expected counts are printed below observed 
counts 
MEN WOMEN Total 
Developed rash 12 12 24 
L622 WER aR) 
No rash 38 1, 50 
Byeirer As} 16322 
Total 50 24 74 
ChiSg=5.002 


AP4.18 The cell that contributes most to the chi-square 
statistic is 


(a) men who developed a rash. 

(b) men who did not develop a rash. 
(c) women who developed a rash. 

(d) women who did not develop a rash. 
(e) both (a) and (d). 


AP4.19 From the chi-square test performed in this study, 
we may conclude that 
(a) there is convincing evidence of an association be- 
tween the gender of an individual participating in 
the event and development of a rash. 
(b) mud wrestling causes a rash, especially for women. 
(c) there is absolutely no evidence of any relation be- 
tween the gender of an individual participating in 
the event and the subsequent development of a rash. 
(d) development of a rash is a real possibility if you partici- 
pate in mud wrestling, especially if you do so regularly. 
(e) the gender of the individual participating in the event 
and the development of a rash are independent. 
AP4.20 Random assignment is part of a well-designed 
comparative experiment because 
(a) it is more fair to the subjects. 
(b) it helps create roughly equivalent groups before 
treatments are imposed on the subjects. 


CHAPTER 12 MORE ABOUT REGRESSION 


(c) 


it allows researchers to generalize the results of 
their experiment to a larger population. 

it helps eliminate any possibility of bias in the 
experiment. 

it prevents the placebo effect from occurring. 

The following back-to-back stemplots compare 


the ages of players from two minor-league hockey 
teams (1|7 = 17 years). 


Team A Team B 
98777 1 788889 
44333221 2 00123444 
7766595 2 596679 
521 3 023 
86 q 59 


Which of the following cannot be justified from 
the plots? 


‘Team A has the same number of players in their 30s 
as does ‘Team B. 

The median age of both teams is the same. 

Both age distributions are skewed to the right. 

The age ranges of both teams are similar. 


There are no outliers by the 1.5IQR rule in either 
distribution. 


A distribution that represents the number of cars X 
parked in a randomly selected residential driveway 
on any night is given by 


Xt 0 1 2 3 4 
pi OT O2iHiHsCS 


Which of the following statements is correct? 

This is a legitimate probability distribution because 
each of the p;-values is between 0 and 1. 

This is a legitimate probability distribution because 
=x; is exactly 10. 


This is a legitimate probability distribution because 
each of the p;values is between 0 and | and the 
x; is exactly 10. 


This is not a legitimate probability distribution 
because > x; is not exactly 10. 
This is not a legitimate probability distribution 
because > p; is not exactly 1. 


Which sampling method was used in each of the 
following settings, in order from I to IV? 


. Astudent chooses for a survey the first 20 students 


to arrive at school. 


. Thename ofeach studentina school is written on 


a card, the cards are well mixed, and 10 names 
are drawn. 


Il. 


IV. 


(d) 


(e) 


AP4.26 


L 


A state agency randomly selects 50 people from 
each of the state’s senatorial districts. 


A city council randomly selects eight city blocks 
and then surveys all the voting-age residents of 


those blocks. 

Voluntary response, SRS, stratified, cluster 
Convenience, SRS, stratified, cluster 
Convenience, cluster, SRS, stratified 
Convenience, SRS, cluster, stratified 


Cluster, SRS, stratified, convenience 


Western lowland gorillas, whose main habitat is 
the central African continent, have a mean weight 
of 275 pounds with a standard deviation of 40 
pounds. Capuchin monkeys, whose main habitat 
is Brazil and a few other parts of Latin America, 
have a mean weight of 6 pounds with a standard 
deviation of 1.1 pounds. Both weight distributions 
are approximately Normally distributed. If a par- 
ticular western lowland gorilla is known to weigh 
345 pounds, approximately how much would a 
capuchin monkey have to weigh, in pounds, to 
have the same standardized weight as the lowland 
gorilla? 


4.08 (c) 7.93 
ET (d) 8.20 
There is not enough information to determine the 


weight of a capuchin monkey. 


Suppose that the mean weight of a certain type 
of pig is 280 pounds with a standard deviation of 
80 pounds. The weight distribution of pigs tends 
to be somewhat skewed to the right. A random 
sample of 100 pigs is taken. Which of the follow- 
ing statements about the sampling distribution of 
the sample mean weight x is true? 


It will be Normally distributed with a mean of 280 
pounds and a standard deviation of 80 pounds. 

It will be Normally distributed with a mean of 280 
pounds and a standard deviation of 8 pounds. 


It will be approximately Normally distributed with 
a mean of 280 pounds and a standard deviation of 
80 pounds. 


It will be approximately Normally distributed with 
a mean of 280 pounds and a standard deviation of 
8 pounds. 


There is not enough information to determine the 
mean and standard deviation of the sampling dis- 
tribution. 


Which of the following statements about the ¢ dis- 
tribution with degrees of freedom df is (are) true? 


It is symmetric. 
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If. It has more variability than the ¢ distribution with 
df + 1 degrees of freedom. 


III. As df increases, the ¢ distribution approaches the 
standard Normal distribution. 


(a) lonly (c) II only (e) I, Il, and Ill 
(b) II only (d) [and Ill 


AP4.27 A company has been running television commer- 
cials for a new children’s product on five differ- 
ent family programs during the evening hours in 
a large city over a one-month period. A random 
sample of families is taken, and they are asked to 
indicate which of the five programs they viewed 
most often and their rating of the advertised prod- 
uct. The results are summarized in the following 


table. 
Family program 
Product rating A B C D E 
Excellent 23 29 42 48 51 
Good 25 33 44 53 49 
Fair 31 29 25 16 10 
Poor 38 32 25 18 12 


The advertiser decided to use a chi-square test to 
see if there is a relationship between the family 
program viewed and the product’s rating. What 
would be the degrees of freedom for this test? 


(a) 3 (c) 12 (e) 19 
(b) 4 (d) 18 


Questions AP4.28 and AP4.29 refer to the following situ- 
ation. Park rangers are interested in estimating the weight 
of the bears that inhabit their state. The rangers have data 
on weight (in pounds) and neck girth (distance around the 
neck in inches) for 10 randomly selected bears. Some re- 
gression output for these data is shown below. 


450 4 
400 4 
350'5 
300 4 
250 4 
200 4 
150 + 
100 + 

50s 


Weight (pounds) 


T T T T T T T T 
15:0" 7S 20:0" 22:5) 95:0) 27:5) ~ 30108 3255) 


Neck girth (inches) 


Predictor Coef SE Coef T P 
Constant —241.70 SO Oui =—6..27 “02000 


Neck Girth ZO 2310) TA 9'5 M2 932 20% 0100 


S = 26.7565 R-Sq = 94.7% 
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AP4.28 Which of the following represents a 95% con- 
fidence interval for the true slope of the least- 
squares regression line relating the weight of a 
bear and its neck girth? 


(a) 20.2304 1.695  (d)_ 20.230 + 20.22 
(b) 20.230 + 3.83 (ce) 26.7565 + 3.83 
(c) 20.230 + 3.91 


AP4.29 A bear was recently captured whose neck girth was 
35 inches and whose weight was 466.35 pounds. 
If this bear were added to the data set given above, 
what would be the effect on the value of s? 


(a) It would decrease the value of s because the added 
point is an outlier. 


(b) It would decrease the value of s because the added 
point lies on the least-squares regression line. 


(c) It would increase the value of s because the added 
point is an outlier. 


(d) It would increase the value of s because the added 
point lies on the least-squares regression line. 


(e) It would have no effect on the value of s because the 
added point lies on the least-squares regression line. 


AP4.30 An experimenter wishes to test whether or not 
two types of fish food (a standard fish food and 
a new product) work equally well at producing 
fish of equal weight after a two-month feeding 
program. The experimenter has two identical 
fish tanks (1 and 2) to put fish in and is consider- 
ing how to assign 40 fish, each of which has a 
numbered tag, to the tanks. The best way to do 
this would be to 


(a) putall the odd-numbered fish in one tank, the even 
in the other, and give the standard food type to the 
odd-numbered ones. 


(b) obtain pairs of fish whose weights are roughly equal 
at the start of the experiment and randomly assign 
one to ‘Tank | and the other to ‘Tank 2, with the 
feed assigned at random to the tanks. 


(c) proceed as in part (b), but put the heavier of the 
pair into ‘Tank 2. 


(d) assign the fish completely at random to the two 
tanks and give the standard feed to Tank 1. 


(e) assign the fish to the tanks using any method that 
the researcher wants. The placebo effect doesn’t 
apply to fish. 


AP4.31 A city wants to conduct a poll of taxpayers to 
determine the level of support for constructing a 
new city-owned baseball stadium. Which of the 
following is the primary reason for using a large 
sample size in constructing a confidence interval 
to estimate the proportion of city taxpayers who 
would support such a project? 


(a) ‘To increase the confidence level 
(b) ‘To eliminate any confounding variables 
(c) ‘To reduce nonresponse bias 
(d) 

) 


d 


(e) To reduce undercoverage 


To increase the precision of the estimate 


AP4.32 A standard deck of playing cards contains 52 cards, 
of which 4 are aces and 13 are hearts. You are of 
fered a choice of the following two wagers: 


I. Draw one card at random from the deck. You 
win $10 if the card drawn is an ace. Otherwise, 
you lose $1. 


II. Draw one card at random from the deck. If the card 
drawn is a heart, you win $2. Otherwise, you lose $1. 
Which of the two wagers should you prefer? 

(a) Wager 1, because it has a higher expected value 

(b) Wager 2, because it has a higher expected value 

(c) Wager 1, because it has a higher probability of winning 

(d) Wager 2, because it has a higher probability of winning 

(e) Both wagers are equally favorable. 

AP4.33 Below are boxplots of SAT Critical Reading and 


Math scores for a randomly selected group of 
female juniors at a highly competitive suburban 


school. 
Se 
oo 
T T T T T 
400 500 600 700 800 


Scores 


Which of the following cannot be justified by the 


plots shown above? 


(a) The maximum Critical Reading score is higher 
than the maximum Math score. 

(b) Critical Reading scores are skewed to the right, 
whereas Math scores are somewhat skewed to the left. 

(c) The median Critical Reading score for females is 
slightly higher than the median Math score. 

(d) ‘There appear to be no outliers in the SAT’ score 
distributions. 

(e) The mean Critical Reading score and the mean 
Math score for females are about the same. 


AP4.34 A distribution of exam scores has mean 60 and 
standard deviation 18. If each score is doubled, 
and then 5 is subtracted from that result, what 
will be the mean and standard deviation, respec- 
tively, of the new scores? 


) mean = 115 and standard deviation = 31 
) mean = 115 and standard deviation = 36 
) mean = 120 and standard deviation = 6 
(d) mean = 120 and standard deviation = 31 
) mean = 120 and standard deviation = 36 
5 


In a clinical trial, 30 patients with a certain blood 
disease are randomly assigned to two groups. One 
group is then randomly assigned the currently 
marketed medicine, and the other group receives 
the experimental medicine. Each week, patients 
report to the clinic where blood tests are con- 
ducted. The lab technician is unaware of the kind 
of medicine the patient is taking, and the patient 
is also unaware of which medicine he or she has 
been given. This design can be described as 


(a) a double-blind, completely randomized experi- 
ment, with the currently marketed medicine and 
the experimental medicine as the two treatments. 

(b) asingle-blind, completely randomized experiment, 
with the currently marketed medicine and the ex- 
perimental medicine as the two treatments. 

(c) a double-blind, matched pairs design, with the cur- 
rently marketed medicine and the experimental 
medicine forming a pair. 

(d) a double-blind, block design that is not a matched 
pairs design, with the currently marketed medicine 
and the experimental medicine as the two blocks. 

(e) a double-blind, randomized observational study. 

AP4.36 A local investment club that meets monthly has 
200 members ranging in age from 27 to 81. Acu- 
mulative relative frequency graph is shown below. 
Approximately how many members of the club are 
more than 60 years of age? 


100 4 


80 4 


Cumulative relative frequency 


20 30 40 50 60 70 80 
Age of members (years) 
(a) 20 (c) 78 (e) 110 
(b) 44 (d) 90 


AP4.37 A manufacturer of electronic components is test- 
ing the durability of a newly designed integrated 
circuit to determine whether its life span is longer 
than that of the earlier model, which has a mean 
life span of 58 months. ‘The company takes a 
simple random sample of 120 integrated circuits 
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and simulates normal use until they stop work- 
ing. The null and alternative hypotheses used for 
the significance test are given by Ho: u = 58 and 
H,:p > 58. The P-value for the resulting one- 
sample t test is 0.035. Which of the following best 
describes what the P-value measures? 


(a) The probability that the new integrated circuit has 
the same life span as the current model is 0.035. 


(b) The probability that the test correctly rejects the null 
hypothesis in favor of the alternative hypothesis is 0.035. 


(c) The probability that a single new integrated circuit will 
not last as long as one of the earlier circuits is 0.035. 


(d) The probability of getting a sample statistic as far 
or farther from 58 if there really is no difference 
between the new and the old circuits is 0.035. 


(e) The probability of getting a sample mean for the 
new integrated circuit that is lower than the mean 
for the earlier model is 0.035. 


Questions AP4+.38 and AP4.39 refer to the following situa- 
tion. Do children’s fear levels change over time and, if so, in 
what ways? Little research has been done on the prevalence 
and persistence of fears in children. Several years ago, two 
researchers surveyed a randomly selected group of 94 third- 
and fourth-grade children, asking them to rate their level of 
fearfulness about a variety of situations. ‘Two years later, the 
children again completed the same survey. The researchers 
computed the overall fear rating for each child in both years 
and were interested in the relationship between these rat- 
ings. They then assumed that the true regression line was 


Mater rating = Olt B (initial rating) 


and that the assumptions for regression inference were satisfied. 
This model was fitted to the data using least-squares regression. 
The following results were obtained from statistical software. 


Predictor Coefficient St. Dev. 
Constant 0.877517 0.1184 
Initial Rating 0.397911 0.0676 


S$=0.2374 R-Sq= 0.274 


Here is a scatterplot of the later ratings versus the initial rat- 
ings and a plot of the residuals versus the initial ratings. 


e 
e 
2.1— oe 
e e e £ 
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3 ° ? Tone! Oo 
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i is e ee 6 * . je 
Qo isa i ° Ot -° 
e e . 
e Ae “ee 3 e 
e «> e . ‘e 
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Initial ratings 


AP4.38 Which of the following statements is supported by 
these plots? 


(a) There is no striking evidence that the assumptions 
for regression inference are violated. 


(b) The abundance of outliers and influential observa- 
tions in the plots means that the assumptions for 
regression are clearly violated. 


(c) These plots contain dramatic evidence that the 
standard deviation of the response about the true 
regression line is not approximately the same for 
each x-value. 


(d) These plots call into question the validity of the (a) 
assumption that the later ratings vary Normally 
about the least-squares line for each value of the (b) 
initial ratings. 

(e) A linear model isn’t appropriate here because the (c) 


residual plot shows no association. 


George’s initial fear rating was 0.2 higher than 
Jonny’s. What does the model predict about their 
final fear ratings? 

George’s will be about 0.96 higher than Jonny’s. 
George’s will be about 0.40 higher than Jonny’s. 
George’s will be about 0.20 higher than Jonny’s 
George’s will be about 0.08 higher than Jonny’s. 
George’s will be about the same as Jonny’s. 


AP4.40 The table below provides data on the political 


affiliation and opinion about the death penalty of 
850 randomly selected voters from a congressional 
district. 


Favor Oppose Total 
Republican 299 98 397 
Democrat TE 171 248 
Other 118 87 205 
Total 494 356 850 


Which of the following does not support the con- 
clusion that being a Republican and favoring the 
death penalty are not independent? 


299 |, 98 eae 
494" 356 850” 850 
299 , 397 (397)(494) 

494 7 850 i ao 
494, 299 

850° 397 


Section II: Free Response Show all your work. Indicate clearly the methods you use, because you will be graded on 
the correctness of your methods as well as on the accuracy and completeness of your results and explanations. 


AP4.41 The body’s natural electrical field helps wounds 
heal. If diabetes changes this field, it might explain 
why people with diabetes heal more slowly. A study 
of this idea compared randomly selected normal 
mice and randomly selected mice bred to spontane- 
ously develop diabetes. The investigators attached 
sensors to the right hip and front feet of the mice 
and measured the difference in electrical potential 
(in millivolts) between these locations. Graphs of 
the data for each group reveal no outliers or strong 
skewness. The following computer output provides 
numerical summaries of the data.”” 


AP4.42 


Variable N Mean StDev Minimum 

Diabetic mice 24 13.090 4.839 0510 

Normal mice 18 HO 02257 i275 9i'5, 4.950 

Ql Median Q3 Maximum (a) 
10'..03:8 12°26.5,0 OS'S 22600 

BrZ238 GeZi50 23 75 16.100 


The researchers want to know whether the dif- 
ference in mean electrical potentials between 
normal mice and mice with diabetes is statistically 
significant at the a = 0.05 level. Carry out a test 
and report your conclusion. 


Can physical activity in youth lead to mental 
sharpness in old age? A 2010 study investigating 
this question involved 9344 randomly selected, 
mostly white women over age 65 from four U.S. 
states. These women were asked about their levels 
of physical activity during their teenage years, 
thirties, fifties, and later years. ‘Those who reported 
being physically active as teens enjoyed the lowest 
level of cognitive decline—only 8.5% had cognitive 
impairment—compared with 16.7% of women who 
reported not being physically active at that time. 


State an appropriate pair of hypotheses that the 
researchers could use to test whether the propor- 
tion of women who suffered a cognitive decline was 


(b) 


(d) 


AP4.43 


(c) 


AP4.44 


significantly lower for women who were physically 
active in their youth than for women who were not 
physically active at that time. Be sure to define any 
parameters you use. 

Assuming the conditions for performing inference 
are met, what inference method would you use to 
test the hypotheses you identified in part (b)? Do 
not carry out the test. 

Suppose the test in part (b) shows that the propor- 
tion of women who suffered a cognitive decline was 
significantly lower for women who were physically 
active in their youth than for women who were not 
physically active at that time. Can we generalize 
the results of this study to all women aged 65 and 
older? Justify your answer. 

We cannot conclude that being physically active as 
a teen causes a lower level of cognitive decline for 
women over 65, due to possible confounding with 
other variables. Explain the concept of confound- 
ing and give an example of a potential confounding 
variable in this study. 


In a recent poll, randomly selected New York 
State residents at various fast-food restaurants were 
asked if they supported or opposed a “fat tax” on 
nondiet sugared soda. Thirty-one percent said 

that they were in favor of such a tax and 66% were 
opposed. But when asked if they would support 
such a tax if the money raised were used to fund 
health care given the high incidence of obesity in 
the United States, 48% said that they were in favor 
and 49% were opposed. 

In this situation, explain how bias may have been 
introduced based on the way the questions were 
worded and suggest a way that they could have 
been worded differently in order to avoid this bias. 
In this situation, explain how bias may have been 
introduced based on the way the sample was taken 
and suggest a way that the sample could have been 
obtained in order to avoid this bias. 

This poll was conducted only in New York State. 
Suppose the pollsters wanted to ensure that esti- 
mates for the proportion of people who would sup- 
port a tax on nondiet sugared soda were available 
for each state as well as an overall estimate for the 
nation as a whole. Identify a sampling method that 
would achieve this goal and briefly describe how 
the sample would be taken. 


Each morning, coffee is brewed in the school work- 
room by one of three faculty members, depending 
on who arrives first at work. Mr. Worcester arrives 
first 10% of the time, Dr. Currier arrives first 50% 
of the time, and Mr. Legacy arrives first on the re- 
maining mornings. The probability that the coffee 
is strong when brewed by Dr. Currier is 0.1, while 


AP4.45 
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the corresponding probabilities when it is brewed 
by Mr. Legacy and Mr. Worcester are 0.2 and 0.3, 
respectively. Mr. Worcester likes strong coffee! 


What is the probability that on a randomly selected 
morning the coffee will be strong? Show your work. 


Ifthe coffee is strong on a randomly selected morn- 
ing, what is the probability that it was brewed by 
Dr. Currier? Show your work. 


The following table gives data on the mean number 
of seeds produced in a year by several common tree 
species and the mean weight (in milligrams) of the 
seeds produced. ‘I'wo species appear twice because 
their seeds were counted in two locations. We 
might expect that trees with heavy seeds produce 
fewer of them, but what mathematical model best 
describes the relationship?” 


Tree species Seed count Seed weight (mg) 
Paper birch 27,239 0.6 
Yellow birch 12,158 1.6 
White spruce 7202 2.0 
Engelmann spruce 3671 3.3 
Red spruce 5051 3.4 
Tulip tree 13,509 9.1 
Ponderosa pine 2667 Sie 
White fir 5196 40.0 
Sugar maple 1751 48.0 
Sugar pine 1159 216 
American beech 463 247 
American beech 1892 247 
Black oak 93 1851 
Scarlet oak 525 1930 
Red oak 411 2475 
Red oak 253 2475 
Pignut hickory 40 3423 
White oak 184 3669 
Chestnut oak 107 4535 
(a) Based on the scatterplot below, is a linear model 


Seed weight (mg) 


appropriate to describe the relationship between 
seed count and seed weight? Explain. 


5000 4 
4000 5 
3000 4 
2000 5 


1000 4 
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Two alternative models based on transforming 
the original data are proposed to predict the seed 
weight from the seed count. Graphs and computer 
output from a least-squares regression analysis on 
the transformed data are shown below. 


Model A: 


In(weight) 
p 
L 


e e 
2-4 e 
afl e 
T T T T T T T 
0 5000 10,000 ~=—:15,000 =~. 20,000 ~=—- 25,000 ~—- 30,000 
Seed count 
37 
e 
24 3 
? 
1-7 
3 0 + 
2-1 
e eo 
27 
3-7 e ‘ 
=o T T T T T 
-4 -2 0 2 4 6 
Fitted value 
Predictor Coef SE Coef Tr P 
Constant 6.1394 O.5 726: TOS (0.000 
Seed —0.00033869 0.00007187 —4.71 0.000 
Count 
S=2.08100 R-Sq=56.6% R-Sq(adj) =54.1% 
Model B: 
is e 
85 Z e e 
e e 
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Fitted value 
Predictor Coef SE Coeft T P 
Constant £5491 1.081 14.55 0.000 
In(count) —1.5222 0.1470 =10.235 0.000 
S=1.16932 R-Sq= 86.3% R-Sq (adj) =85.5% 


(b) 


Which model, A or B, is more appropriate for pre- 
dicting seed weight from seed count? Justify your 
answer. 


Using the model you chose in part (b), predict the 
seed weight if the seed count is 3700. 


Interpret the value of r° for your model. 


Suppose a company manufactures plastic lids for 
disposable coffee cups. When the manufacturing 
process is working correctly, the diameters of the 
lids are approximately Normally distributed with 

a mean diameter of 4 inches and a standard devia- 
tion of 0.02 inches. ‘To make sure the machine is 
not producing lids that are too big or too small, 
each hour a random sample of 25 lids is selected 
and the sample mean is calculated. 

Describe the shape, center, and spread of the sam- 
pling distribution of the sample mean diameter, as- 
suming the machine is working properly. 


The company decides that it will shut down 

the machine if the sample mean diameter is less 
than 3.99 inches or greater than 4.01 inches, be- 
cause this indicates that some lids will be too small 
or too large for the cups. If the sample mean is less 
than 3.99 or greater than 4.01, all the lids from that 
hour are thrown away because the company does 
not want to sell bad products. 
Assuming that the machine is working properly, 
what is the probability that a random sample of 25 
lids will have a mean diameter less than 3.99 inch- 
es or greater than 4.01 inches? Show your work. 


Also, to look for any trends, each hour the 
company records the value of the sample mean on 
a chart, like the one at top right. 


Sample mean 
diameter (inches) 
PN 
So 
o 


Hour 3 


9) 
o 
Ne} 


One benefit of using this type of chart is that 
out-of-control production trends can be no- 
ticed before it is too late and lids have to be 
thrown away. For example, if the sample mean 
increased in 3 consecutive samples, this would 
suggest that something might be wrong with the 
machine. If this trend can be noticed before 
the sample mean gets larger than 4.01, then the 
machine can be fixed without having to throw 
away any lids. 


(c) Assuming that the manufacturing process is work- 


ing correctly, what is the probability that the sample 
mean diameter will be above the desired mean of 
4.00 but below the upper boundary of 4.01? Show 
your work. 


= 
oO 
wa 
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Assuming that the manufacturing process is work- 
ing correctly, what is the probability that in 5 con- 
secutive samples, + or 5 of the sample means will 
be above the desired mean of 4.00 but below the 
upper boundary of 4.01? Show your work. 


Which of the following results gives more convinc- 
ing evidence that the machine needs to be shut 
down? Explain. 


1. Getting a single sample mean below 3.99 or above 
4.01 


or 


2. Taking 5 consecutive samples and having at 
least + of the sample means be between 4.00 and 
401. 


Suggest a different rule (other than | and 2 stated 
in part (e)) for stopping the machine before it starts 
producing lids that have to be thrown away. Assum- 
ing that the machine is working properly, calculate 
the probability that the machine will be shut down 


when using your rule. 
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coin and also on the surface. 


Notes and Data Sources N/DS-5 


2. R. Vallone and A. 'T’versky, “The hot hand in basketball: on the 
misperception of random sequences,” Cognitive Psychology, 17 
(1985), pp. 295-314. 

3. Gur Yaari and Shmuel Eisenmann, The Hot (Invisible?) Hand: 
Can ‘Time Sequence Patterns of Success/Failure in Sports Be 
Modeled as Repeated Random Independent Trials? (2011) PLoS 
ONE 6(10): e24532. doi:10.1371/journal.pone.0024532 

4. The excerpts in Exercise 22 are from http://marilynvossavant. 
com/game-show-problem/. 

5. From the Web site of the Gallup Organization, www.gallup.com. 
Individual poll reports remain on this site for only a limited time. 

6. Information for the 1999-2000 academic year is from the 2003 
Statistical Abstract of the United States, Table 286. 

7. Data for 2006 from the Web site of Statistics Canada, www. 
statcan.ca. 

8. Gail Burrill, “Two-way tables: introducing probability using real 
data,” paper presented at the Mathematics Education into the Twenty- 
first Century Project, Czech Republic, September 2003. Burrill cites 
as her source H. Kranendonk, P. Hopfensperger, and R. Scheaffer, 
Exploring Probability, Dale Seymour Publications, 1999. 

9. From the EESEE story “Is It Tough to Crawl in March?” 

10. Pierre J. Meunier et al., “The effects of strontium ranelate on the 
risk of vertebral fracture in women with postmenopausal osteoporosis,” 
New England Journal of Medicine, 350 (2004), pp. 459-468. 

11. The table closely follows the grade distributions for these three 
schools at the University of New Hampshire in the fall of 2000, 
found in a selfstudy document at www.unh.edu/academic-affairs/ 
neasc/. The counts of grades mirror the proportions of UNH stu- 
dents in these schools. The table is simplified to refer to a university 
with only these three schools. 

12. Information about Internet users comes from sample surveys 
carried out by the Pew Internet and American Life Project, at www. 
pewinternet.org. 

13. Data on Roger Federer’s serve percentages from www. 
atpworldtour.com. 

14. Thanks to Michael Legacy for providing these data. 

15. This is one of several tests discussed in Bernard M. Branson, 
“Rapid HIV testing: 2005 update,” a presentation by the Centers for 
Disease Control and Prevention, at www.cde.gov. The Malawi 
clinic result is reported by Bernard M. Branson, “Point-of-care rapid 
tests for HIV antibody,” Journal of Laboratory Medicine, 27 (2003), 
pp. 288-295. 
16. ‘The National Longitudinal Study of Adolescent Health inter- 
viewed a stratified random sample of 27,000 adolescents, then rein- 
terviewed many of the subjects six years later, when most were aged 
19 to 25. These data are from the Wave III reinterviews in 2000 and 
2001, found at the Web site of the Carolina Population Center, 
www.cpe.unc.edu. 

17. Information about Internet users comes from sample surveys 
carried out by the Pew Internet and American Life Project, found 


online at www.pewinternet.org. The music-downloading data were 
collected in 2003. 

18. We got these data from the Energy Information Administration 
on their Web site at http://www.cia.gov/dnav/pet/pet_sum_mkt_ 
dcu_sct_m.htm. 

19. From the National Institutes of Health’s National Digestive 
Diseases Information Clearinghouse, found at http://digestive.niddk. 
nih.gov/. 

20. The probabilities given are realistic, according to the fundrais- 
ing firm SCM Associates, at semassoc.com. 


N/DS-6 Notes and Data Sources 


21. Probabilities from trials with 2897 people known to be free of 
HIV antibodies and 673 people known to be infected are reported 
in J. Richard George, “Alternative specimen sources: methods for 
confirming positives,” 1998 Conference on the Laboratory Science 
of HIV, found online at the Centers for Disease Control and 
Prevention, www.cdc.gov. 

22. Robert P. Dellavalle et al., “Going, going, gone: lost Internet 
references,” Science, 302 (2003), pp. 787-788. 

23. Margaret A. McDowell et al., “Anthropometric reference data 
for children and adults: U.S. population, 1999-2002,” National 
Center for Health Statistics, Advance Data from Vital and Health 
Statistics, No. 361 (2005), at www.cdc.gov/nchs. 

24. The General Social Survey exercises in this chapter present 
tables constructed using the search function at the GSS archive, 
sda.berkeley.edu/archive.htm. 

25. National population estimates for July 1, 2006, at the Census 
Bureau Web site www.census.gov. The table omits people who con- 
sider themselves to belong to more than one race. 

26. Data provided by Patricia Heithaus and the Department of 
Biology at Kenyon College. 

27. Thanks to Tim Brown, ‘The Lawrenceville School, for provid- 
ing the idea for this exercise. 


Chapter 6 

1. U.S. Supreme Court, Strauder v. West Virginia, 100 U.S. 303 
(1879) _ https://supreme.justia.com/cases/federal/us/100/303/case. 
html. 

2. U.S. Supreme Court, Berghuis v. Smith, Docket No. 08-1402 
http:/Avww.law.cornell.edu/supcet/html/08-1402.ZO0. html. 

3. In most applications, X takes a finite number of possible values. 
The same ideas, implemented with more advanced mathematics, 
apply to random variables with an infinite but still countable collec- 
tion of values. 

4. The Apgar score data came from National Center for Health 
Statistics, Monthly Vital Statistics Reports, Vol. 30, No. 1, 
Supplement, May 6, 1981. 

5. Information from www.ncsu.edu. 

6. The mean of a continuous random variable X with density 
function f(x) can be found by integration: 


bx = [xfx) dx 


This integral is a kind of weighted average, analogous to the 
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the integral 
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Chapter 1 
Introduction 


Answers to Check Your Understanding 

page 4: 1. The cars in the student parking lot. 2. He measured 
the car’s model (categorical), year (quantitative), color (categori- 
cal), number of cylinders (quantitative), gas mileage (quantitative), 
weight (quantitative), and whether it has a navigation system 
(categorical). 


Answers to Odd-Numbered Introduction Exercises 

1.1 Type of wood, type of water repellent, and paint color are cat- 
egorical. Paint thickness and weathering time are quantitative. 

1.3 (a) AP® Statistics students who completed a questionnaire on 
the first day of class. (b) Categorical: gender, handedness, and 
favorite type of music. Quantitative: height, homework time, and 
the total value of coins in a student’s pocket. (c) The individual is a 
female who is right-handed. She is 58 inches tall, spends 60 min- 
utes on homework, prefers Alternative music, and has 76 cents in 
her pocket. 

1.5 Student answers will vary. For example, quantitative variables 
could be graduation rate and student-faculty ratio, and categorical 
variables could be region of the country and type of institution 
(2-year college, 4-year college, university). 

1.7 b 


Section 1.1 


Answers to Check Your Understanding 

page 14: 1. Fly: 99/415 = 23.9%, Freeze time: 96/415 = 23.1%, 
Invisibility: 67/4415 = 16.1%, Superstrength: 43/415 = 10.4%, 
Telepathy: 110/415 = 26.5%. 2. A bar graph is shown below. It 
appears that telepathy, ability to fly, and ability to freeze time were 
the most popular choices, with about 25% of students choosing 
each one. Invisibility was the 4th most popular and superstrength 
was the least popular. 


Percent 
a 


Superpower preference 


page 18: 1. For the U.K. students: 54/200 = 27% said fly, 52/200 
= 26% said freeze time, 30/200 = 15% said invisibility, 20/200 = 
10% said superstrength, and 44/200 = 22% said telepathy. For the 
US. students: 45/215 = 20.9% said fly, 44/215 = 20.5% said freeze 
time, 37/215 = 17.2% said invisibility, 23/215 = 10.7% said super- 
strength, and 66/215 = 30.7% said telepathy. 2. A bar graph is 
shown in the next column. 3. There is an association between 
country of origin and superpower preference. Students in the U.K. 
are more likely to choose flying and freezing time, while students in 
the U.S. are more likely to choose invisibility or telepathy. 
Superstrength is about equally unpopular in both countries. 


U.K. 
Hus. 


Percent 
ees 
: 


Superpower preference 


Answers to Odd-Numbered Section 1.1 Exercises 
1.9 (a) 1% (b) A bar graph is given below. (c) Yes, because the 
numbers in the table refer to parts of a single whole. 


1.11 (a) A bar graph is given below. A pie chart would also be appro- 
priate because the numbers in the table refer to parts of a single 
whole. (b) Perhaps induced or C-section births are scheduled for 
weekdays so doctors don’t have to work as much on the weekend. 
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1.13 About 63% are Mexican and 9% are Puerto Rican. 

1.15 (a) The given percents represent fractions of different age 
groups, rather than parts of a single whole. (b) A bar graph is given 
below. 
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1.17 (a) The areas of the pictures should be proportional to the 
numbers of students they represent. (b) A bar graph is given below. 
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S-2 Solutions 


1.19 (a) 133 people; 36 buyers of coffee filters made of recycled 
paper. (b) 36.8% said “higher,” 24.1% said “the same,” and 39.1% 
said “lower.” Overall, 60.9% of the members of the sample think 
the quality is the same or higher. 

1.21 For buyers, 55.6% said higher, 19.4% said the same, and 25% 
said lower. For the nonbuyers, 29.9% said higher, 25.8% said the 
same, and 44.3% said lower. We see that buyers are much more 
likely to consider recycled filters higher in quality and much less 
likely to consider them lower in quality than nonbuyers. 

1.23 Americans are much more likely to choose white/pearl and 
red, while Europeans are much more likely to choose silver, black, 
or gray. Preferences for blue, beige/brown, green, and yellow/gold 
are about the same for both groups. 

1.25 A table and a side-by-side bar graph comparing the 
distributions of snowmobile use for environmental club members 
and nonmembers are shown below. There appears to be an associa- 
tion between environmental club membership and snowmobile 
use. The visitors who are members of an environmental club are 
much more likely to have never used a snowmobile and less likely 
to have rented or owned a snowmobile than visitors who are not in 
an environmental club. 


Not a member Member 


Never used 445/1221 = 36.4% 212/305 = 69.5% 
Snowmobile 497/1221 = 40.7% 77/305 = 25.2% 
renter 
Snowmobile 279/1221 = 22.9% 16/305 = 5.2% 
owner 
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used renter owner 
127 “d 
1.29 d 
1.31 b 
1.33 d 
1.35 Answers will vary. ‘Two possible tables are given below. 
10 40 30 20 
50 0 30 20 


Section 1.2 


Answers to Check Your Understanding 

page 29: 1. This distribution is skewed to the right and uni- 
modal. 2. ‘The midpoint of the 28 values is between | and 2. 
3. The number of siblings varies from 0 to 6. 4. There are two 
potential outliers at 5 and 6 siblings. 


page 32: 1. Both males and females have distributions that are 
skewed to the right, though the distribution for the males is more 
heavily skewed. ‘The midpoint for the males (9 pairs) is less than the 
midpoint for the females (26 pairs). ‘The number of shoes owned by 
females varies more (from 13 to 57) than for males (from 4 to 38). The 
male distribution has three likely outliers at 22, 35, and 38. The 
females do not have any likely outliers. 2. b 3. e 4. ¢ 

page 38: 1. One possible histogram is shown below. 2. The dis- 
tribution is roughly symmetric and bell-shaped. The typical IQ 
appears to be between 110 and 120 and the [Qs vary from 80 to 150. 
There do not appear to be any outliers. 


Frequency 


18 
16 
14 
12 
10 
8 
6 
4 
2 
0 


80 90 100 110 120 130 140 150 
1Q 


page 39: 1. This is a bar graph because field of study is a categori- 
cal variable. 2. No, because the variable is categorical and the cat- 
egories could be listed in any order on the horizontal axis. 


Answers to Odd-Numbered Section 1.2 Exercises 

1.37 (a) The graph is shown below. (b) The distribution is roughly 
symmetric with a midpoint of 6 hours. The hours of sleep vary from 
3 to 11. There do not appear to be any outliers. 
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Hours of sleep 


1.39 (a) This dot represents a game where the opposing team won 
by 1 goal. (b) All but 4 of the 25 values are positive, which indicates 
that the U.S. women’s soccer team had a very good season. They 
won 21/25 = 84% of their games. 

1.41 As coins get older, they are taken out of circulation and new 
coins are introduced, meaning that most coins will be from recent 
years with a few from previous years. 

1.43 Both distributions are roughly symmetric and have about the 
same amount of variability. The center of the internal distribution 
is greater than the center of the external distribution, indicating 
that external rewards do not promote creativity. Neither distribution 
appears to have outliers. 

1.45 (a) Otherwise, most of the data would appear on just a few 
stems, making it hard to identify the shape of the distribution. 
(b) Key: 12 | 1 means that 12.1% of that state’s residents are aged 
25 to 34. (c) The distribution of percent of residents aged 25—34 
is roughly symmetric with a possible outlier at 16.0%. ‘The center 
is around 13%. Other than the outlier at 16.0%, the values vary 
from 11.4% to 15.1%. 

1.47 (a) The stemplots are given in the next column. The stemplot 
with split stems makes it easier to see the shape of the distribution. 
(b) The distribution is slightly skewed to the right with a center near 
780 mm, and values that vary from around 600 mm to 960 mm. 
‘There do not appear to be any outliers. (c) In El Nifio years, there 
is typically less rain than in other years (18 of 23 years). 


Without splitting stems With splitting stems 
6 | 03557 6 | 03 
7 | 0124488999 6 | 557 
8 | 113667 7 | 01244 
9 | 06 7 | 88999 
8 | 113 
8 | 667 
Key: 6 | 3 = 630 mm of rain 9/0 
9/6 


1.49 (a) Most people will round their answers to the nearest 10 
minutes (or 30 or 60). The students who claimed 300 and 360 min- 
utes of studying on a typical weeknight may have been exaggerat- 
ing. (b) The stemplots suggest that women (claim to) study more 
than men. The center for women (about 175 minutes) is greater 
than the center for men (about 120 minutes). 


Women Men 
0/0 3 3 3 3 
9 6/0|/5 6 6 6 99 9 
222222 2 2/1/0 2 2 2 2 
8888888888755 5 5}1}/5 5 8 
444 0/2}0 0 3 4 4 
2 
Key: 213 = 230 minutes 3}0 
6/3 


1.51 (a) The distribution is slightly skewed to the left and uni- 
modal. (b) The center is between 0% and 2.5%. (c) The highest 
return was between 10% and 12.5%. Ignoring the low outliers, the 
lowest return was between —12.5% and —10%. (d) About 37% of 
these months (102 out of 273) had negative returns. 

1.53 (a) The histogram is given below. (b) The distribution of 
travel times is roughly symmetric. The center is near 23 minutes 
and the values vary from 15.5 to 30.9 minutes. There do not appear 
to be any outliers. 


Frequency 
ioe) 


14 16 18 20 22 24 26 28 30 32 
Travel time (minutes) 


1.55 The histogram is given below. The distribution of DRP 
scores is roughly symmetric with the center around 35. The 
DRP scores vary from 14 to 54. There do not appear to be any 
outliers. 


Frequency 
> 


12 18 24 30 36 42 48 54 60 
DRP scores 


Solutions S-3 


1.57 (a) The histogram is given below. The distribution of word 
lengths is skewed to the right and single-peaked. The center is 
around 4 letters, with words that vary from | to 15 letters. There do 
not appear to be any outliers. (b) There are more short words in 
Shakespeare’s plays and more very long words in Popular Science 
articles. 


20 


10 


Percent of words 


0 T T ae a ee | 
2 4 6 8 10 12 14 
Length of words (number of letters) 


1.59 The scale on the horizontal axis is very different from one 
graph to the other. 

1.61 A bar graph should be used because birth month is a categori- 
cal variable. A possible bar graph is given below. 
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1.63 (a) The percents for women sum to 100.1% due to round- 
ing errors. (b) Relative frequency histograms are shown below 
because there are considerably more men than women. (c) Both 
histograms are skewed to the right. The center of the women’s 
distribution of salaries is less than the men’s. The distributions 
of salaries are about equally variable, and the table shows that 
there are some outliers in each distribution who make between 


$65,000 and $70,000. 


Percent of men 


i T 
10 20 30 40 50 60 70 
Salary ($ thousands) 


Percent of women 


T T T 
10 20 30 40 SO 60 70 


Salary ($ thousands) 


1.65 The distribution of age is skewed to the right for both males 
and females, meaning that younger people outnumber older peo- 
ple. Among the younger Vietnamese, there are more males than 
females. After age 35, however, females seem to outnumber the 
males, making the center of the female distribution a little greater 
than the male distribution. Both distributions have about the same 
amount of variability and no outliers. 


S-4 Solutions 


1.67 (a) Amount of studying. We would expect some students to 
study very little, but most students to study a moderate amount. Any 
outliers would likely be high outliers, leading to a right-skewed distri- 
bution. (b) Right- versus left-handed. About 90% of the population is 
right-handed (represented by the bar at 0). (c) Gender. We would 
expect a more similar percentage of males and females than for the 
right-handed and left-handed students. (d) Heights. We expect many 
heights near the average and a few very short or very tall people. 

1.69 a 

ial ae 

1.73 d 

1.75 (a) Major League Baseball players who were on the roster on 
opening day of the 2012 season. (b) 6. Two variables are categorical 
(team, position) and the other 4 are quantitative (age, height, 
weight, and salary). 

1.77 (a) 71/858 = 8.3% were elite soccer players and 43/858 = 5.0% 
of the people had arthritis. (b) 10/71 = 14.1% of elite soccer players 
had arthritis and 10/43 = 23.3% of those with arthritis were elite soc- 
cer players. 


Section 1.3 


Answers to Check Your Understanding 

page 53: 1. Because the distribution is skewed to the right, we 
would expect the mean to be larger than the median. 2. Yes. ‘The 
mean is 31.25 minutes, which is greater than the median of 22.5 
minutes. 3. Because the distribution is skewed, the median would 
be a better measure of the center of the distribution. 

page 59: 1. The data in order are: 290, 301, 305, 307, 307, 310, 324, 
345. The 5-number summary is 290, 303, 307, 317, 345. 2. The IOR 
is 14 pounds. The range of the middle half of the data is 14 
pounds. 3. Any outliers occur below 303 — 1.5(14) = 282 or above 
317 + 1.5(14) = 338, so 345 pounds is an outlier. 4. ‘The boxplot is 
given below. 


I? it 
290 300 310 320 330 340 350 
Weight 


page 63: 1. The mean is 75. 2. The table is given below. 


Observation Deviation Squared deviation 
67 67-75 =-8 (-8) = 64 
72 72-75 =-3 (-3)2 = 
76 76-75 =1 v=1 
76 76-75 =1 ?=4 
84 84-75 =9 9 = 81 
Total 0 156 
156 


. . ¥ . 
3. ‘The variance is sy = = 39 inches squared and the stan- 


5-1 
dard deviation is s, = 39 = 6.24 inches. 


4. The players’ heights typically vary by about 6.24 inches from the 
mean height of 75 inches. 


Answers to Odd-Numbered Section 1.3 Exercises 
1.79 x= 85 


1.81 (a) median = 85 (b) x = 79.33 and median = 84. The 
median did not change much but the mean did, showing that the 
median is more resistant to outliers than the mean. 

1.83 The mean is $60,954 and the median is $48,097. The distri- 
bution of salaries is likely to be quite right skewed because of a few 
people who have a very large income, making the mean larger than 
the median. 

1.85 The team’s annual payroll is 1.2(25) = 30 or $30 million. No, 
because the median only describes the middle value in the distribu- 
tion. It doesn’t provide specific information about any of the other 
values. 

1.87 (a) Estimating the frequencies of the bars (from left to right) 
as 10, 40, 42, 58, 105, 60, 58, 38, 27, 18, 20, 10, 5, 5, 1, and 3, the 


3504 


mean is x = = 7.01. The median is the average of the 250th 


and 251st values, which is 6. (b) Because the median is less than the 
mean, we would use the median to argue that shorter domain 
names are more popular. 


1.89 (a) IOR = 91 — 78 = 13. The middle 50% of the data have 
a range of 13 points. (b) Any outliers are below 78 — 1.5(13) = 58.5 
or above 91 + 1.5(13) = 110.5. There are no outliers. 

1.91 (a) Outliers are anything below 3 — 1.5(40) = —57 or above 
43 + 1.5(40) = 103, so 118 is an outlier. The boxplot is shown 
below. (b) The article claims that teens send 1742 texts a month, 
which is about 58 texts a day. Nearly all of the members of the class 
(21 of 25) sent fewer than 58 texts per day, which seems to contra- 
dict the claim in the article. 

120 * 

100 _ 


Number of texts 


1.93 (a) Positive numbers indicate students who had more text 
messages than calls. Because the Ist quartile is about 0, roughly 
75% of the students had more texts than calls, which supports the 
article’s conclusion. (b) No. Students in statistics classes tend to be 
upperclassmen and their responses might differ from those of 
underclassmen. 

1.95 (a) About 3% and —3.5%. (b) About 0.1%. (c) The stock fund 
is much more variable. It has higher positive returns, but also higher 
negative returns. 


2.06 
1.97 (a) 5. =,4 j=" = 0.6419 mg/dl. (b) The phosphate level 
typically varies from the mean by about 0.6419 mg/dl. 


1.99 (a) Skewed to the right, because the mean is much larger than 
the median and Q; is much further from the median than Q). 
(b) The amount of money spent typically varies from the mean by 
$21.70. (c) Any points below 19.06 — 1.5(26.66) = —20.93 or 
above 45.72 + 1.5(26.66) = 85.71 are outliers. Because the maxi- 
mum of 93.34 is greater than 85.71, there is at least one outlier. 
1.101 Yes. For example, in data set 1, 2, 3, 4,5, 6, 7, 8 the IOR is 4. 
If 8 is changed to 88, the IOR will still be 4. 

1.103 (a) One possible answer is 1, 1, 1, 1. (b) 0, 0, 10, 10. (c) For 
part (a), any set of four identical numbers will have s, = 0. For part 
(b), however, there is only one possible answer. We want the values 
to be as far from the mean as possible, so our best choice is two 
values at each extreme. 


1.105 State: Do the data indicate that men and women differ in 
their study habits and attitudes toward learning? Plan: We will draw 
side-by-side boxplots of the data about men and women; compute 
summary statistics; and compare the shape, center, and spread of 
both distributions. Do: The boxplots are given below, as is a table of 
summary statistics. 


Female t 1 * 
Male 
T T T T T T T T 
60 80 100 120 140 160 180 200 220 
SSHA score 
Variable WN Mean StDev Minimum lon Median Q3 Maximum 
Women 18 141.06 26.44 101.00 126.00 138.50 154.00 200.00 
Men 20 121.25 32.85 70.00 98.00 114.50 143.00 187.00 


Both distributions are slightly skewed to the right. Both the mean 
and median are higher for women than for men. The scores for 
men are more variable than the scores for women. There are no 
outliers in the male distribution and a single outlier at 200 in the fe- 
male distribution. Conclude: Men and women differ in their study 
habits and attitudes toward learning. ‘The typical score for females 
is about 24 greater than the typical score for males. Female scores 
are also more consistent than male scores. 


1.107 d 

1.109 e 

1.111 A histogram is given below. This distribution is roughly 
symmetric with a center around 170 cm and values that vary from 
145.5 cm to 191 cm. There do not appear to be any outliers. 


rere 


CNBR DOCH 


Frequency 


T T T T 
50 160 170 180 190 
Heights (cm) 


1.113 Women appear to be more 


ikely to engage in behaviors that 


Solutions $-5 


R1.3 (a) The “bars” are different widths. For example, the bar for 
“send/receive text messages” should be roughly twice the size of the 
bar for “camera” when it is actually about 4 times as large. (b) No, 
because they do not describe parts of a whole. Students were free to 
answer in more than one category. (c) A bar graph is given below. 
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R14 (a) 148/219 = 67.6%. Marginal distribution, because it is part 
of the distribution of one variable for all categories of the other vari- 
able. (b) 78/82 = 95.1% of the younger students were Facebook 
users. 78/148 = 52.7% of the Facebook users were younger. 

R1.5 There does appear to be an association between age and 
Facebook status. From both the table and the graph given below, 
we can see that as age increases, the percent of Facebook users 
decreases. For younger students, about 95% are members. ‘That 
drops to 70% for middle students and drops even further to 31.3% 
for older students. 


Facebook user? 


are indicative of good “habits of mind.” They are especially more 
likely to revise papers to improve their writing. The difference is a 
little smaller for seeking feedback on their work, although the 
percentage is still higher for females. 


Answers to Chapter 1 Review Exercises 


R1.1 (a) Movies. (b) Quantitative: Year, time, box office sales. 
Categorical: Rating, genre. Note: Year might be considered categor- 
ical if we want to know how many of these movies were made each 
year rather than the average year. (c) This movie is Avatar, released 
in 2009. It was rated PG-13, runs 162 minutes, is an action film, and 
had box office sales of $2,781,505,847. 


R1.2 A bar chart is given below. 
60 


Percent 
w 
i=) 


Age Yes No 
Younger (18-22) 95.1% 4.9% 
Middle (23-27) 70.0% 30.0% 
Older (28 and up) 31.3% 68.7% 
100 
80 
a=] 
3 60 
2 40 
20 
0 TT ra TT 
Facebook Yes No Yes No Yes No 
Age Younger Middle Older 


R1.6 (a) A stemplot is given below. (b) The distribution is roughly 
symmetric with one possible outlier at 4.88. The center of the dis- 
tribution is between 5.4 and 5.5. The densities vary from 4.88 to 
5.85. (c) Because the distribution is roughly symmetric, we can use 
the mean to estimate the Earth’s density to be about 5.45 times the 
density of water. 


48|8 

49 

50/7 

51]0 

5216799 

53 | 04469 Key: 48 | 8 = 4.88 
54 | 2467 

55 | 03578 

56 | 12358 

57] 59 


5815 


S-6 Solutions 


R1.7 (a) A histogram is given below. The survival times are right- 
skewed, as expected. ‘The median survival time is 102.5 days and 
the range of survival times is 598 — 43 = 555 days. There are sev- 
eral high outliers with survival times above 500. 


Frequency 


T t T T —_ T 
120 240 360 480 600 
Survival time (days) 


(b) The boxplot is given below. 


Survival time (days) 
3 


(c) Use the median and IOR to summarize the distribution 
because the outliers will have a big effect on the mean and standard 
deviation. 

R1.8 (a) About 20% of low-income and 33% of high-income 
households. (b) The shapes of both distributions are skewed to 
the right; however, the skewness is much stronger in the distribu- 
tion for low-income households. On average, household size is 
larger for high-income households. One-person households 
might have less income because they would include many young 
single people who have no job or retired single people with a 
fixed income. 

R1.9 (a) The amount of mercury per can of tuna will typically 
vary from the mean by about 0.3 ppm. (b) Any point below 0.071 
— 1.5(0.309) = —0.393 or above 0.38 + 1.5(0.309) = 0.8435 
would be considered an outlier. There are no low outliers, but 
there are several high outliers. (c) The distribution of the amount 
of mercury in cans of tuna is highly skewed to the right. The 
median is 0.18 ppm and the JOR is 0.309 ppm. 

R1.10 The distribution for light tuna is skewed to the right with 
several high outliers, while the distribution for albacore tuna is 
more symmetric with just a couple of high outliers. Because it has 
a greater center, the albacore tuna generally has more mercury. 
However, the light tuna has a much bigger spread of values, with 
some cans having as much as twice the amount of mercury as the 
largest amount in the albacore tuna. 


Answers to Chapter 1 AP® Statistics Practice Test 


Tl. d 
T1.2 e 
T1.3 b 
T14 b 
T1l5c¢ 
if By sia 
TL.7 b 
T1.8 c 


T1.9 e 

T1.10 b 

T1.11 d 

T1.12 (a) A histogram is given below. (b) Any point below 
30 — 1.5(47) = —40.5 or above 77 + 1.5(47) = 147.5 is an outlier. 
So 151 minutes is an outlier. (ce) Median and IOR, because the 
distribution is skewed and has a high outlier. 


10 


Frequency 


oN BDO 


0 40 80 120 160 
Time on Internet (minutes) 


T1.13 (a) Row totals are 1154, 53, and 1207. Column totals are 
785, 375, 47, and 1207. (b) Nondiabetic: 96.1% none and 3.9% 
one or more. Prediabetic: 96.5% none and 3.5% one or more. 
Diabetic: 80.9% none and 19.1% one or more. (c) The graph is 
given below. 
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(d) Yes. Nondiabetics and prediabetics appear to have babies with 
birth defects at about the same rate. However, those with diabetes 
have a much higher rate of babies with birth defects. 

T1.14 (a) Between 550 and 559 hours. (b) Because it has a higher 
minimum lifetime or because its lifetimes are more consistent (less 
variable). (c) Because it has a higher median lifetime. 

T1.15 Side-by-side boxplots and descriptive statistics for both leagues 
are given below. Both distributions are roughly symmetric, although 
there are two low outliers in the NL. The data suggest that the num- 
ber of home runs is somewhat less in the NL. All 5 numbers in the 
5-number summary are less for the NL teams than for the AL teams. 
However, there is more variability among the AL teams. 


American 
League 


National 


League am re 


20 30 40 50 60 70 80 
Home runs 


Variable N Mean StDev Minimum Q, Median Q,; Maximum 


American14 56.93 12.69 35.00 49.00 57.50 68.00 77.00 
League 
National 14 50.14 11.13 29.00 
League 


46.00 50.50 55.00 67.00 


Chapter 2 
Section 2.1 


Answers to Check Your Understanding 

page 89: 1. c 2. Her daughter weighs more than 87% of girls her 
age and she is taller than 67% of girls her age. 3. About 65% of calls 
lasted less than 30 minutes, which means that about 35% of calls 
lasted 30 minutes or longer. 4. Q; = 13 minutes, Q; = 32 minutes, 
and JOR = 19 minutes. 

page 91: 1. z = —0.466. Lynette’s height is 0.466 standard devia- 
tions below the mean height of the class. 2. z = 1.63. Brent’s 
height is 1.63 standard deviations above the mean height of the 


4 — 
class. 3. —0.85 = i 2 


, so 0 = 2,35 inches. 


page 97: 1. Shape will not change. However, it will multiply the 
center (mean, median) and spread (range, JOR, standard 
deviation) by 2.54. 2. Shape and spread will not change. It will, 
however, add 6 inches to the center (mean, median). 3. Shape 
will not change. However, it will change the mean to 0 and the 
standard deviation to 1. 


Answers to Odd-Numbered Section 2.1 Exercises 

2.1 (a) She is at the 25th percentile, meaning that 25% of the girls 
had fewer pairs of shoes than she did. (b) He is at the 85th percen- 
tile, meaning that 85% of the boys had fewer pairs of shoes than he 
did. (c) The boy is more unusual because only 15% of the boys have 
as many or more than he has. ‘The girl has a value that is closer to 
the center of the distribution. 

2.3 A percentile only describes the relative location of a value in a 
distribution. Scoring at the 60th percentile means that Josh’s score 
is better than 60% of the students taking this test. His correct per- 
centage could be greater than 60% or less than 60%, depending on 
the difficulty of the test. 

2.5 The girl weighs more than 48% of girls her age, but is taller 
than 78% of the girls her age. 

2.7 (a) The student sent about 205 text messages in the 2-day period 
and sent more texts than about 78% of the students in the sample. 
(b) Locate 50% on the y-axis, read over to the points, and then go 
down to the x-axis. The median is approximately 115 text messages. 
2.9 (a) IQR ~ $46 — $19 = $27 (b) About the 26th percentile. 
(c) The histogram is below. 


Percent 
a 


Amount spent ($) 


2.11 Eleanor. Her standardized score (z = 1.8) is higher than 
Gerald’s (z = 1.5). 

2.13 (a) Your bone density is far below average —about 1.5 times 
farther below average than a typical below-average density. 


948 — 956 
: gives o = 5.52 g/cm’. 


(b) Solving —1.45 = 


Solutions S-7 


2.15 (a) He is at the 76th percentile, meaning his salary is higher 
than 76% of his teammates. (b) z = 0.79. Lidge’s salary was 0.79 
standard deviations above the mean salary. 

2.17 Multiply each score by 4 and add 27. 

2.19 (a) mean = 87.188 inches and median = 87.5 inches. 
(b) The standard deviation (3.20 inches) and IOR (3.25 inches) do 
not change because adding a constant to each value in a distribu- 
tion does not change the spread. 

2.21 (a) mean = 5.77 feet and median = 5.79 feet. (b) Standard 
deviation = 0.267 feet and IOR = 0.271 feet. 


2.23 Mean = = (25) + 32 =77°F and standard deviation = 
= (2) = 3.6°F. 


L295 C 

2.21 ¢ 

2.29 c 

2.31 The distribution is skewed to the right with a center around 
20 minutes and the range close to 90 minutes. The two largest 
values appear to be outliers. 


Section 2.2 


Answers to Check Your Understanding 

page 107: 1. It is legitimate because it is positive everywhere and 
it has total area under the curve = 1. 2. 12% 3. Point A in the 
graph below is the approximate median. About half of the area is to 
the left of A and half of the area is to the right of A. 4. Point B in 
the graph below is the approximate mean (balance point). The 
mean is less than the median in this case because the distribution is 
skewed to the left. 


Total area under 
curve = 1 


Area = 0.12 


page 112: 1. The graph is given below. 2. Approximately 


Ww = 16%. 3. Approximately “a = 16% 
have heights below 62 inches and approximately 
100% 8 0.15% of young women have heights above 


72 inches, so the remaining 83.85% have heights between 62 and 
72 inches. 


T T T T T 
57.0 59.5 62.0 64.5 67.0 69.5 72.0 


Heights 


page 116: (All graphs are shown on the following page.) 1. The 
proportion is 0.9177. 2. The proportion is 0.9842. 3. The propor- 
tion is 0.9649 — 0.2877 = 0.6772. 4. The z-score for the 20th per- 
centile is ¢ = —0.84. 5. 45% of the observations are greater than 
z= 0.13. 


S-8 Solutions 

7% y 

0 1.39 -2.15 0 

z z 
MN A\ 

T T T T 

-0.56 0 1.81 0 
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z 
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T 
0 
z: 


page 121: 1. For 14-year-old boys, the amount of cholesterol fol- 
lows a N(170, 30) distribution and we want to find the percent of 
boys with cholesterol of more than 240 (see graph below). 
240 — 170 : 

= —~— = 2.33. From Table A, the proportion of z-scores 
above 2.33 is 1 — 0.9901 = 0.0099. Using technology: normalcdf 
(lower: 240,upper:1000,:170,0:30) = 0.0098. About 
1% of 14-year-old boys have cholesterol above 240 mg/dl. 2. For 
14-year-old boys, the amount of cholesterol follows a N(170, 30) 
distribution and we want to find the percent of boys 
with cholesterol between 200 and 240 (see graph below). 
_ 200 — 170 _ 240 — 170 


Zz 30 =landz 


proportion of z-scores between | and 2.33 is 0.9901 — 0.8413 = 
0.1488. Using technology: normalcdf (lower:200,upper: 
240,p:170,0:30) = 0.1488. About 15% of 14-year-old boys 
have cholesterol between 200 and 240 mg/dl. 3. For Tiger Woods, 
the distance his drives travel follows an N (304, 8) distribution and 
the 80th percentile is the boundary value x with 80% of the 
distribution to its left (see graph below). A z-score of 0.84 gives the 


— 304 
area closest to 0.80 (0.7995). Solving 0.84 = ae 


310.7. Using technology: invNorm(area:0.8,11:304,0:8) 
= 310.7. The 80th percentile of Tiger Woods’s drive lengths is 
about 310.7 yards. 


= 2.33. From Table A, the 


gives x = 


T T T 
170 200 240 


Cholesterol levels 


T i 
170 240 
Cholesterol levels 


280 «QRS 20% 304312 320 328 
Distance (yards) 


Answers to Odd-Numbered Section 2.2 Exercises 
2.33 Sketches will vary, but here is one example: 


LY“~ 


2.35 (a) It is on or above the horizontal axis everywhere, and the 
1 l 1 
X3=1. xXl= 

3 ; (b) 3 3 
1.1 — 0.8 = 0.3, the proportion is 3 xX 0.3 = 0.1. 
2.37 Both are 1.5. 


2.39 (a) Mean is C, median is B. (b) Mean is B, median is B. 
2.41 The graph is shown below. 


area beneath the curve is . (c) Because 


T T T T T 
61.5 64.0 66.5 69.0 71.5 74.0 76.5 
Men’s height (inches) 


2.43 (a) Between 69 — 2(2.5) = 64 and 69 + 2(2.5) = 74 inches. 
Ca ‘oO ‘Oo 6 ‘Oo 
(b) About “a = 2.5%. (c) About = = 16% of 


100% — 95% 


men are shorter than 66.5 inches and = 2.5% are 


shorter than 64 inches, so approximately 16% — 2.5% = 13.5% of 

men have heights between 64 inches and 66.5 inches. (d) Because 

ee = 16% of the area is to the right of 71.5, 71.5 is at the 

84th percentile. 

2.45 Taller curve: standard deviation ~ 0.2. Shorter curve: stan- 

dard deviation ~ 0.5. 

2.47 (a) 0.9978. (b) 1 — 0.9978 = 0.0022 (c) 1 — 0.0485 = 0.9515 

(d) 0.9978 — 0.0485 = 0.9493 

2.49 (a) 0.9505 — 0.0918 = 0.8587 (b) 0.9633 — 0.6915 = 0.2718 

2.51 (a)z = —1.28 (b) z= 0.41 

2.53 (a) The length of pregnancies follows a N(266, 16) distribu- 

tion and we want the proportion of pregnancies that last less than 
240 — 266 

240 days (see graph below). z = ee 1.63. From ‘Table 

A, the proportion of z-scores less than — 1.63 is 0.0516. Using tech- 

nology: normalcdf (lower:-1000,upper:240,p:266, 

o:16) = 0.0521. About 5% of pregnancies last less than 240 days, 

so 240 days is at the 5th percentile of pregnancy lengths. 


240 206 
Length of pregnancy (days) 
(b) The length of pregnancies follows a N(266, 16) distribution and 
we want the proportion of pregnancies that last between 240 


240 — 266 

——— = -1.63 
16 

and z= = 0.25. From ‘Table A, the proportion of 


z-scores between —1.63 and 0.25 is 0.5987 — 0.0516 = 0.5471. 
Using technology: normalcdf (lower:240,upper:270, 


and 270 days (see the following graph). z = 
270 — 266 


p:266,0:16) = 0.5466. About 55% of pregnancies last between 
240 and 270 days. 


240 266 270 
Length of pregnancy (days) 
(c) The length of pregnancies follows a N(266, 16) distribution 
and we are looking for the boundary value x that has an area of 0.20 
to the right and 0.80 to the left (see graph below). A z-score of 0.84 
x — 266 
16 

x = 279.44. Using technology: invNorm(area:0.8,:266, 
0:16) = 279.47. The longest 20% of pregnancies last longer than 
279.47 days. 


gives the area closest to 0.80 (0.7995). Solving 0.84 = 


gives 


266 
Length of pregnancy (days) 
2.55 (a) For large lids, the diameter follows a N(3.98, 0.02) 
distribution and we want to find the percent of lids that have diam- 
3.95 — 3.9 

eters less than 3.95 (see graph below). z= oe 1.5. 
From ‘Table A, the proportion of z-scores below —1.5 is 0.0668. 
Using technology: normalcdf (lower:-1000,upper:3.95, 
p:3.98,0:0.02) = 0.0668. About 7% of the large lids are too 
small to fit. 


3.95 3.98 
Lid width (inches) 


(b) For large lids, the diameter follows a N(3.98, 0.02) distribution 
and we want to find the percent of lids that have diameters greater 


4.05 — 3.98 = 3.5. From Table A, 


the proportion of z-scores above 3.50 is approximately 0. Using tech- 
nology: normalcdf (lower:4.05,upper:1000,»1:3.98, 
6:0.02) = 0.0002. Approximately 0% of the large lids are too 
big to fit. 


than 4.05 (see graph below). z = 


T T 
3.98 4.05 
Lid width (inches) 


(c) Make a larger proportion of lids too small. If lids are too small, 
customers will just try another lid. But if lids are too large, the cus- 
tomer may not notice and then spill the drink. 


Solutions S-9 


2.57 (a) For large lids, the diameter follows a N(p, 0.02) 
distribution and we want to find the value of yz that will result in 
only 1% of lids that are too small to fit (see graph below). 
A z-score of —2.33 gives the value closest to 0.01 (0.0099). 


3.95 — 
Solving = 239 = — 


invNorm(area:0.01,:0,0:1) gives z = —2.326. Solving 
3.95 — 
2.326 = 
0.02 


the mean diameter to approximately 4 = 4.00 to ensure that only 


gives pp = 4.00. Using technology: 


gives jo = 4.00. The manufacturer should set 


1% of lids are too small. (b) For large lids, the diameter follows 
a N(3.98, o) distribution and we want to find the value of o that 
will result in only 1% of lids that are too small to fit (see graph 
below). A z-score of —2.33 gives the value closest to 0.01 (0.0099). 


Solving —2.33 = cela 
o 


invNorm(area:0.01,:0,0:1) gives z = —2.326. Solving 
7326 = 3.95 — 3.98 


gives o = 0.013. A standard deviation of at 
most 0.013 will result in only 1% of lids that are too small to fit. 


gives 0 = 0.013. Using technology: 


Area = 0.01 


i 
Diameter (inches) 
(c) Reduce the standard deviation. This will reduce the number of 
lids that are too small and the number of lids that are too big. If we 
make the mean a little larger as in part (a), we will reduce the num- 
ber of lids that are too small, but we will increase the number of lids 
that are too big. 


Area = 0.01 


T 
3.98 
Diameter (inches) 


x — 64.5 
2.5 


: 
gives x = 67.7 


2.59 (a) zg = —1.28 and z = 1.28 (b) Solving —1.28 = 
G2 


64. 
2.5 


gives x = 61.3 inches and solving 1.28 = 
inches. 


60 — 7 
and ee — 
oO 


oO 


2.61 Solving 1.04 = gives po = 41.43 


minutes and ¢ = 17.86 minutes. 


2.63 (a) A histogram is given below. The distribution of shark 
lengths is roughly symmetric and somewhat bell-shaped, with a 
mean of 15.586 feet and a standard deviation of 2.55 feet. (b) 30/44 
= 68.2% , 42/44 = 95.5%, and 44/44 = 100%. These are very close 
to the 68—95 —99.7 rule. 


IMORMAL FLOAT AUTO REAL RADIAN HP f 


S-10 Solutions 


(c) A Normal probability plot is given below. Except for one small 
shark and one large shark, the plot is fairly linear, indicating that the 
distribution of shark lengths is approximately Normal. 


o 2 ‘ 

=] 

5 . 

2 1 ra 

& ? 

3 : on 

e 

2-41 

& f 

Bol, iJ 
T if T T T T T T 
10 12 14 16 18 20 22 24 


Length of shark (feet) 


(d) All indicate that shark lengths are approximately Normal. 

2.65 The distribution is close to Normal because the plot is nearly 
linear. There is a small “wiggle” between 120 and 130, with several 
values a little larger than would be expected in a Normal distribu- 
tion. Also, the smallest value and the two largest values are a little 
farther from the mean than would be expected in a Normal 
distribution. 

2.67 No. If it was Normal, then the minimum value should be 
around 2 or 3 standard deviations below the mean. However, the 
actual minimum has a z-score of just z = —1.09. Also, if the distri- 
bution was Normal, the minimum and maximum should be about 
the same distance from the mean. However, the maximum is much 
farther from the mean (20,209) than the minimum (8741). 

2.69 b 

2.71 b 

fas a 

2.75 For both kinds of cars, we see that the highway mileage is greater 
than the city mileage. The two-seater cars have a more variable distri- 
bution, both on the highway and in the city. Also the mileage values 
are slightly lower for the two-seater cars than for the minicompact cars, 
both on the highway and in the city, with a greater difference on the 
highway. All four distributions are roughly symmetric. 


Answers to Chapter 2 Review Exercises 


R2.1 (a) z = 1.20. Paul’s height is 1.20 standard deviations above 
the average male height for his age. (b) 85% of boys Paul’s age are 
shorter than Paul. 

R2.2 (a) 58th percentile (b) IQR = 11 — 2.5 = 8.5 hours per week. 
R2.3 (a) The shape of the distribution would not change. 


Mean = 328 = 13.32 meters, median = 228 = 12.80 meters, 
ine 

standard deviation = 3287 3.81 meters, 

IOR= a2 = 3.81 meters. (b) Mean = 43.7 — 42.6 = 1.1 feet; 


standard deviation = 12.5 feet, because subtracting a constant from 
each observation does not change the spread. 

R2.4 (a) The median (line A in the graph below) should be slightly to 
the right of the main peak, with half of the area to the left and half to 
the right. (b) The mean (line B in the graph below) should be slightly 
to the right of the line for the median at the balancing point. 


R2.5 (a) Between 336 — 3(3) = 327 days and 336 + 3(3) = 345 
100% — 68% 
days. (b) About JOOS 8H 16%. 


R2.6 (a) 0.9616 — 0.0122 = 0.9494 (b) If 35% of all values are 
greater than a particular z-value, then 65% are lower. A z-score of 
0.39 gives the value closest to 0.65 (0.6517). Using technology: 
invNorm(area:0.65,:0,0:1) gives z = 0.385. 


0.65 

I T T T 
-2.25 0 1.77 0 
z Zz 


R2.7 (a) Birth weights follow a N(3668, 511) distribution and we 
want to find the percent of babies with weights less than 2500 


2500 — 3668 2.29. From 


‘Table A, the proportion of z-scores below —2.29 is 0.0110. Using 
technology: normalcdf (lower:-1000,upper:2500,y: 
3668,0:511) = 0.0111. About 1% of babies will be identified as 
low birth weight. 


grams (see graph below). z= 


T T 
2500 3668 
Birth weights (grams) 


(b) Birth weights follow a N(3668, 511) distribution. The Ist quar- 
tile is the boundary value with 25% of the area to its left. The 3rd 
quartile is the boundary value with 75% of the area to its left (see 
graph below). A zscore of —0.67 gives the value closest to 0.25 


(0.2514). Solving 7 gives Q, = 3325.63. A 


a 
z-score of 0.67 gives the value closest to 0.75 (0.7486). Solving 
— 366 
0.67 = —_ gives Q3 4010.37. Using technology: 


invNorm(area:0.25,:3668,06:511) gives Q; = 3323.34 
and invNorm(area:0.75,:3668,0:511) gives Q; 
4012.66. The quartiles are Q; = 3323.34 grams and Q3; = 
4012.66 grams. 


Area = 0.25 Area = 0.25 


T T T T T 
2135 2646 3157 3668 4179 4690 5201 
Birth weight (grams) 


R2.8 (a) The amount of ketchup dispensed follows a N(1.05, 0.08) 
distribution and we want to find the percent of times that the amount 
of ketchup dispensed will be between 1 and 1.2 ounces (see 


1.2 — 1.05 1 — 1.05 
~ 0.08 = 1.88 and z= 0.08 
From Table A, the proportion of z-scores between —0.63 and 1.88 
is 0.9699 — 0.2643 = 0.7056. Using technology: normalcdf 
(lower:1,upper:1.2,p:1.05,0:0.08) =0.7036. About 


70% of the time the dispenser will put between | and 1.2 ounces of 


= — 0.63. 


graph below). z = 


ketchup on a burger. 


(b) The amount of ketchup dispensed follows a N(1.1, 0) distribu- 
tion and we want to find the value of o that will result in at least 
99% of burgers getting between | and 1.2 ounces of ketchup (see 
graph below). Because the mean of 1.1 is in the middle of the 
interval from | to 1.2, we are looking for the middle 99% of the 
distribution. This leaves 0.5% in each tail. A z-score of —2.58 gives 


the value closest to 0.005 (0.0049). Solving — 2.58 = gives 


o = 0.039. Using technology: invNorm(area:0.005,1:0,0:1) 
—1.] 
2.576 = i= gives og = 0.039.A 


standard deviation of at most 0.039 ounces will result in at least 


gives z = —2.576. Solving 


99% of burgers getting between | and 1.2 ounces of ketchup. 


Area = 0.99 


T 
1.10 


R2.9 If the distribution is Normal, the 10 and 90" percentiles 
must be equal distances above and below the mean. Thus, the 


+ 475 

at = 250 points. The 10" percentile in a standard 
tie atta ; 25 — 250 

Normal distribution is z= —1.28. Solving — 1.28 = ————,, we 

G 


mean is 


get 0 = 175.8. Using technology: invNorm(area:0.10,p1:0, 


=) 
1.282 = =e and o = 175.5. 


6:1) gives z = —1.282, so 


R2.10 A histogram and Normal probability plot are given below. 
The histogram is roughly symmetric but not very bell-shaped. 
The Normal probability plot, however, is roughly linear. For 
these data, x = 0.8004 and s, = 0.0782. Although the percentage 
within 1 standard deviation of the mean (55.1%) is less than 
expected (68%), the percentage within 2 (93.9%) and 3 standard 
deviations (100%) match the 68—95—99.7 rule quite well. It is 
reasonable to say that these data are approximately Normally 
distributed. 


Frequency 


T T T T 
0.60 0.70 0.80 0.90 
Length of thorax (mm) 


Solutions S-11 


2 2 : ‘. 
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z 
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30 Pe la 
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i 

* ° 
ml 2 : 

T T T T T [i 

0.60 0.65 0.70 0.75 0.80 0.85 0.90 0.95 
Length of thorax (mm) 


R2.11 The steep, nearly vertical portion at the bottom and the 
clear bend to the right indicate that the distribution of the data is 
right-skewed with several outliers and not approximately Normally 
distributed. 


Answers to Chapter 2 AP® Statistics Practice Test 


TZ 
T2.2 
123 
T2.4 
T25 
12.6 
T27 
T2.8 
12.9 
T2.10 c 

T2.11 (a) Jane’s performance was better. Because her performance 
(40) exceeded the standard for the Presidential award (39), she per- 
formed above the 85th percentile. Matt’s performance (40) met the 


oogqoqaoaonrwraa 


standard for the National award (40), meaning he performed at the 
50th percentile. (b) Because Jane’s score has a higher percentile 
than Matt’s score, she is farther to the right in her distribution than 
Matt is in his. Therefore, Jane’s standardized score will likely be 
greater than Matt’s. 

12.12 (a) For male soldiers, head circumference follows a N(22.8, 
1.1) distribution and we want to find the percent of soldiers with 
head circumference less than 23.9 inches (see graph below). 
_ 23.9 — 22.8 
dl 
below 1 is 0.8413. Using technology: normalcdf (lower: 
—1000,upper:23.9,:22.8,0:1.1) = 0.8413. About 84% 
of soldiers have head circumferences less than 23.9 inches. Thus, 
23.9 inches is at the 84th percentile. 


Zz =1. From Table A, the proportion of zscores 


N(22.8, 1.1) 


22.8 23.9 
Head circumference (inches) 


(b) For male soldiers, head circumference follows a N(22.8, 1.1) 
distribution and we want to find the percent of soldiers with head 
circumferences less than 20 inches or greater than 26 inches (see 

20 — 22.8 26 — 22.8 
graph below). z = as i 2.55 and z= aa. 
From Table A, the proportion of z-scores below z = —2.55 is 0.0054 
and the proportion of z-scores above 2.91 is 1 — 0.9982 = 0.0018, 
for a total of 0.0054 + 0.0018 = 0.0072. Using technology: 
1 — normalcdf (lower:20,upper:26,0:22.8,0:1.1) 
= 1 — 0.9927 = 0.0073. A little less than 1% of soldiers have head 


= 2.91. 


S-12 Solutions 


circumferences less than 20 inches or greater than 26 inches and 
require custom helmets. 


T T T ig T 
19.5 20.6 21.7 22.8 23.9 25.0 26.1 
Head circumference (inches) 


(c) For male soldiers, head circumference follows a N(22.8, 1.1) 
distribution. The Ist quartile is the boundary value with 25% of the 
area to its left. The 3rd quartile is the boundary value with 75% of 
the area to its left (see graph below). A z-score of —0.67 gives the 


28 
value closest to 0.25 (0.2514). Solving — 0.67 = = 


Q; = 22.063. A zscore of 0.67 gives the value closest to 0.75 
(0.7486). Solving 0.67 = — 


gives 


2. 
od gives QO; = 23.537. Using tech- 


nology: invNorm(area:0.25,y:22.8,0:1.1) givesQ) = 
22.058 and invNorm(area:0.75,y:22.8,0:1.1) gives 
QO; = 23.542. Thus, IQR = 23.542 — 22.058 = 1.484 inches. 


T T T T T T T 
19.5 206 21.7 228 23.9 25.0 26.1 
Head circumference (inches) 


12.13 No. First, there is a large difference between the mean and 
the median. In a Normal distribution, the mean and median are the 
same, but in this distribution the mean is 48.25 and the median is 
37.80. Second, the distance between the minimum and the median 
is 35.80 but the distance between the median and the maximum is 
167.10. Ina Normal distribution, these distances should be about the 
same. Both of these facts suggest that the distribution is skewed to the 
right. 


Chapter 3 
Section 3.1 


Answers to Check Your Understanding 

page 144: 1. Explanatory: number of cans of beer. Response: 
blood alcohol level. 2. Explanatory: amount of debt and income. 
Response: stress caused by college debt. 

page 149: 1. Positive. The longer the duration of the eruption, the 
longer we should expect to wait between eruptions because long 
eruptions use more energy and it will take longer to build up the 
energy needed to erupt again. 2. Roughly linear with two clusters. 
The clusters indicate that, in general, there are two types of erup- 
tions—shorter eruptions that last around 2 minutes and longer 
eruptions that last around 4.5 minutes. 3. Fairly strong. The points 
don’t deviate much from the linear form. 4. There are a few pos- 
sible outliers around the clusters. However, there aren’t many and 
potential outliers are not very distant from the main clusters of 
points. 5. How long the previous eruption was. 

page 153: (a) r ~ 0.9. This indicates that there is a strong, positive 
linear relationship between the number of boats registered in 


Florida and the number of manatees killed. (b) r ~ 0.5. This indi- 
cates that there is a moderate, positive linear relationship between 
the number of named storms predicted and the actual number of 
named storms. (c) r ~ 0.3. This indicates that there is a weak, 
positive linear relationship between the healing rate of the two front 
limbs of the newts. (d) r= — 0.1. This indicates that there is a 
weak, negative linear relationship between last year’s percent return 
and this year’s percent return in the stock market. 


Answers to Odd-Numbered Section 3.1 Exercises 

3.1 Explanatory: water temperature (quantitative). Response: 
weight change (quantitative). 

3.3 (a) Positive. Students with higher IQs tend to have higher 
GPAs and vice versa because both IO and GPA are related to 
mental ability. (b) Roughly linear, because a line through the 
scatterplot of points would provide a good summary. Moderately 
strong, because most of the points would be close to the line. (c) 
IQ ~ 103 and GPA = 0.4. 

3.5 A scatterplot is shown below. 


Pack weight (Ib) 


100 110 120 130 140 150 160 170 180 190 
Body weight (Ib) 


3.7 (a) There is a positive association between backpack weight 
and body weight. For students under 140 pounds, there seems to be 
a linear pattern in the graph. However, for students above 140 
pounds, the association begins to curve. Because the points vary 
somewhat from the linear pattern, the relationship is only moder- 
ately strong. (b) The hiker with body weight 187 pounds and pack 
weight 30 pounds. This hiker makes the form appear to be nonlin- 
ear for weights above 140 pounds. Without this hiker, the associa- 
tion would look very linear for all body weights. 

3.9 (a) Ascatterplot is shown below. (b) The relationship is curved. 
Large amounts of fuel were used for low and high values of speed 
and smaller amounts of fuel were used for moderate speeds. This 
makes sense because the best fuel efficiency is obtained by driving 
at moderate speeds. (c) Both directions are present in the scatter- 
plot. The association is negative for lower speeds and positive for 
higher speeds. (d) The relationship is very strong, with little 
deviation from a curve that can be drawn through the points. 


22.5 
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Speed (km/h) 


Fuel used (liters/100 km) 


3.11 (a) Most of the southern states fall in the same pattern as the 
rest of the states. However, southern states typically have lower 
mean SAT math scores than other states with a similar percent of 
students taking the SAT. (b) West Virginia has a much lower mean 


SAT Math score than the other states that have a similar percent of 
students taking the exam. 
3.13 Ascatterplot is shown below. There is a negative, linear, mod- 
erately strong relationship between the percent returning and the 
number of breeding pairs. 
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Breeding pairs 


3.15 (a)r = 0.9 (b) r=0 (c)r = 0.7 (d) r= — 0.3 (e) r= —0.9 
3.17 (a) Gender is a categorical variable and correlation r is for two 
quantitative variables. (b) The largest possible value of the corre- 
lation isr = 1. (c) The correlation r has no units. 

3.19 (a) The scatterplot below shows a strong, positive linear rela- 
tionship between the two measurements. It appears that all five speci- 
mens come from the same species. (b) The femur measurements 
have x = 58.2 and s,= 13.2. The humerus measurements have 
y = 66 and s, = 15.89. The sum of the z-score products is 3.97620, 
so the correlation coefficient is r= (1/4)(3.97620) = 0.9941. The 
very high value of the correlation confirms the strong, positive linear 
association between femur length and humerus length in the scat- 
terplot from part (a). 


Humerus length (cm) 


40 : T T T T T T T T 
40 45 50 55 60 65 70 75 
Femur length (cm) 


3.21 (a) There is a strong, positive linear association between 
sodium and calories. (b) It increases the correlation. It falls in the 
linear pattern of the rest of the data and observations with unusually 
small or unusually large values of x have a big influence on the 
correlation. 

3.23 (a) The correlation would not change, because correlation is 
not affected by a change of units for either variable. (b) The correla- 
tion would not change, because it does not distinguish between 
explanatory and response variables. 

3.25 (a) Ascatterplot is shown below. (b) r = 0 (c) The correlation 
measures the strength of a linear association, but this plot shows a 
nonlinear relationship between speed and mileage. 


Mileage (mpg) 


20 30 40 50 60 
Speed (mph) 


3.27 a 
3.29 d 


Solutions S-13 


3.31 b 

3.33 A histogram is shown below. The distribution is right-skewed, 
with several possible high outliers. Because of the skewness and 
outliers, we should use the median (5.4 mg) and IOR (5.5 mg) to 
describe the center and spread. 
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Section 3.2 


Answers to Check Your Understanding 

page 168: 1. 40. For each additional week, we predict that a rat 
will gain 40 grams of weight. 2. 100. The predicted weight for a 
newborn rat is 100 grams. 3. y = 100 + 40(16) = 740 grams 
4. 2 years = 104 weeks, so y = 100 + 40(104) = 4260 grams. This 
is equivalent to 9.4 pounds (about the weight of a large newborn 
human). This is unreasonable and is the result of extrapolation. 
page 172: ‘The answer is given in the text. 

page 174: 1. y — y = 31,891 — 36,895 = —$5004 2. The actual 
price of this truck is $5004 less than predicted based on the number 
of miles it has been driven. 3. The truck with 44,447 miles and a 
price of $22,896. This truck has a residual of —$8120, which means 
that the line overpredicted the price by $8120. No other truck had 
a residual that was farther below 0 than this one. 

page 176: 1. The backpack for this hiker was almost 4 pounds 


heavier than expected based on the weight of the hiker. 2. Because 
there appears to be a negative-positive-negative pattern in the resid- 
ual plot, a linear model is not appropriate for these data. 


Answers to Odd-Numbered Section 3.2 Exercises 

3.35 predicted weight = 80 — 6 (days) 

3.37 (a) 1.109. For each 1-mpg increase in city mileage, the pre- 
dicted highway mileage will increase by 1.109 mpg. (b) 4.62 mpg. 
This would represent the highway mileage for a car that gets 0 mpg 
in the city, which is impossible. (c) 22.36 mpg 

3.39 (a) -0.0053. For each additional week in the study, the pre- 
dicted pH decreased by 0.0053 units. (b) 5.43. The predicted pH 
level at the beginning of the study (weeks = 0) is 5.43. (c) 4.635 
3.41 No. 1000 months is well outside the observed time period and 
we can’t be sure that the linear relationship continues after 150 
weeks. 

3.43 The line py = 1 — x is a much better fit. The sum of squared 
residuals for this line is only 3, while the sum of squared residuals 
for y = 3 — 2x is 18. 

3.45 residual = 5.08 — 5.165 = —0.085. The actual pH value for 
that week was 0.085 less than predicted. 

3.47 (a) The scatterplot (with regression line) is shown below. 
(b) y = 31.9 — 0.304x. (c) For each increase of | in the percent of 
returning birds, the predicted number of new adult birds will 
decrease by 0.304. (d) residual = 11 — 16.092 = —5.092. In this 
colony, there were 5.092 fewer new adults than expected based on 


the percent of returning birds. 


S-14 Solutions 


New adults 


Percent return 


3.49 (a) Because there is no obvious leftover pattern in the residual 
plot shown below, a line is an appropriate model to use for these 
data. (b) The point with the largest residual (66% returning) has a 
residual of about —6. This means that the colony with 66% return- 
ing birds has about 6 fewer new adults than predicted based on the 
percent returning. 


Residual 


Percent return 


3.51 No. Because there is an obvious negative-positive-negative 
pattern in the residual plot, a linear model is not appropriate for 
these data. 

3.53 (a) There is a positive, linear association between the two vari- 
ables. ‘There is more variation in the field measurements for larger 
laboratory measurements. (b) No. The points for the larger depths fall 
systematically below the line y = x, showing that the field measure- 
ments are too small compared to the laboratory measurements. 
(c) The slope would be closer to 0 and the y intercept would be larger. 
3.55 (a) residual = 150.06 — 146.295 = 3.765. Yu-Na Kim’s free 
skate score was 3.765 points higher than predicted based on her 
short program score. (b) Because there is no leftover pattern in the 
residual plot, a linear model is appropriate for these data. (c) When 
using the least-squares regression line with x = short program score 
to predict y = free skate score, we will typically be off by about 10.2 
points. (d) About 73.6% of the variation in free skate scores is 
accounted for by the linear model relating free skate scores to short 
program scores. 

3.57 1°: About 56% of the variation in the number of new adults is 
accounted for by the linear model relating number of new adults to 
the percent returning. s: When using the least-squares regression 
line with x = percent returning to predict y = number of new 
adults, we will typically be off by 3.67 adults. 

3.59 (a) y = 266.07 — 6.650x, where y = percent of males that 
return the next year and x = number of breeding pairs. When 
x = 30, 9 = 66.57. (b) RSq = 74.6% (c) r= — V0.746 = —0.864. 
‘The sign is negative because the slope is negative. (d) When using 
the least-squares regression line with x = number of breeding pairs 
to predict y = percent returning, we will typically be off by 7.76%. 
3.61 (a) y = 33.67 + 0.54x. (b) If the value of x is 1 standard devia- 
tion below x, the predicted value of y will be r standard deviations 
of y below y. So the predicted value for the husband _ is 
68.5 — 0.5(2.7) = 67.15 inches. 

3.63 (a)? = 0.25. About 25% of the variation in husbands’ heights 
is accounted for by the linear model relating husband’s height to 


wife’s height. (b) When using the least-squares regression line with 
x = wife’s height to predict y = husband’s height, we will typically 
be off by 1.2 inches. 

3.65 (a) y =x where y = final and x = midterm (b) If x = 50, 
y = 67.1. Ifx = 100, y = 87.6. (c) The student who did poorly on 
the midterm (50) is predicted to do better on the final (closer to the 
mean), while the student who did very well on the midterm (100) is 
predicted to do worse on the final (closer to the mean). 

3.67 State: Is a linear model appropriate for these data? If so, how 
well does the least-squares regression line fit the data? Plan: We will 
look at the scatterplot and residual plot to see if the association is 
linear or nonlinear. Then, if a linear model is appropriate, we will 
use s and r? to measure how well the line fits the data. Do: The scat- 
terplot below shows a moderately strong, positive linear association 
between the number of stumps and the number of clusters of beetle 
larvae. The residual plot doesn’t show any obvious leftover pattern, 
confirming that a linear model is appropriate. 
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y = —1.29 + 11.89x, where y = number of clusters of beetle lar- 


vae and x = number of stumps. s = 6.42, meaning that our predic- 
tions will typically be off by about 6.42 clusters when we use the 
line to predict the number of clusters of beetle larvae from the 
number of stumps. Finally, r? = 0.839, meaning 83.9% of the varia- 
tion in the number of clusters of beetle larvae is accounted for by 
the linear model relating number of clusters of beetle larvae to the 
number of stumps. Conclude: The linear model relating number of 
clusters of beetle larvae to the number of stumps is appropriate and 
fits the data well, accounting for more than 80% of the variation in 
number of clusters of beetle larvae. 

3.69 (a) A scatterplot is shown below. There is a moderate, positive 
linear association between HbA and FBG. There are possible outliers 
to the far right (subject 18) and near the top of the plot (subject 15). 
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(b) Because the point is in the positive, linear pattern formed by 
most of the data values, it makes r closer to 1. Also, because the 
point is likely to be below the least-squares regression line, it will 
“pull down” the line on the right side, making the slope closer to 0. 
Without the outlier, r decreases from 0.4819 to 0.3837 as expected. 
Likewise, the equation changes from y= 66.4 + 10.4x to 
y = 52.3 + 12.1x. (c) The point makes r closer to 0 because it is out 
of the linear pattern formed by most of the data values. Because this 
point’s x coordinate is very close to x but the y coordinate is far 
above y, it won't influence the slope very much but will increase 
the y intercept. Without the outlier, r increases from 0.4819 to 
0.5684, as expected. Likewise, the equation changes from 
y = 66.4 + 10.4x to p = 69.5 + 8.92x. 

3.71 a 

3.73: ¢ 

379d 

3.77 b 

3.79 For these vehicles, the combined mileage follows a N(18.7, 
4.3) distribution and we want to find the percent of cars with lower 
pe ee —_ = 147. From 
‘Table A, the proportion of z-scores below 1.47 is 0.9292. Using tech- 
nology: normalcdf (lower:-1000,upper:25,»:18.7,0: 
4.3) = 0.9286. About 93% percent of vehicles get worse com- 
bined mileage than the Chevrolet Malibu. 


mileage than 25 (see graph below). z = 


N(18.7, 4.3) 


187 25.0 
3.81 (a) A bar graph is given below. The people who use marijuana 
more are more likely to have caused accidents. (b) Association does 
not imply causation. For example, it could be that drivers who use 
marijuana more often are more willing to take risks than other driv- 
ers and that the willingness to take risks is what is causing the higher 
accident rate. 
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Answers to Chapter 3 Review Exercises 


R3.1 (a) There is a moderate, positive linear association between 
gestation and life span. Without the outliers at the top and in the 
upper right, the association appears moderately strong, positive, 
and curved. (b) It makes 7 closer to 0 because it decreases the 
strength of what would otherwise be a moderately strong positive 
association. Because this point is close to x but far above y, it won't 
affect the slope much but will increase the y intercept. Because it 
has such a large residual, it increases s. (c) Because it is in the posi- 
tive, linear pattern formed by most of the data values, it will make r 
closer to 1. Also, because the point is likely to be above the least- 
squares regression line, it will “pull up” the line on the right side, 


Solutions S-15 


making the slope larger and the intercept smaller. Because this 
point is likely to have a small residual, it decreases s. 

R3.2 (a) 0.0138. For each increase of | meter in dive depth, the 
predicted duration increases by 0.0138 minutes. (b) The y intercept 
suggests that a dive of 0 depth would last an average of 2.69 min- 
utes; this obviously does not make any sense. (c) 5.45 minutes (d) If 
the variables are reversed, the correlation will remain the 
same. However, the slope and y intercept will be different. 

R3.3 (a) y = 3704 + 12,188x, where y represents the mileage of 
76,832 = 
—11,832. This teacher has driven 11,832 fewer miles than pre- 


dicted based on the age of the car. (c) r= +V0.837 = 0.915. This 
shows that there is a strong, positive linear association between the 


the cars and x represents the age. (b) residual = 65,000 


age of cars and their mileage. (d) Yes, because there is no leftover 
pattern in the residual plot. (e) s = 20,870.5: When using the least- 
squares regression line with x = car’s age to predict y = number of 
miles it has been driven, we will typically be off by about 20,870.5 
miles. r? = 83.7%: About 83.7% of the variability in mileage is 
accounted for by the linear model relating mileage to age. 

R3.4 (a) The scatterplot is shown below. Average March tempera- 
ture, because changes in March temperature probably have an 
effect on the date of first bloom. 
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(b) r= —0.85 and y = 33.12 — 4.69x, where y represents the num- 
ber of days and x represents the temperature. r: There is a strong, 
negative linear association between the average March temperature 
and the days in April until first bloom. Slope: For every 1° increase 
in average March temperature, the predicted number of days in 
April until first bloom decreases by 4.69. y intercept: If the average 
March temperature was 0°C, the predicted number of days in April 
to first bloom is 33.12 (May 3). (c) No, x = 8.2 is well beyond the 
values of x we have in the data set. (d) residual = 10 — 12.015 = 
—2.015. In this year, the actual date of first bloom occurred about 2 
days earlier than predicted based on the average March tempera- 
ture. (e) There is no leftover pattern in the residual plot shown 
below, indicating that a linear model is appropriate. 
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S-16 Solutions 


R3.5 (a) y = 30.2 + 0.16x, where y = final exam score and x = 
total score before the final examination. (b) 78.2 (c) Of all the lines 
that the professor could use to summarize the relationship between 
final exam score and total points before the final exam, the least- 
squares regression line is the one that has the smallest sum of 
squared residuals. (d) Because 1? = 0.36, only 36% of the variability 
in the final exam scores is accounted for by the linear model relat- 
ing final exam scores to total score before the final exam. More than 
half (64%) of the variation in final exam scores is not accounted for, 
so Julie has reason to question this estimate. 

R3.6 Even though there is a high correlation between number of 
calculators and math achievement, we shouldn’t conclude that 
increasing the number of calculators will cause an increase in math 
achievement. It is possible that students who are more serious about 
school have better math achievement and also have more 
calculators. 


Answers to Chapter 3 AP® Statistics Practice Test 


T3.1 d 
T3.2.e 
13.3 
13.4 
T3.5 
T3.6 
dese 
T3.8 
T3.9 b 

T3.10 c 

T3.11 (a) A scatterplot with regression line is shown below. (b) 
y = 71.95 + 0.3833x, where y = height and x = age. (c) 255.934 
cm, or 100.76 inches (d) This was an extrapolation. Our data were 
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based only on the first 5 years of life and the linear trend will not 
continue forever. 
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13.12 (a) The point in the upper-righthand corner has a very high 
silicon value for its isotope value. (b) (i) r would get closer to — 1 
because it does not follow the linear pattern of the other points. (ii) 
Because this point is “pulling up” the line on the right side of the 
plot, removing it will make the slope steeper (more negative) and 
the y intercept smaller (note that the y axis is to the right of the 
points in the scatterplot). (iii) Because this point has a large resid- 
ual, removing it will make s a little smaller. 

T3.13 (a) p = 92.29 — 0.05762x, where y is the percent of the grass 
burned and x is the number of wildebeest. (b) For every increase of 
1000 wildebeest, the predicted percent of grassy area burned 
decreases by about 0.058. (c) r= —V0.646 = —0.804. There is a 


strong, negative linear association between the percent of grass 


burned and the number of wildebeest. (d) Yes, because there is no 
obvious leftover pattern in the residual plot. 


Chapter 4 
Section 4.1 


Answers to Check Your Understanding 

page 213: 1. Convenience sampling. This could lead the inspec- 
tor to overestimate the quality of the oranges if the farmer puts the 
best oranges on top. 2. Voluntary response sampling. In this case, 
those who are happy that the UN has its headquarters in the U.S. 
already have what they want and so are less likely to respond. The 
proportion who answered “No” in the sample is likely to be higher 
than the true proportion in the U.S. who would answer “No.” 
page 223: 1. You would have to identify 200 different seats, go to 
those seats in the arena, and find the people who are sitting there, 
which would take a lot of time. 2. It is best to create strata where 
the people within a stratum are very similar to each other but differ- 
ent than the people in other strata. In this case, it would be better to 
take the lettered rows as the strata because each lettered row is the 
same distance from the court and so would contain only seats with 
the same (or nearly the same) ticket price. 3. It is best ifthe people 
in each cluster reflect the variability found in the population. In 
this case, it would be better to take the numbered sections as the 
clusters because they include all different seat prices. 

page 228: 1. (a) Undercoverage (b) Nonresponse (c) Undercoverage 
2. By making it sound like they are not a problem in the landfill, 
this question will result in fewer people suggesting that we should 
ban disposable diapers. The proportion who would say “Yes” to this 
survey question is likely to be smaller than the proportion who 
would say “Yes” to a more fairly worded question. 


Answers to Odd-Numbered Section 4.1 Exercises 

4.1 Population: all local businesses. Sample: the 73 businesses that 
return the questionnaire. 

4.3 Population: the 1000 envelopes stuffed during a given hour. 
Sample: the 40 randomly selected envelopes. 

4.5 This is a voluntary response sample. In this case, it appears that 
people who strongly support gun control volunteered more often, 
causing the proportion in the sample to be greater than the propor- 
tion in the population. 

4.7 This is a voluntary response sample and overrepresents the 
opinions of those who feel most strongly about the issue being 
surveyed. 

4.9 (a) Aconvenience sample (b) The first 100 students to arrive at 
school likely had to wake up earlier than other students, so 7.2 
hours is probably less than the true average. 

4.11 (a) Number the 40 students from 01 to 40. Pick a starting 
point on the random number table. Record two-digit numbers, 
skipping numbers that aren’t between 01 and 40 and any repeated 
numbers, until you have 5 unique numbers between 01 and 40. 
Use the 5 students corresponding to these numbers. (b) Using line 
107, skip the numbers not in bold: 82 73 95 78 90 20 80 
74 75 11 81 67 65 53 00 94 38 31 48 93 60 94 07. Select 
Johnson (20), Drasin (11), Washburn (38), Rider (31), and Calloway 
(07). 

4.13 (a) Using calculator: Number the plots from | to 1410. Use 
the command randInt (1,1410) toselect 141 different integers 
from | to 1410 and use the corresponding 141 plots. (b) Answers 
will vary. 

4.15 (a) False —although, on average, there will be four 0s in every 
set of 40 digits, the number of 0s can be less than 4 or greater than 
4 by chance. (b) True—there are 100 pairs of digits 00 through 99, 


and all are equally likely. (c) False —0000 is just as likely as any 
other string of four digits. 

4.17 (a) It might be difficult to locate the 20 phones from among 
the 1000 produced that day. (b) The quality of the phones pro- 
duced may change during the day, so that the last phones manufac- 
tured are not representative of the day’s production. (c) Because 
each sample of 20 phones does not have the same probability of 
being selected. In an SRS, it is possible for 2 consecutive phones to 
be selected in a sample, but this is not possible with a systematic 
random sample. 

4.19 Assign numbers 01 to 30 to the students. Pick a starting point 
on the random digit table. Record two-digit numbers, skipping any 
that aren’t between 01 and 30 and any repeated numbers, until you 
have 4 unique numbers between 01 and 30. Use the corresponding 
four students. Then assign numbers 0 to 9 to the faculty members. 
Continuing on the table, record one-digit numbers, skipping any 
repeated numbers, until you have 2 unique numbers between 0 
and 9. Use the corresponding faculty members. Starting on line 
123 gives 08-Ghosh, 15-Jones, 07-Fisher, and 27-Shaw for the stu- 
dents and 1-Besicovitch and 0-Andrews for the faculty. 

4.21 (a) Use the three types of seats as the strata because people 
who can afford more expensive tickets probably have different 
opinions about the concessions than people who can afford only 
the cheaper tickets. (b) A stratified random sample will include 
seats from all over the stadium, which would make it very time- 
consuming to obtain. A cluster sample of numbered sections would 
be easier to obtain, because the people selected for the sample 
would be sitting close together. 

4.23 No. In an SRS, each possible sample of 250 engineers is 
equally likely to be selected, including samples that aren’t exactly 
200 males and 50 females. 

4.25 (a) Cluster sampling. (b) To save time and money. In an SRS, 
the company would have to visit individual homes all over the rural 
subdivision instead of only 5 locations. 

4.27 (a) It is unlikely, because different random samples will 
include different students and produce different estimates of the 
proportion of students who use Twitter. (b) An SRS of 100 students. 
Larger random samples give us better information about the popu- 
lation than smaller random samples. 

4.29 Because you are sampling only from the lower-priced ticket 
holders, this will likely produce an estimate that is too small, as fans 
in the club seats and box seats probably spend more money at the 
game than fans in cheaper seats. 

4.31 (a) 89.1% (b) Because the people who have long commutes 
are less likely to be at home and be included in the sample, this will 
likely produce an estimate that is too small. 

4.33, We would not expect very many people to claim they have 
run red lights when they haven’t, but some people will deny run- 
ning red lights when they have. Thus, we expect that the sample 
proportion underestimates the true proportion of drivers who have 
run a red light. 

4.35 (a) The wording is clear, but the question is slanted in favor of 
warning labels because of the first sentence stating that some cell 
phone users have developed brain cancer. (b) The question is clear, 
but it is slanted in favor of national health insurance by asserting it 
would reduce administrative costs and not providing any counterar- 
guments. (c) The wording is too technical for many people to 
understand. For those who do understand the question, it is slanted 
because it suggests reasons why one should support recycling. 


4,37 ¢ 


Solutions S-17 


4.39 d 

4.41 d 

4.43 (a) For each additional day, the predicted sleep debt increases 
by about 3.17 hours. (b) The predicted sleep debt for a 5-day school 
week is 2.23 + 3.17(5) = 18.08 hours. This is about 3 hours more 
than the researcher claimed for a 5-day week, so the students have 
reason to be skeptical of the research study’s reported results. 


Section 4.2 


Answers to Check Your Understanding 

page 237: 1. Experiment, because a treatment (brightness of 
screen) was imposed on the laptops. 2. Observational study, 
because students were not assigned to eat a particular number of 
meals with their family per week. 3. Explanatory: number of meals 
per week eaten with their family. Response: GPA. 4. There are 
probably other variables that are influencing the response variable. 
For example, students who have part-time jobs may not be able to 
eat many meals with their families and may not have much time to 
study, leading to lower grades. 

page 247: 1. Randomly assign the 29 students to two treatments: 
evaluating the performance in small groups or evaluating the per- 
formance alone. The response variable will be the accuracy of their 
final performance evaluations. To implement this design, use 29 
equally sized slips of paper. Label 15 of them “small group” and 14 
of them “alone.” Then shuffle the papers and hand them out at 
random to the 29 students, assigning them to a treatment. 2. The 
purpose of the control group is to provide a baseline for compari- 
son. Without a group to compare to, it is impossible to determine if 
the small group treatment is more effective. 

page 249: 1. No. Perhaps seeing the image of their unborn child 
encouraged the mothers who had an ultrasound to eat a better diet, 
resulting in healthier babies. 2. No. While the people weighing 
the babies at birth may not have known whether that particular 
mother had an ultrasound or not, the mothers knew. This might 
have affected the outcome because the mothers knew whether they 
had received the treatment or not. 3. Treat all mothers as if they 
had an ultrasound, but for some mothers the ultrasound machine 
wouldn’t be turned on. ‘To avoid having mothers know the machine 
was turned off, the ultrasound screen would have to be turned away 
from all the mothers. 


Answers to Odd-Numbered Section 4.2 Exercises 

4.45 Experiment, because students were randomly assigned to the 
different teaching methods. 

4.47 (a) Observational study, because mothers weren’t assigned to 
eat different amounts of chocolate. (b) Explanatory: the mother’s 
chocolate consumption. Response: the baby’s temperament. 
(c) No, this study is an observational study so we cannot draw a 
cause-and-effect conclusion. It is possible that women who eat 
chocolate daily have less stressful lives and the lack of stress helps 
their babies to have better temperaments. 

4.49 Type of school. For example, private schools tend to have 
smaller class sizes and students that come from families with higher 
socioeconomic status. If these students do better in the future, we 
wouldn’t know if the better performance was due to smaller class 
sizes or higher socioeconomic status. 

4.51 Experimental units: pine seedlings. Explanatory variable: 
light intensity. Response variable: dry weight at the end of the study. 
‘Treatments: full light, 25% light, and 5% light. 


S-18 Solutions 


4.53 Experimental units: the individuals who were called. 
Explanatory variables: (1) information provided by interviewer; 
(2) whether caller offered survey results. Response variable: whether 
or not the call was completed. ‘Treatments: (1) name/no offer; (2) 
university/no offer; (3) name and university/no offer; (4) name/ 
offer; (5) university/offer; (6) name and university/offer. 

4.55 Experimental units: 24 fabric specimens. Explanatory vari- 
ables: (1) roller type; (2) dyeing cycle time; (3) temperature. 
Response variable: a quality score. ‘Treatments: (1) metal, 30 min, 
150°; (2) natural, 30 min, 150°; (3) metal, 40 min, 150°; (4) natural, 
40 min, 150°; (5) metal, 30 min, 175°; (6) natural, 30 min, 175°; 
(7) metal, 40 min, 175°; (8) natural, +0 min, 175°. 

4.57 There was no control group. We don’t know if the improve- 
ment was due to the placebo effect or if the flavonols actually 
affected the blood flow. 

4.59 (a) Write all names on slips of paper, put them in a container, 
and mix thoroughly. Pull out 40 slips of paper and assign these 
subjects to ‘Treatment 1. Then pull out 40 more slips of paper and 
assign these subjects to Treatment 2. The remaining 40 subjects are 
assigned to Treatment 3. (b) Assign the students numbers from | to 
120. Using the command RandInt (1,120) on the calculator, 
assign the students corresponding to the first 40 unique numbers cho- 
sen to Treatment 1, the students corresponding to the next 40 unique 
numbers chosen to Treatment 2, and the remaining 40 students to 
‘Treatment 3. (c) Assign the students numbers from 001 to 120. Pick 
a spot on ‘Table D and read off the first 40 unique numbers between 
001 and 120. The students corresponding to these numbers are 
assigned to Treatment 1. The students corresponding to the next 40 
unique numbers between 001 and 120 are assigned to Treatment 2. 
The remaining 40 students are assigned to Treatment 3. 

4.61 Random assignment. If players are allowed to choose which 
treatment they get, perhaps the more motivated players will choose 
the new method. If they improve more by the end of the study, the 
coach can’t be sure if it was the exercise program or player motiva- 
tion that caused the improvement. 

4.63 Comparison: Researchers used a design that compared a low- 
carbohydrate diet with a low-fat diet. Random assignment: Subjects 
were randomly assigned to one of the two diets. Control: The exper- 
iment used subjects who were all obese at the beginning of the 
study and who all lived in the same area. Replication: There were 
66 subjects in each treatment group. 

4.65 Write the names of the patients on 36 identical slips of paper, 
put them in a hat, and mix them well. Draw out 9 slips. ‘The corre- 
sponding patients will receive the antidepressant. Draw out 9 more 
slips. Those patients will receive the antidepressant plus stress man- 
agement. The patients corresponding to the next 9 slips drawn will 
receive the placebo, and the remaining 9 patients will receive the 
placebo plus stress management. At the end of the experiment, 
record the number and severity of chronic tension-type headaches 
for each of the 36 subjects and compare the results for the + groups. 
4.67 (a) Other variables include expense and condition of the 
patient. For example, if a patient is in very poor health, a doctor 
might choose not to recommend surgery because of the added 
complications. Then we won’t know if a higher death rate is due to 
the treatment or the initial health of the subjects. (b) Write the 
names of all 300 patients on identical slips of paper, put them in a 
hat, and mix them well. Draw out 150 slips and assign the corre- 
sponding subjects to receive surgery. ‘The remaining 150 subjects 
receive the new method. At the end of the study, count how many 
patients survived in each group. 


4.69 The subjects developed rashes on the arm exposed to the 
placebo (a harmless leaf) simply because they thought they were 
being exposed to a poison ivy leaf. Likewise, most of the subjects 
didn’t develop rashes on the arm that was exposed to poison ivy 
because they didn’t think they were being exposed to the real 
thing. 

4.71 Because the experimenter knew which subjects had learned 
the meditation techniques, he is not blind. If the experimenter 
believed that meditation was beneficial, he may subconsciously 
rate subjects in the meditation group as being less anxious. 

4.73 (a) To make sure that the two groups were as similar as possi- 
ble before the treatments were administered. (b) The difference in 
weight loss was larger than would be expected due to the chance 
variation created by the random assignment to treatments. (c) Even 
though the low-carb dieters lost 2 kg more over the year than the 
low-fat group, a difference of 2 kg could be due just to chance varia- 
tion created by the random assignment. 

4.75 (a) The different diagnoses, because the treatments were ran- 
domly assigned to patients within each diagnosis. (b) Using a ran- 
domized block design allows us to account for the variability in 
response due to differences in diagnosis by initially comparing the 
results within each block. In a completely randomized design, this 
variability will be unaccounted for, making it harder to determine 
if there is a difference in health and satisfaction due to the differ- 
ence between doctors and nurse-practitioners. 

4.77 (a) A randomized block design would help us account for the 
variability in yield that is due to the differences in fertility in 
the field, making it easier to determine if one variety is better than 
the others. (b) The rows. ‘There should be a stronger association 
between row number and yield than column number and yield. 
(c) Let the digits 1 to 5 correspond to the five corn varieties A to E. 
Begin with line 111 on the random digit table, and assign the letters 
to the top row from left to right, ignoring numbers 0 and 6—9 and 
repeated numbers. Use a different line (111, 112, 113, 114, and 
115) for each row. Top row (left to right): ADECB, second row: 
ECDAB, third row: BEDCA, fourth row: DEACB, bottom row: 
ADCBE. 

4.79 (a) If all rats from litter 1 were fed Diet A and if these rats 
gained more weight, we would not know if this was because of the 
diet or because of genetics and initial health. (b) Use a randomized 
block design with the litters as blocks. For each of the litters, ran- 
domly assign half of the rats to receive Diet A and the other half to 
receive Diet B. This will allow researchers to account for the differ- 
ences in weight gain caused by the differences in genetics and ini- 
tial health. 

4.81 (a) Matched pairs design. (b) In a completely randomized 
design, the differences between the students will add variability to 
the response, making it harder to detect if there is a difference 
caused by the treatments. In a matched pairs design, each student 
is compared with himself (or herself), so the differences between 
students are accounted for. (c) If all the students used the hands- 
free phone during the first session and performed worse, we 
wouldn’t know if the better performance during the second session 
is due to the lack of phone or to learning from their mistakes the 
first time. By randomizing the order, some students will use the 
hands-free phone during the first session and others during the sec- 
ond session. (d) The simulator, route, driving conditions, and traffic 
flow were all kept the same for both sessions, preventing these vari- 
ables from adding variability to the response variable. 


4.83 (a) Randomly assign the 20 subjects into two groups of 10. 
Write the name of each subject on a note card, shuffle the cards, 
and select 10 to be assigned to the 70° environment. The remain- 
ing 10 subjects will be assigned to the 90° environment. Then the 
number of correct insertions will be recorded for each subject and 
the two groups compared. (b) All subjects will perform the task 
twice, once in each temperature condition. Randomly choose the 
order by flipping a coin. Heads: 70°, then 90°. Tails: 90°, then 70°. 
For each subject, compare the number of correct insertions in each 
environment. 

4.85 (a) If the students find a difference between the two groups, 
they will not know if the difference is due to gender or the deodor- 
ant. (b) Each student should have one armpit randomly assigned to 
receive Deodorant A and the other Deodorant B. Because each 
gender uses both deodorants, there is no longer any confounding 
between gender and deodorant. 

4.87 c 

4.89 b 

4.916 

4.93 b 

4.95 (a) For these seeds, the weights follow a N(525, 110) distribution 
and we want the proportion of seeds that weigh more than 500 mg 


500 — 525 
(see graph below). z = ae = — (0.23. From Table A, the 


proportion of z-scores greater than —0.23 is 1 — 0.4090 = 0.5910. 
Using technology normalcdf (lower:500,upper:10000, 
p:525,0:110) = 0.5899. About 59% of seeds will weigh more 
than 500 mg. 


(525,110) 


500 525 
Weight (mg) 
(b) For these seeds, the weights follow a N(525, 110) distribution 


and we are looking for the boundary value x that has an area of 0.10 
to the left (see graph below). A z-score of —1.28 gives the closest 


value to 0.10 (0.1003). Solving ame gives x = 384.2. 


Using technology: invNorm(area:0.10,y:525,0:110) = 
384.0. The smallest weight among the remaining seeds should be 
about 384 mg. 


(525,110) 
Area = 0.10 


T 
525 


Weight (mg) 


Section 4.3 


Answers to Odd-Numbered Section 4.3 Exercises 

4.97 If the study involves random sampling, we can make infer- 
ences about the population from which we sampled. If the study 
involves random assignment, we can make inferences about cause 
and effect. 


Solutions S-19 


4.99 Because this study involved random assignment to the treat- 
ments, we can infer that the difference between foster care or insti- 
tutional care caused the difference in response. 

4.101 Because this study did not involve random assignment to a 
treatment, we cannot infer cause and effect. Also, because the indi- 
viduals were not randomly chosen, we cannot generalize to a larger 
population. 

4.103 As daytime running lights become more common, they may 
be less effective at catching the attention of other drivers. Also, a 
driving simulator might not be very realistic. 

4.105 Answers will vary. 

4.107 Answers will vary. 

4.109 Confidential. The person taking the survey knows who is 
answering the questions, but will not share the results of individuals 
with anyone else. 

4.111 The subjects were not able to give informed consent. ‘They 
did not know what was happening to them and they were not old 
enough to understand the ramifications. 

4.113 The conditional distributions for males and females are dis- 
played in the table and graph below. Men are more likely to view 
animal testing as justified if it might save human lives: over two- 
thirds of men agree or strongly agree with this statement, compared 
to slightly less than half of the women. The percentages who dis- 
agree or strongly disagree tell a similar story: 16% of men versus 
30% of women. 


Response Male Female 
Strongly agree 14.7% 9.3% 
Agree 52.3% 38.8% 
Neither 16.9% 21.9% 
Disagree 11.8% 19.3% 
Strongly disagree 4.3% 10.7% 
60 
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Answers to Chapter 4 Review Exercises 


R4.1 (a) Population: all Ontario residents. Sample: the 61,239 
people interviewed. (b) Because different samples will produce dif- 
ferent estimates, it is unlikely that the percentages in the entire 
population would be exactly the same as the percentages in the 
sample. However, they should be fairly close. 

R4.2 (a) Announce in a daily bulletin that there is a survey con- 
cerning student parking available in the main office for students 
who want to respond. Because those who feel strongly are more 
likely to respond, their opinions will be overrepresented. 
(b) Interview a group of students as they come in from the parking 
lot. People who already can park on campus might have different 
opinions about the parking situation than those who cannot. 


S-20 Solutions 


R4.3 (a) Number the players from 01 to 25 in alphabetical order. 
Move from left to right, reading pairs of digits until you find three 
different pairs between 01 and 25, and select the corresponding 
players. (b) 17 (Musselman), 09 (Fuhrmann), and 23 (Smith). 
R4.4 Stratified, because it is likely that the opinions of professors 
will vary based on which type of institution they are at. Then a 
stratified random sample will provide a more precise estimate than 
the other methods. Furthermore, the other methods might miss fac- 
ulty from one particular type of institution. 

R4.5 (a) People may not remember how many movies they 
watched in a movie theater in the past year. So shorten the amount 
of time that they ask about, perhaps 3 or 6 months. (b) This will 
underrepresent younger adults who use only cell phones. If younger 
adults go to movies more often than older adults, the estimated 
mean will be too small. (c) Because the frequent moviegoers will 
not be at home to respond, the estimated mean will be too small. 
R4.6 (a) Different anesthetics were not randomly assigned to the 
subjects. (b) ‘Type of surgery. If Anesthesia C is used more often 
with a type of surgery that has a higher death rate, we wouldn’t 
know if the death rate was higher because of the anesthesia or the 
type of surgery. 

R4.7 (a) Units: potatoes. Explanatory: storage method and time 
from slicing until cooking. Response: ratings of color and flavor. 
‘Treatments: (1) fresh/immediately, (2) fresh/after an hour, (3) room 
temperature/immediately, (4) room temperature/after an hour, 
(5) refrigerator/immediately, (6) refrigerator/after an hour. (b) 
Using 300 identical slips of paper, write “1” on 50 of them, “2” on 
50 of them, and so on. Put the papers in a hat and mix well. Then 
select a potato and randomly select a slip from the hat to determine 
which treatment that potato will receive. Repeat this process for the 
remaining 299 potatoes, making sure not to replace the slips of 
paper into the hat. (c) Use a randomized block design with regular 
potatoes in one block and sweet potatoes in the other block. 
Randomly assign the 6 treatments within each block as in part (b). 
R4.8 (a) No. The 1000 students were not randomly selected from 
any larger population. (b) Yes. The students were randomly assigned 
to the three treatments. 

R4.9 (a) By giving some patients a treatment that should have no 
effect at all, but appears like the Saint-John’s-wort, the researchers 
can account for the expectations of patients (the placebo effect) by 
comparing the results for the two groups. (b) To create two groups 
of subjects that are roughly equivalent at the beginning of the 
experiment. (c) The subjects should not know which treatment 
they are getting so that the researchers can account for the placebo 
effect. The researchers should be unaware of which subjects 
received which treatment so that they cannot influence how the 
results are measured. (d) The difference in improvement between 
the two groups wasn’t large enough to rule out the chance variation 
caused by the random assignment to treatments. 

R4.10 (a) Randomly assign 15 students to easy mazes and the other 
15 to hard mazes. Use 30 identical slips of paper and write the 
name of each subject on a slip. Mix the slips in a hat, select 15 of 
them at random, and assign these subjects to hard mazes. The 
remaining 15 will be assigned to easy mazes. After the experiment, 
compare the time estimates of the two groups. (b) Each student 
does the activity twice, once with each type of maze. Randomly 
determine which set of mazes is used first by flipping a coin for each 
subject. Heads: easy, then hard. ‘Tails: hard, then easy. After the 
experiment, compare each student’s easy maze and hard maze time 
estimate. (c) The matched pairs design would be more likely to 


detect a difference because it accounts for the variability between 
subjects. 

R4.11 (a) This does not meet the requirements of informed con- 
sent because the subjects did not know the nature of the experi- 
ment before they agreed to participate. (b) All individual data 
should be kept confidential and the experiment should go before 
an institutional review board before being implemented. 


Answers to Chapter 4 AP® Statistics Practice Test 


T4.1 ¢ 

142. ¢ 

T4.3 d 

T44¢ 

T4.5 b 

T4.6 b 

T4.7 d 

T4.8 d 

T4.9 d 

T4.10 b 

T4.11 d 

4.12 (a) Experimental units: acacia trees. Treatments: placing 
either active beehives, empty beehives, or nothing in the trees. 
Response: damage to the trees caused by elephants. (b) Assign the 
trees numbers from 01 to 72 and use a random number table to pick 
24 different two-digit numbers in this range. Those trees will get the 
active beehives. The trees corresponding to the next 24 different 
two-digit numbers from 01 to 72 will get the empty beehives, and 
the remaining 24 trees will remain empty. Compare the damage 
caused by elephants to the three groups of trees. 

4.13 (a) Not all possible samples of size 1067 were possible. For 
example, using their method, they could not have had all respon- 
dents from the east coast. (b) If the household members who typi- 
cally answer the phone have a different opinion than those who 
don’t typically answer the phone, their opinions will be overrepre- 
sented. (c) If people without phones or with cell phones only have 
different opinions than the group of people with residential lines, 
these opinions will be underrepresented. 

T4.14 (a) Each of the 11 individuals will be a block in this matched 
pairs design, with the order of treatments randomly assigned. This 
was to help account for the variability in tapping speed caused by the 
differences in subjects. (b) If all the subjects got caffeine the second 
time, the researchers wouldn’t know if the increase was due to the 
caffeine or due to practice with the task. (c) Yes. Neither the subjects 
nor the people who come in contact with them during the experi- 
ment (including those who record the number of taps) need to know 
the order in which the caffeine or placebo was administered. 


Answers to Cumulative AP® Practice Test 1 


AP1.1 d 
AP1.2 e 
AP1.3 b 
APIA c 
APL.5 
AP 1.6 
AP LS 
AP 1.8 
AP1.9 d 
AP1.10 d 
AP1.11 d 


oO ononp 


AP1.12 b 

AP1.13 b 

AP1.14 a 

AP1.15 (a) The distribution of gains for subjects using Machine A 
is roughly symmetric, while the distribution of gains for subjects 
using Machine B is skewed to the left. The center is greater for 
Machine B than for Machine A. The distribution for Machine B is 
more variable than the distribution for Machine A. (b) B. The typi- 
cal gain using Machine B is greater than the typical gain using 
Machine A. (c) A. The spread for Machine A is less than the spread 
for Machine B. (d) Volunteers from one fitness center were used 
and these volunteers may be different in some way from the general 
population of those who are interested in cardiovascular fitness. ‘To 
broaden their scope of inference, they should randomly select peo- 
ple from the population they would like to draw an inference about. 
AP1.16 (a) Number the 60 retail sales districts with a two-digit 
number from 01 to 60. Using a table of random digits, read two- 
digit numbers until 30 unique numbers from 01 to 60 have been 
selected. The corresponding 30 districts are assigned to the mone- 
tary incentives group and the remaining 30 to the tangible incen- 
tives group. After a specified period of time, record the change in 
sales for each district and compare the two groups. (b) The districts 
labeled 07, 51, and 18 are the first three to be assigned to the mon- 
etary incentives group. (c) Pair the two districts with the largest 
sales, the next two largest, down to the two smallest districts. For 
each pair, pick one of the districts and flip a coin. If the flip is 
“heads,” this district is assigned to the monetary incentives group. If 
it is “tails,” this district is assigned to the tangible incentives group. 
The other district in the pair is assigned to the other group. After a 
specified period of time, record the change in sales for each district 
and compare within each pair. 

AP1.17 (a) There is a very strong, positive, linear association 
between sales and shelf length. (b) y = 317.94 + 152.68x, where 
y = weekly sales (in dollars) and x = shelf length (in feet). 
(c) $1081 (d) When using the least-squares regression line with 
x = shelf space to predict y = sales, we will typically be off by about 
s = $23. (e) $$$ About 98.2% of the variation in weekly sales rev- 
enue can be accounted for by the linear model relating sales to 
shelf length. (f) It would be inappropriate to interpret the intercept, 
because the data represent sales based on shelf lengths of 3 to 6 feet 
and 0 feet falls substantially outside that domain. 


Chapter 5 
Section 5.1 


Answers to Check Your Understanding 

page 292: 1. (a) Ifyou asked a large sample of U.S. adults whether 
they usually eat breakfast, about 61% of them will answer yes. 
(b) The exact number of breakfast eaters will vary from sample to 
sample. 2. (a) 0. [fan outcome can never occur, then it will occur 
in 0% of the trials. (b) 1. If an outcome will occur on every trial, 
then it will occur in 100% of the trials. (c) 0.01. An outcome that 
occurs in 1% of the trials is very unlikely, but will occur every once 
ina while. (d) 0.6. An outcome that occurs in 60% of the trials will 
happen more than half of the time. 

page 299: 1. Assign the members of the AP® Statistics class the 
numbers 01—28 and the rest of the students numbers 29-95. 
Ignore the numbers 96—99 and 00. In ‘Table D, read off 4 two-digit 
numbers, making sure that the second number is different than the 
first and that the fourth number is different than the third. Record 


Solutions S-21 


whether all four numbers are between 01 and 28 or not. 2. Assign 
the numbers 1—10 to Jeff Gordon, 11—40 to Dale Earnhardt, Jr., 
41—60 to Tony Stewart, 61—85 to Danica Patrick, and 86—100 to 
Jimmie Johnson. Then proceed as in the example. 


Answers to Odd-Numbered Section 5.1 Exercises 

5.1 (a) If we use a polygraph machine on many, many people who 
are all telling the truth, the machine will say about 8% of the people 
are lying. (b) Answers will vary. A false positive would mean that a 
person telling the truth would be found to be lying. A false negative 
would mean that a person lying would be found to be telling the 
truth. 

5.3 (a) If we look at many families like this, approximately 25% of 
them will have a first-born child that develops cystic fibrosis. 
(b) No. The number of children with cystic fibrosis could be 
smaller or larger than + by random chance. 

5.5 (a) Answers will vary. (b) Spin the coin many more times. 

5.7 In the short run, there was quite a bit of variability in the per- 
centage of made free throws. However, this percentage became less 
variable and approached 0.30 as the number of shots increased. 
5.9 No, he is incorrectly applying the law of large numbers to a 
small number of at-bats. 

5.11 (a) There are 10,000 four-digit numbers (0000, 0001, ..., 
2873, ..., 9999), and each is equally likely. (b) 2873. To many, 
2873 “looks” more random than 9999—we don’t “expect” to get the 
same number four times in a row. It would be best to choose a num- 
ber that others would avoid so you don’t have to split the pot with 
many other people. 

5.13 (a) Let diamonds, spades, and clubs represent making a free 
throw and hearts represent missing. Deal one card from the deck. 
(b) Let 00-74 represent making the free throw and 75—99 repre- 
sent missing. Read a two-digit number from ‘Table D. (c) Let 1-3 
represent the player making the free throw and 4 represent a miss. 
Generate a random integer from 1—4. 

5.15 (a) There are 19 (not 18) numbers from 00 to 18, 19 (not 18) 
numbers from 19 to 37, and 3 (not 2) numbers from 38 to 40. 
(b) Repeats should not be skipped. For example, if the first number 
selected was 08, then the probability of selecting a left-hander on 
the next selection would be 9% (instead of 10%). 

5.17 (a) Valid. The chance of rolling a 1, 2, or 3 is 75% on a 4-sided 
die and the 100 rolls represent the 100 randomly selected U.S. 
adults. (b) Not valid. The probability of heads is 50% rather than 
60%. This method will underestimate the number of times she hits 
the center of the target. 

5.19 (a) What is the probability that, in a random selection of 10 
passengers, none from first class are chosen? (b) Number the first- 
class passengers 01—12 and the other passengers 13—76. Look up 
two-digit numbers in Table D until you have 10 unique numbers 
from 01 to 76. Count the numbers between 01 and 12. (c) 71 48 70 
99-84 29 07 A456 63 61 68 34 76 52. There is one person selected 
who is in first class. (d) It seems plausible that the actual selection 
was random, because 15/100 is not very small. 

5.21 (a) Use a random integer generator to select 30 numbers from 
1 to 365. Record whether or not there were any repeats in the sam- 
ple. (b) Answers will vary. (c) Answers will vary. 

5.23 (a) Obtaining a sample percentage of 55% or higher is not par- 
ticularly unusual (probability ~ 43/200) when 50% of all students 
recycle. (b) Obtaining a sample percentage of at least 63% is very 
unlikely (probability ~ 1/200) when 50% of all students recycle. 


S-22 Solutions 


5.25 State: What is the probability that, in a sample of 4 randomly 
selected U.S. adult males, at least one of them is red-green color- 
blind? Plan: Let 00-06 denote a colorblind man and 07-99 
denote a non-colorblind man. Read 4 two-digit numbers from 
‘Table D for each sample and record whether or not the sample had 
at least one red-green colorblind man in it. Do: In our 50 samples, 
15 had at least one colorblind man in them. Conclude: The proba- 
bility that a sample of 4 men would have at least one colorblind 
man is approximately 15/50 = 0.30. 

5.27 State: What is the probability that it takes 20 or more selec- 
tions in order to find one man who is red-green colorblind? Plan: 
Let 0—6 denote a colorblind man and 7—99 denote a non-color- 
blind man. Use technology to pick integers from 0 to 99 until we 
get a number between 0 and 6. Count how many numbers there are 
in the sample. Do: In 16 of our 50 samples, it took 20 or more 
selections to get one colorblind man. Conclude: Not surprised. The 
probability of needing 20 or more selections to get one colorblind 
man is fairly large (approximately 16/50 = 0.32). 

5.29 State: What is the probability that the random assignment 
will result in at least 6 men in the same group? Plan: Number the 
men 1—8 and the women 9-20. Use technology to pick 10 unique 
integers between | and 20 for one group. Record if there are at least 
6 numbers between | and 8 in either group. Do: In our 50 repeti- 
tions, 9 had one group with 6 or more men in it. Conclude: Not 
surprised. The probability of getting 6 or more men in one group is 
fairly large (approximately 9/50 = 0.18). 

531 ¢ 

5.33 b 

5.45 ¢ 

5.37 (a) Population: adult U.S. residents. Sample: the 353,564 
adults who were interviewed. (b) The people who do not have a 
telephone were excluded. This would lead to an underestimate of 
the proportion in the population who experienced stress a lot of the 
day yesterday if the people without phones are poorer and conse- 
quently experience more stress. 


Section 5.2 

Answers to Check Your Understanding 

page 309: 1. A person cannot have a cholesterol level of both 240 
or above and between 200 and 239 at the same time. 2. A person 
has either a cholesterol level of 240 or above, or they have a choles- 
terol level between 200 and 239. P(A or B) = 0.16 + 0.29 = 0.45. 
3. PCC) =1-—045 = 0.55. 


page 311: 1. 
Face card Non-face card Total 

Heart 3 10 13 
Non-heart 9 30 39 
Total 12 40 52 


2. P(F and H) = 3/52 = 0.058. 3. The face cards that are hearts 
will be double-counted because F and H are not mutually exclu- 
12. 13 3 = 22 ; 

oo 2 


Answers to Odd-Numbered Section 5.2 Exercises 
5.39 (a) (1,1), (1,2), (1,3), (1,4), (2,1), (2,2), (2,3), (2,4), (3,1), 
(3,2), (3,3), 3,4), (4,1), (4,2), (4,3), (4,4). (b) Each outcome has 


sive. P(F or H) = 


probability ‘e 


l l l l 4 
tn Oe ig 6 IG I 
5.43 (a) Legitimate. (b) Not legitimate: the total is more than 1. 
(c) Legitimate. 
5.45 (a) P(type AB) = 1 - 0.96 = 0.04 (b) P(not type AB) = 1 — 
P(type AB) = 1 — 0.04 = 0.96 (c) P(type O or B) = 0.49 + 0.20 = 0.69 
5.47 (a) 1 — 0.13 — 0.29 — 0.30 = 0.28 (b) Using the complement 
tule, 1 — 0.13 = 0.87. 


= 0.25. 


2 
5.49 (a) P(Female) = = = 0.462 (b) P(Eats breakfast regularly) 
300 110 
= S957 0.504. (c) P(Female and breakfast) = 595” 0.185. 
275 300 =110 465 
(d) P(Female or breakfast) 595 + 595 595 > 595 0.782. 
5.51 (a) 
B Not B Total 
E 10 10 20 
Not E 8 10 18 
Total 18 20 38 


2 
(b) P(B) = . = 0.474; P(E) = xe = 0.526. (c) The ball lands ina 


spot that is black and even. P(B and E) = = = 0.263. (d) If we add 


the probabilities of B and E, the spots that are black and even will 


18 20 10 28 
b ble- ted. P(B or E) = = 
e double-counted. P(B or E) 38°38 38 38 
5.53 (a) 


= 0.737. 


430 
(b) PPBUM) = — = 0.723. There is a 0.723 probability that we 


select a person who is a breakfast eater, a male, or both. 


165 
(c) PBEM M°) = oa = 0.277. There is a 0.277 probability that 


5 
we select a female who is not a breakfast eater. 
5.55 (a) 
FB Not FB Total 
YT 0.66 0.07 0.73 
Not YT 0.19 0.08 0.27 
Total 0.85 0.15 1 
(b) 
FB YT 
0.08 


(c) FBUYT (d) P(FBUYT) = 0.85 + 0.73 — 0.66 = 0.92. 


DOr © 

Doe 

5.61 The scatterplot for the average crawling age and average tem- 
perature is given below. 


Average crawling age 
(weeks) 
he 


Average temperature (F°) 


In this scatterplot, there appears to be a moderately strong, negative 
linear relationship between average temperature and average crawl- 
ing age. The equation for the least-squares regression line is age = 
35.7 — 0.077 (temp). We predict that babies will walk 0.077 weeks 


earlier for every degree warmer it gets. 


Section 5.3 
Answers to Check Your Understanding 
36 
page 321: 1. P(L) = aa = 0.3656. There is a 0.3656 probabil- 
ity of selecting a course grade that is lower than a B. 2. 
pe ye eos P(L| E) = 2° = 0.50 P(L|E) gi 
36560 ~ 1600 ~ 


the probability of getting a lower grade given that the student is 
studying engineering or physical science. Because this probability 
(0.50) is greater than P(L) = 0.3656, we can conclude that grades 
are lower in engineering and physical sciences. 


page 326: 1. 
gb Laptops 
e California <0.25 
> Desktops 
0 Laptops 
0.25 
Computers 0.30 
%, Desktops 
Q 
Coy 


o0 Laptops 
New York <0.5p on 
2. P(laptop) = 0.30 + 0.175 + 0.175 = 0.65. 

0.30 

3. P(made in CA | laptop) = 065 = 0.462. 
page 328: 1. Independent. Because we are replacing the cards, 
knowing what the first card was will not help us predict what the 
second card will be. 2. Not independent. Once we know the suit of 
the first card, then the probability of getting a heart on the second 
card will change depending on what the first card was. 
3. Independent. P(right-handed) = 24/28 = 6/7 is the same as 
P(right-handed | female) = 18/21 = 6/7. 
page 331: 1. P(retumed safely) = 0.95. So P(safe return on all 20 
missions) = 0.957° = 0.3585. 2. No. Being a college student and 
being 55 or older are not independent events. 


Answers to Odd-Numbered Section 5.3 Exercises 


. 597 | 
5.63 (a) P(almost certain|M) = 7459 ~ 0.2428. 
426 
(b) P(F | Some chance) = 7 = 0.5983. 


Solutions S-23 


l 
5.65 (a) P(D| F) = = 0.7647. Given that a senator is female, 


there is a 0.7647 probability that she is a Democrat. 
1 
(b) P| D) = = = 0.2167. Given that a senator is a Democrat, 


there is a 0.2167 probability that she is a female. 
5.67 (a) P(not English) = 1 — 0.59 = 0.41. 


(b) P(Spanish | other than English) = — = 0.6341. 


5.69 P(B) < P(B|T) < P(T) < P(T |B). There are very few pro 
basketball players, so P(B) should be smallest. If you are a pro bas- 
ketball player, it is quite likely that you are tall, so P(T | B) should 
be largest. Finally, it’s much more likely to be over 6 feet tall than it 
is to be a pro basketball player if you're over 6 feet tall. 

0.66 
5.71 P(YT| FB) = 0857 0.7765. 
5.73 P(download music) = 0.29, P(don’t care | download music) 
= 0.67. 
P(download music M don’t care) = (0.29)(0.67) =0.1943 = 19.43%. 


5.75 (a) A tree diagram is below. 


13 5 
1 Soft center 
6 
4 Soft center 19 
20, Hard center 
Candies 6 
20 14 5 
Soft center 


Hard center 
Hard center 


(b) P(one soft MN one hard) = (33)(q5) + (4)(G8) = 383 = 0.4421 


5.77 (a) A tree diagram is below. 
028 Credit card 
Regular <ar 
gasoline No credit card 
03% Credit card 
Mid-grade <0665 ; 
gasoline No credit card 
oAX Credit card 
Premium <asy 
gasoline No credit card 


(b) P(credit card) = (0.88)(0.28) + (0.02)(0.34) + (0.10)(0.42) = 


0.0420 
0.2952 (c) P(premium gasoline|credit card) = 02952 > 0.142. 


Customer 


5.79 (a) P(lactose intolerant) = (0.82)(0.15) + (0.14)(0.70) + 
(0.04)(0.90) = 0.257. 
0.036 


(b) P(Asian | lactose intolerant) = 0257 > 0.1401. 
5.81 P(antibody | positive) = 
(0.01)(0.9985) — 0.6270 
(0.01)(0.9985) + (0.99)(0.006) 
5.83 (a) : = 0.2801 (b) en = 0.2944 (c) The events are 
2367 4826 


not independent because the probabilities in parts (a) and (b) are 

not the same. 

5.85 Not independent. From Exercise 5.65, we saw that 
60 


P(D | F) = 0.7647, which is not the same as P(D) = 100 > 0.60. 


S-24 Solutions 


5.87 Independent. P(sum of 7 | green is 4) = 1/6 = 0.1667, which 
equals P(sum of 7) = 6/36 = 0.1667. 
5.89 P(all remain bright) = (0.98)? = 0.6676 
5.91 P(at least one universal donor) = 1 — (0.928)!° = 0.5263 
5.93 No, because the events are not independent. If one show 
starts late, we can predict that the next show will start late as well. 

6 1 
5.95 (a) P(doubles) = 6 6 0.167 

al 5 

(b) P(mo doubles first MN doubles second) = (7) = 367 0.139 


6/\6 216 


3 4 
(d) 4th: (2) (Z) 5th: (2) (2). The probability that the first 
l 


kl 
doubles are rolled on the kth roll is (2) @ 


6 
5.97 ¢ 
5.99 e 
5.101 P(at least one is underweight) = 1 — (1 


(c) P(first doubles on third roll) = ( 2\(2) =o 0.116 


0.131)? = 0.2448 


Answers to Chapter 5 Review Exercises 


R5.1 When the weather conditions are like those seen today, it has 
rained on the following day about 30% of the time. 

R5.2 (a) Let the numbers 00-14 represent not wearing a seat belt 
and 15—99 represent wearing a seat belt. Read 10 sets of two-digit 
numbers. For each set of 10 two-digit numbers, record whether 
there are two consecutive numbers between 00-14 or not. (b) The 
first sample is 29 07 71 48 63 61 68 34 70 52 (not two consecutive). 
The second sample is 62 22 45 10 25 95 05 29 09 08 (two consecu- 
tive). The third sample is 73 59 27 51 86 87 13 69 57 61 (not two 
consecutive). 


R5.3 (a) 
Difference 1 5 = 
; 18 £ Ms 
Probability 36 36 36 
18 6 24 
5s = feet Og Oe 
(b) Die A. P(A > B) = P(positive difference) 36.36 36 


R5.4 (a) Legitimate. All probabilities are between 0 and | and they 
add up to 1. (b) P(Hispanic) = 0.001 + 0.006 + 0.139 + 0.003 = 
0.149 (c) P(not a non-Hispanic white) = 1 — 0.674 = 0.326 (d) 
People who are white and Hispanic will be double-counted. P(white 


or Hispanic) = 0.813 + 0.149 — 0.139 = 0.823. 
R5.5 (a) 

Won Tacos 

36 


(b) P(lost and no tacos) = 36/81 = 0.444 (c) P(won or tacos) = 
41 30 26 45 
8 sl 


R5.6 (a) 
0 4 
od Steroids 0.05 7 
Athlete 
8.99 oO 4. 
No 0.97 


(b) P(+) = (0.10)(0.95) + (0.90)(0.03) = 0.122. 
0.095 


(c) P(steroids | +) = 01227 0.7787. 
R5.7 (a) 
Thick Thin Total 
Mushrooms 2 2 4 
No mushrooms 1 2 3 
Total 3 4 7 


4 
(b) Not independent: P(mushrooms) = a 0.571 does not equal 


P(mushrooms | thick crust) = : = 0.667. 


4 

(c) Independent: P(mushrooms) = g7 27 0.50 is equal to 

: 2 1 
P(mushrooms | thick crust) = 4 = 5 = 0.50. 

209 
R5.8 (a) P(damage) = a1 0.24. 
60 

(b) P(damage | no cover) = Tis 0.2844, 


1 76 
(damage |< ;) = Sea 0.3248, 


1 2 44 
P( damage | 3 to 5) aag 0.1991, and 


(damage |> 5) = = = 0.1415. (c) Yes. It appears that deer do 
much more damage when there is no cover or less than 1/3 cover 
than when there is more cover. 

R5.9 (a) P(up three consecutive years) = (0.65) = 0.274625. 

(b) P(same direction for 3 years) = (0.65)* + (0.35)? = 0.3175, 
R5.10 (a) (A, A), (A, B), (B, A), (B, B) 


Blood type A AB B 
Probability 0.25 0.5 0.25 


(b) (A, A), (A, B), (O, A), (O, B) 


Blood type A AB B 
Probability 0.5 0.25 0.25 


P(at least 1 type B) = 1 — P(neither are type B) = 1 — (0.75) 
= 0.4375. 


Answers to Chapter 5 AP® Statistics Practice Test 


Ts 
Ts2 
133 
T5.4 
bs 
T5.6 
TS7 
T5.8 
75.9 b 

15.10 c 

T5.11 (a) Here is a completed table, with T indicating that the teacher 


OOo ero & Oo 


2 
wins and Y indicating that you win. P(teacher wins) = — = 0.5625. 


1 2 3 4 5 6 7 8 
1 _ T T T T T T T 
2 Y _ T T T iL: T T 
3 Y Y — T T T T T 
4 Y y¥ _ T T T T 
5 Y Y Y Y _ T T T 
6 Y Y Y Y Y _ T T 


(b) PAUB) = P(A) + P(B) — (ANB) ae ; = : 


48 48 
aie 0.5625 d t ] 
4g 7 0:029 does not equa 


(c) Not independent. P(A) = 
P(A|B)= 2 = 0.625. 


T5.12 (a) 


0® Defective 


02% Defective 
B <ay 
OK 


Defective 


Part 


(b) P(defective) = (0.60)(0.10) + (0.30)(0.30) + (0.10)(0.40) = 


0.06 
0.19. (c) Machine B. P(A| defective) = 019 = 0.3158. 
0.09 0.04 | 
P(B| defective) = 019 0.4737.P(C | defective) = 019 > 0.2105. 
15.13 (a) Here is a two-way table that summarizes this 
information: 
Smokes Does not Total 
smoke 
Cancer 0.08 0.04 0.12 
No cancer 0.17 0.71 0.88 
Total 0.25 0.75 1.00 
0.08 
P(gets cancer | smoker) = = 0.32. 


0.25 — 


Solutions S-25 


0.08 = 0.29. 


(b) P(smokes U gets cancer) = 0.25 + 0.12 


(c) P(cancer) = 0.12, so P(at least one gets cancer) = 1 — 
P(neither gets cancer) = 1 — 0.887 = 0.2256 


15.14 (a) Let 00-16 represent out-ofstate and 17—99 represent 
in-state. Read two-digit numbers until you have found two numbers 
between 00 and 16. Record how many 2-digit numbers you had to 
read. (b) The first sample is +1 05 09 (it took three cars). The sec- 
ond sample is 20 31 06 44 90 50 59 59 88 43 18 80 53 11 (it took 14 
cars). The third sample is 58 44 69 94 86 85 79 67 05 81 18 45 14 
(it took 13 cars). 


Chapter 6 
Section 6.1 


Answers to Check Your Understanding 

page 350: 1. P(X = 3) is the probability that the student got either 
an A oraB. P(X = 3) = 0.68. 

2. P(X < 2) = 0.02 + 0.10 = 0.12 

3. The histogram below is skewed to the left. Higher grades are 
more likely, but there are a few lower grades. 


0.4 


S 
io 


Probability 
i=) 
Nm 


S 
a 


page 355: 1. wy =1.1. If many, many Fridays are randomly 
selected, the average number of cars sold will be about 1.1. 
2. ox = V0.89 = 0.943. The number of cars sold on a randomly 
selected Friday will typically vary from the mean (1.1) by about 
0.943 cars. 


Answers to Odd-Numbered Section 6.1 Exercises 
6.1 (a) 


Value 0 1 2 3 4 
Probability 1/16 4/16 6/16 4/16 1/16 


(b) The histogram below shows that this distribution is symmetric 
with a center at 2. 


0.4 


0.3 


0.2 


Probability 


0.1 


0.0 T T 


(c) P(X S 3) = 15/16 = 0.9375. There is a 0.9375 probability that 
you will get three or fewer heads in 4 tosses of a fair coin. 

6.3 (a) P(X = 1) = 0.9. (b) The event X S 2 is “at most two non- 
word errors.” P(X < 2) = 0.6. P(X < 2) = 0.3. 

6.5 (a) All of the probabilities are between 0 and 1 and they sum to 
1. (b) The histogram below is unimodal and skewed to the right. 


S-26 Solutions 


0.30 
0.25 
0.20 
0.15 
0.10 
0.05 
0.00 T T T T 


Probability 


(c) The event X = 6 is the event that “the first digit in a randomly 
chosen record is a 6 or higher.” P(X = 6) = 0.222. (d) P(X = 5) = 

0.778. 

6.7 (a) The outcomes that make up the event A are 7, 8, and 9. 
P(A) = 0.155. (b) The outcomes that make up the event B are 1, 3, 
5,7, and 9. P(B) = 0.609. (c) The outcomes that make up the event 
“A or B” are 1, 3, 5, 7, 8, and 9. P(A or B) = 0.660. This is not the 
same as P(A) + P(B) because the outcomes 7 and 9 are included in 
both events. 


6.9 (a) 


x -$1 $2 
Probability 0.75 0.25 


(b) E(X) = —$0.25. If the player makes many $1 bets, he will lose 
about $0.25 per $1 bet, on average. 

6.11 py = 2.1. If many, many undergraduates performed this task, 
they would make about 2.1 nonword errors, on average. 

6.13 (a) This distribution is symmetric and 5 is located at the cen- 
ter. (b) According to Benford’s law, E(X) = 3.441. To detect a fake 
expense report, compute the sample mean of the first digits. A value 
closer to 5 suggests a fake report and a value near 3.441 is consistent 
with a truthful report. (c) P(Y > 6) = 3/9 = 0.333. Under Benford’s 
law, P(X > 6) = 0.155. To detect a fake expense report, compute 
the proportion of first digits that begin with 7, 8, or 9. A value closer 
to 0.333 suggests a fake report and a value closer to 0.155 is consis- 
tent with a truthful report. 

6.15 oy = V1.29 = 1.1358. The number of nonword errors in a 
randomly selected essay will typically differ from the mean (2.1) by 
about 1.14 words. 

6.17 (a) oy = V6.667 = 2.58. (b) ox = V6.0605 = 2.4618. This 
would not be the best way to tell the difference between a fake and 
a real expense report because the standard deviations are similar. 
6.19 (a) See the following histograms. The distribution of the 
number of rooms is roughly symmetric for owners and skewed to 
the right for renters. Renter-occupied units tend to have fewer 
rooms than owner-occupied units. ‘There is more variability in the 


number of rooms for owner-occupied units. 


0.25 


0.20 
0.15 


0.10 


Probability 


0.05 


0.00 | ——__ oe 


12345 67 8 9 10 


Number of rooms in 
owner-occupied units 


0.4 


0.3 


0.2 


Probability 


0.1 


0.0 


T T T T TTT T T T 
123 45 67 8 9 10 
Number of rooms in 
renter-occupied units 


(b) Owner: px = 6.284 rooms. Renter: fy = 4.187 rooms. Single 
people and younger people are more likely to rent and need less 
space than people with families. (c) cx = V2.68934 = 1.6399. 
The number of rooms in a randomly selected owner-occupied unit 
will typically differ from the mean (6.284) by about 1.6399 rooms. 
oy = V1.71003 = 1.3077. The number of rooms in a randomly 
selected renter-occupied unit will typically differ from the mean 
(4.187) by about 1.3077 rooms. 

6.21 (a) P(X > 0.49) = 0.51. (b) P(X = 0.49) = 0.51. 

(c) P(0.19 = X < 0.37 or 0.84 < X S 1.27) = 0.18 + 0.16 = 0.34 
6.23 The time Y ofa randomly chosen student has the N(7.11, 0.74) 
distribution. We want to find P(Y < 6). z = 6 0 -— = —1.50 and 
P(Z > —1.50) = 0.0668. Using technology: normalcd£ (lower: 
—1000,upper:6,:7.11,0:0.74) = 0.0668. There is about 
a 7% chance that this student will run the mile in under 6 minutes. 
6.25 (a) The speed Y of a randomly chosen serve has the N(115, 6) 
distribution. We want to find P(Y > 120).z = ee = 0.83 
and P(Z > 0.83) = 0.2033. Using technology: normalcdf 
(lower:120,upper:1000,:115,0:6) =0.2023. There is 
a 0.2023 probability of selecting a serve that is greater than 
120 mph. (b) The line above 120 has no area, so 
P(Y = 120) = P(Y > 120) = 0.2023. (c) We want to find ¢ such 


1.04 = + gives c = 108.76. 


Using technology: invNorm(area:0.15,y:115,0:6) = 
108.78. Fifteen percent of Nadal’s serves will be less than or equal to 
108.78 mph. 

6.27 b 

6.29 c 

6.31 Yes. The mean difference (post — pre) was 5.38 and the 


that P(Y = c) = 0.15. Solving 


median difference was 3. This means that at least half of the stu- 
dents improved their reading scores. 
6.33 predicted post-test = 17.897 + 0.78301 (pretest). 


Section 6.2 


Answers to Check Your Understanding 

page 367: 

1. Y = 500X. pry = 500(1.1) = $550. oy = 500(0.943) = $471.50. 
2. T=Y— 75. pp = 550 — 75 = $475. op = $471.50. 

page 376: 1. pr = 1.1 + 0.7 = 1.8. Over many Fridays, this deal- 
ership sells or leases about 1.8 cars in the first hour of business, on 


average. 
2. oF = (0.943) + (0.64)? = 1.2988, so op = V/1.2988 = 1.14. 

3. og = 500(1.1) + 300(0.7) = $760. of = (500)7(0.943)? + 
(300)7(0.64)? = 259,176.25, so og = V/259,176.25 = $509.09. 
page 378: 1. wp =1.1-0.7=0.4. Over many Fridays, this 
dealership sells about 0.4 cars more than it leases during the first 
hour of business, on average. 


2. of = (0.943) + (0.64)? = 1.2998, so op = V1.2998 = 1.14. 
3. pg = 500(1.1) — 300(0.7) = $340. 0% = (500)7(0.943)? + 


(300)(0.64)? = 259,176.25, so og = V259,176.25 = $509.09. 


Answers to Odd-Numbered Section 6.2 Exercises 

6.35 pay = 2.54(1.2) = 3.048 cm and oy = 2.54(0.25) = 0.635 cm. 
6.37 (a) The distribution shown below is skewed to the left. Most 
of the time, the ferry makes $20 or $25. 


0.4 


S 
io 


Probability 
o 
nN 


0.1 


(b) jum = $19.35. If many ferry trips were selected at random, 
the ferry would collect about $19.35 per trip, on average. 
(c) om = $6.45. The amounts collected on randomly selected ferry 
trips will typically vary by about $6.45 from the mean ($19.35). 
6.39 (a) pic = 5(7.6) + 50 = 88. (b) a¢ = 5(1.32) = 6.6. (c) 02 = 
(Sax)? = 250%. The variance of G is 25 times the variance of X. 
6.41 (a) wy = — $0.65. If many ferry trips were selected at ran- 
dom, the ferry would lose about $0.65 per trip, on average. 
(b) oy = $6.45. The amount of profit on randomly selected ferry 
trips will typically vary by about $6.45 from the mean (—$0.65). 
6.43 py = 6(3.87) — 20 = $3.22. ay = 6(1.29) = $7.74. 

6.45 (a) py = 47.3°F. oy = 4.05 °F. (b) Y has the N(47.3, 4.05) dis- 
tribution. We want to find P(Y < 40). z= “ae = — 1.80 
and P(Z < —1.80) = 0.0359. Using technology: normalcdf 
(lower:—1000,upper:40,:47.3,0:4.05) =0.0357. 
There is a 0.0357 probability that the midnight temperature in the 
cabin is below 40°F. 

6.47 (a) Yes. The mean of a sum is always equal to the sum of the 
means. (b) No, because it is not reasonable to assume that X and Y 
are independent. 

6.49 uy, +y, = (—0.65) + (—0.65) = — $1.30. of 4y, = 6.457 + 
6.452 = 83.205, so oy, +, = V83.205 = $9.12. 

6.51 psx = 3(2.1) = 6.3 and o3x = 3(1.136) = 3.408. purxy = 2(1.0) 
= 2.0 and ozy = 2(1.0) = 2.0. Thus, psy42y = 6.3 + 2.0 = 8.3 
and o3y+2y = 3.408? + 2.0? = 15.6145, so osxa2y = V15.6145 
= 3.95. 

6.53 (a) py_x = 1.0- 2.1 = — 1.1. If you were to select many 
essays, there would be about 1.1 fewer word errors than 
nonword errors, on average. oy_x = (1.0)? + (1.136)? = 2.2905, so 
oy-x = V2.2905 = 1.51. The difference in the number errors will 
typically vary by about 1.51 from the mean (—1.1). (b) The out- 
comes that make up this event are 1-0=1, 2-0=2, 
2-—1=1,3-0=3,3-1=2,3 —2=1. There isa 0.15 prob- 
ability that a randomly chosen student will have more word errors 


than nonword errors. 

6.55 The difference in score deductions for a randomly selected 
essay is 3X — 2Y. psy = 3(2.1) = 6.3 and o3y = 3(1.136) = 3.408. 
L2y = 2(1.0) = 2.0 and 027y = 2(1.0) = 2.0. Thus, /3x-2Y = 
6.3-20=43 and  ofy_. = 3.408? + 2.0? = 15.6145, so 
O3x-2y = V15.6145 = 3.95. 


Solutions S-27 


6.57 pux,+x, = 303.35 + 303.35 = $606.70 and 
O%, +x, = 9707.57° + 9707.57* = 188,473,830.6, so 


——— l 
Ox, +x = W188,473,830.6 = $13,728.58. W = 5 (Xi +X), so 


l l 
poy = (606.70) = $303.35 and ow = 5(13,728.58) = $6864.29. 


6.59 (a) Normal with mean = 11 + 20 = 31 seconds and standard 


deviation = V2? + 4? = 4.4721 seconds. (b) We want to find the 
probability that the total time is less than 30 seconds. 
30 — 31 
z= = = 0, é Z < —0. = 0). . Usi 
4472] 0.22 and P(Z 0.22) 0.4129. Using 
technology: normalcdf (lower :—1000,upper:30,p:31,0: 
4.4721) =0.4115. There is a 0.4115 probability of completing 
the process in less than 30 seconds for a randomly selected part. 
6.61 Let T =the total team swim time. pep = 55.2 + 58.0 + 
56.3 + 54.7 = 224.2 seconds and of = (2.8)? + (3.0)7 + (2.6)? 
(2.7) = 30.89, so op = 30.89 = 5.56 seconds. Thus, T has the 
N(224.2, 5.56) distribution. We want to find P(T < 220). 
2742 
= a ae = —0.76 and P(Z < —0.76) = 0.2236. Using 
technology: normalcdf (lower:—1000,upper:220,: 
224.2,0:5.56) = 0.2250. There is a 0.2250 probability that the 
total team time is less than 220 seconds in a randomly selected race. 
6.63 Let D =X, — X= the difference in NOX levels. up = 
14-—14=0 and OX,-x, = O*, ! OK, = 0,37 + 0.37 =0.18, so 
Ox,-x, = V0.18 = 0.4243. Thus, D has the N(0, 0.4243) distribu- 
tion. We want to find P(D > 0.8 or D < —0.8) = P(D > 0.8) + 
0.8 — 0 — 0.8 —0 
< —0.8).z= = =——__ = ~], 
P(D 0.8). z 0.4243 1.89 and z 0.4243 1.89 
and P(Z < —1.89 or Z > 1.89) = 0.0588. Using technology: 


1 — normalcdf (lower:—0.8,upper:0.8,»:0,0: 
0.4243) = 0.0594. There is a 0.0594 probability that the 
difference is at least as large as the attendant observed. 

6.65 c 

6.67 (a) Fidelity Technology Fund, because its correlation is 
larger. (b) No, the correlation doesn’t tell us anything about the 
values of the variables, only about the strength of the linear relation- 
ship between them. 


Section 6.3 


Answers to Check Your Understanding 
page 389: 1. Binomial. Binary? “Success” = getanace. “Failure” 
= don’t get an ace. Independent? Because you are replacing the 


& 


card in the deck and shuffling each time, the result of one trial does 
not tell you anything about the outcome of any other trial. Number? 
n = 10. Success? The probability of success is p = 4/52 for each 
trial. 2. Not binomial. Binary? “Success” = over 6 feet. “Failure” 
= not over 6 feet. Independent? Because we are selecting without 
replacement from a small number of students, the observations are 
not independent. Number? n = 3. Success? The probability of suc- 
cess will not change from trial to trial. 3. Not binomial. Binary? 
“Success” = roll a 5. “Failure” = don’t roll a 5. Independent? 
Because you are rolling a die, the outcome of any one trial does not 
tell you anything about the outcome of any other trial. Number? n 
= 100. Success? No. The probability of success changes when the 
corner of the die is chipped off. 
page 397: 1. Binary? “Success” = question answered correctly. 
“Failure” = question not answered correctly. Independent? The 
computer randomly assigned correct answers to the questions, so 


S-28 Solutions 


knowing the result of one trial (question) should not tell you 
anything about the result on any other trial. Number? n = 10. 
Success? The probability of success is p = 0.20 for each trial. 


2. PX = 3) = (PJo2rsy = 0.2013. There is a 20% chance 


that Patti will answer exactly 3 questions correctly. 

3. P(X = 6)=1-—PX $5) =1- 0.9936 = 0.0064. There is 
only a 0.0064 probability that a student would get 6 or more correct, 
so we would be quite surprised if Patti was able to pass. 

page 400: 1. py = 10(0.20) = 2. If many students took the quiz, 
we would expect students to get about 2 answers correct, on aver- 


age. 2. ox = /10(0.20)(0.80) = 1.265. If many students took the 
quiz, we would expect individual students’ scores to typically vary 
from the mean of 2 correct answers by about 1.265 correct answers. 
3. P(X > 2+ 2(1.265)) = PX > 4.53)=1-PXs4= 

1 — 0.9672 = 0.0328. 

page 408: 1. Die rolls are independent, the probability of getting 
doubles is the same on each roll (1/6), and we are repeating the 
chance process until we get a success (doubles). 

2. PT =3)= (=) (;) = 0.1157. There is a 0.1157 probability 
that you will get the first set of doubles on the third roll of the 


dice. 3. P(T = 3) : (Z\(Z) (2) (Z) = 042. 


Answers to Odd-Numbered Section 6.3 Exercises 
6.69 Binomial. Binary? “Success” = seed germinates and “Failure” 
= seed does not germinate. Independent? Yes, because the seeds 


were randomly selected, knowing the outcome of one seed 
shouldn’t tell us anything about the outcomes of other seeds. 
Number? n = 20 seeds. Success? p = 0.85. 

6.71 Not binomial. Binary? “Success” = person is left-handed and 
“Failure” = person is right-handed. Independent? Because stu- 
dents are selected randomly, their handedness is independent. 
Number? There is not a fixed number of trials for this chance pro- 
cess because you continue until you find a left-handed student. 
Success? p = 0.10. 

6.73 (a) Binomial. Binary? “Success” = reaching a live person and 
“Failure” = any other outcome. Independent? Knowing whether 
or not one call was completed tells us nothing about the outcome 
on any other call. Number? n = 15. Success? p = 0.2. (b) This is 
not a binomial setting because there are not a fixed number of 
attempts. ‘The Binary, Independent, and Success conditions are sat- 
isfied, however, as in part (a). 


6.75 PX =4)= (JJosnesor = 0.2304. There is a 0.2304 


probability that exactly + of the 7 elk survive to adulthood. 


6.77 P(X >4 = (Z)co-9710.567 ++++=0.1402. Because this 


probability isn’t very small, it is not surprising for more than 4 elk to 
survive to adulthood. 


6.79 (a) P(X = 17) = (3S )osso.sy = 0.2428. 


(b) P(X = 12)= (7 )eossro.5)"+ seed (7 )cossyr00.159 
= 0.0059. Because this is such a low probability, Judy should be 


suspicious. 


6.81 (a) px = 15(0.20) = 3. If we watched the machine make 
many sets of 15 calls, we would expect about 3 calls to reach a live 
person, on average. (b) ox = V15(0.20)(0.80) = 1.55. If we 
watched the machine make many sets of 15 calls, we would expect 
the number of calls that reach a live person to typically vary by 
about 1.55 from the mean (3). 

6.83 (a) wy = 15(0.80) = 12. Notice that wy = 3 and 12 + 3 = 15 
(the total number of calls). (b) ay = V15(0.80)(0.20) = 1.55. This 
is the same value as ox, because Y = 15 — X and adding a constant 
to a random variable doesn’t change the spread. 

6.85 (a) Binary? “Success” = win a prize and “Failure” = don’t win 
a prize. Independent? Knowing whether one bottle wins or not 
should not tell us anything about the caps on other bottles. Number? 
n = 7. Success? p = 1/16. (b) py = 1.167. If we were to buy many 
sets of 7 bottles, we would get 1.167 winners per set, on average. 
ox = 0.986. If we were to buy many sets of 7 bottles, the number of 
winning bottles would typically differ from the mean (1.167) by 
0.986. (c) P(X = 3) = 1 — P(X S 2) = 0.0958. Because 0.0958 isn’t 
avery small probability, the clerk shouldn’t be surprised. It is plausible 
to get 3 or more winners in a sample of 7 bottles by chance alone. 
6.87 No. Because we are sampling without replacement and the 
sample size (10) is more than 10% of the population size (76), we 
should not treat the observations as independent. 

6.89 If the sample is a small fraction of the population (less than 
10%), the make-up of the population doesn’t change enough to 
make the lack of independent trials an issue. 

6.91 (a) Binary? “Success” = visit an auction site at least once a 
month and “Failure” = don’t visit an auction site at least once a 
month. Independent? We are sampling without replacement, but 
the sample size (500) is far less than 10% of all males aged 18 to 34. 
Number? n= 500. Success? p=0.50. (b) np =250 and 
n(1 — p) = 250 are both at least 10. (c) py = 250 and oy = 11.18. 
Thus, X has approximately the N(250, 11.18) distribution. We want 


235 — 250 
to find P(X = 235). z= 2a = —1.34 and P(Z =—1.34) 
= 0.9099 Using technology: normalcd£ (lower:235, 


upper:1000,:250,0:11.18) = 0.9102. There is a 0.9102 
probability that at least 235 of the men in the sample visit an online 
auction site. 

6.93 Let X be the number of Is and 2s. Then X has a binomial 
distribution with n = 90 and p = 0.477 (in the absence of fraud). 
P(X = 29) = 0.0021. Because the probability of getting 29 or fewer 
invoices that begin with the digits 1 or 2 is quite small, we have 
reason to be suspicious that the invoice amounts are not genuine. 
6.95 (a) Not geometric. We can’t classify the possible outcomes on 
each trial (card) as “success” or “failure” and we are not selecting 
cards until we get a single success. (b) Games of 4-Spot Keno are 
independent, the probability of winning is the same in each game 
(p = 0.259), and Lola is repeating a chance process until she gets a 
success. X = number of games needed to win once is a geometric 
random variable with p = 0.259. 

6.97 (a) Let X = the number of bottles Alan purchases to find one 
winner. P(X = 5) = (5/6)*(1/6) = 0.0804. 

(b) P(X S 8) = (1/6) +--+ + (5/6)(1/6) = 0.7674. 


l 


(b) P(X = 40) = 1 — P(X = 39) = 0.0187. Because the probability 
of not getting an 8 or 9 before the 40th invoice is small, we may 
begin to worry that the invoice amounts are fraudulent. 


6.101 b 


wed 
Managerial & Smoke: 
professional 9.80 : 
Don’t smoke 
022 smoke 
Intermediate <a 
Don’t smoke 
38 


: Smoke 
<ag 
Don’t smoke 


P(smoke) = 0.43(0.20) + 0.34(0.29) + 0.23(0.38) = 0.272 = 27.2% 


Routine & 
manual 


(0.23)(0.38) _ 


o= 21 =32.1% 
(b) P(routine and manual | smoke) 0.272 0.321 = 32.1% 
Answers to Chapter 6 Review Exercises 
R6.1 (a) (PX =5) =1-—0.1 —0.2-0.3 - 0.3 =0.1. 


(b) Discrete, because it takes a fixed set of values with gaps in 
between. (c) P(X S 2) = 0.3. P(X < 2) = 0.1. These are not the 
same because the outcome X=2 is included in the first 
calculation but not the second. (d) py = 1(0.1) +--+ + 5(0.1) = 
3.1. of =(1 — 3.1)9°(0.1) +--+ + G = 3.10.1) = 1.29, so ox = 
V/1.29 = 1.136. 

R6.2 (a) Temperature is a continuous random variable because it 
takes all values in an interval of numbers—there are no gaps 
between possible temperatures. (b) P(X < 540) = P(X = 540) 
because X is a continuous random variable. In this case, 
P(X = 540) = 0 because the line segment above X = 540 has no 
area. (c) Mean = 550 — 550 = 0°C. The standard deviation stays 
the same, 5.7°C, because subtracting a constant does not change 
the variability. (d) In degrees Fahrenheit, the mean _ is 


by = = (550) +32=1022°F and the standard deviation is 


oy = (Z)on = 10.26°F. 


R6.3 (a) If you were to play many games of 4-Spot Keno, you would 
get a payout of about $0.70 per game, on average. If you were to play 
many games of 4-Spot Keno, the payout amounts would typically 
vary by about $6.58 from the mean ($0.70). (b) Let Y be the amount 
of Jerry’s payout. ry = 5(0.70) = $3.50 and ay = 5(6.58) = $32.90. 
(c) Let W be the amount of Marla’s payout. pay = 0.70 + 0.70 + 
0.70 + 0.70 + 0.70 = $3.50 and ow = 6.58" + 6.58" + 6.587 
6.58? + 6.587 = 216.482, so ow = V 216.482 = $14.71. 


(d) Even though their expected values are the same, the casino 


would probably prefer Marla since there is less variability in her 

strategy and her winnings are more predictable. 

R6.4 (a) C follows a N(10, 1.2) distribution and we want to find 
=i) 

P(C > 11).z= 1D De 0.83 and P(Z > 0.83) = 0.2033. 

Using “normalcd£ (lower:11,upper:1000, 


p:10,0:1.2) =0.2023. There is a 0.2023 _ probability 
that a randomly selected cap has a strength greater than 11 


technology: 


inch-pounds. (b) The machine that makes the caps and the 
machine that applies the torque are not the same. (c) C — T is 
Normal with mean 10—7=3 > inch-pounds and _ standard 


deviation V0.9? + 1.2? = 1.5 inch-pounds. (d) We want to find 


Solutions S-29 


0 
PC -T< O.2- = —2 and 
Using — technology: normalcdf (lower :—1000 ,upper:0, 
p:3,0:1.5) = 0.0228. There is a 0.0228 probability that a ran- 


domly selected cap will break when being fastened by the machine. 


P(Z < —2) = 0.0228. 


R6.5 (a) Binary? “Success” = orange and “Failure” = not orange. 
Independent? The sample of size n = 8 is less than 10% of the 
large bag, so we can assume the outcomes of trials are independent. 
Number? n = 8. Success? p = 0.20. (b) pax = 8(0.2) = 1.6. If we 
were to select many samples of size 8, we would expect to get about 
1.6 orange M&M’S, on average. (c) ox = V8(0.2)(0.8) = 1.13. If 
we were to select many samples of size 8, the number of orange 
M&M’S would typically vary by about 1.13 from the mean (1.6). 


R6.6 (a) P(X = 0) = (Foros = 0.1678. Because the prob- 


ability is not that small, it would not be surprising to get no 
orange M&M’S in a sample of size 8. (b) P(X =5)= 


(E)oarosor +-+-++= 0.0104 Because the probability is small, 


it would be surprising to find 5 or more orange M&M’S ina sample 
of size 8. 
R6.7 Let Y be the number of spins to get a “wasabi bomb.” Y is a 


geometric random variable with p= -. = 0.25. P(Y = 3)= 


(0.75)7(0.25) + (0.75)(0.25) + 0.25 = 0.5781. 
R6.8 (a) Let X be the number of heads in 10,000 tosses. 
Lx = 10,000(0.5) = 5,000 and oy= /10,000(0.5)(0.5) = 50. 
(b) np = 10,000(0.5) = 5,000 and n(1 — p) = 10,000(0.5) = 5000 
are both at least 10. (c) We want to find P(X S 4933 or X = 5067). 
ae = Ce ter = 134 sa 
P(Z = — 1.34) + P(Z = 1.34) =0.1802. Using __ technology: 
1 —-— normalcdf (lower:4933,upper:5067,»:5000, 
o:50) = 0.1802. Because this probability isn’t small, we don’t 
have convincing evidence that Kerrich’s coin was unbalanced—a 
difference this far from 5000 could be due to chance alone. 


Answers to Chapter 6 AP® Statistics Practice Test 


T6.1 b 
T6.2 d 
T6.3 <d 
T6.4 e 
T6.5 d 
T6.6 b 
T6.7 c 
T6.8 b 
T6.9 b 
T6.10 ¢ 
TO6.11 (a) P(Y S 2) = 0.96. (b) py = 0(0.78) +++» = 0.38. If we 
were to randomly select many cartons of eggs, we would 
expect about 0.38 to be broken, on average. (c) of = 


(0 — 0.38)7(0.78) + ... = 0.6756. So oy = V 0.6756 = 0.8219. 

If we were to randomly select many cartons of eggs, the number of 
broken eggs would typically vary by about 0.6756 from the mean 
(0.38). (d) Let X stand for the number of cartons inspected to find 
one carton with at least 2 broken eggs. X is a geometric random 
variable with = p=0.11. PX S 3) = (0.11) + (0.89)(0.11) + 
(0.89)7(0.11) = 0.2950. 


S-30 Solutions 


T6.12 (a) Binary? “Success” = dog first and “Failure” = not 
dog first. Independent? We are sampling without replacement, 
but 12 is less than 10% of all dog owners. Number? n= 12. 


Success? p = 0.66. (b) P(X = 4) = (1 )eo.coro.s4y” ranees 


12 
( 4 )(0.66)"0.34 = 0.0213. Because this probability is small, it 


is unlikely to have only 4 or fewer owners greet their dogs first by 

chance alone. This gives convincing evidence that the claim by the 

Ladies Home Journal is incorrect. 

T6.13 (a) pp =50—25=25 minutes, of = 100 + 25 = 125, 

and op = 125 = 11.18 minutes. (b) D follows a N(25, 11.18) dis- 
0-25 


tribution and we want to find P(D < 0). z= Tis = — 2,24 


and P(Z << — 2.24) = 0.0125. Using technology: normalcdf 
(lower:—1000,upper:0,»:25,0:11.18) =0.0127. 
There is a 0.0127 probability that Ed spent longer on his assign- 
ment than Adelaide did on hers. 
T6.14 (a) Let X stand for the number of Hispanics in the sample. 
bx = 1200(0.13) = 156 and oy = V1200(0.13)(0.87) = 11.6499. 
(b) 15% of 1200 is 180, so we want to find P(X = 180) = 
1200 ; 
( — oasy%o.s7 +-+++= 0.0235. Because this probability 
is small, it is unlikely to select 180 or more Hispanics in the sample 
just by chance. This gives us reason to be suspicious about the sam- 
pling process. 


Chapter 7 
Section 7.1 


Answers to Check Your Understanding 

page 425: 1. Parameter: js = 20 ounces. Statistic: x = 19.6 
ounces. 2. Parameter: p = 0.10, or 10% of passengers. Statistic: 
p = 0.08, or 8% of the sample of passengers. 

page 428: 1. Individuals: M&M’S Milk Chocolate Candies; vari- 
able: color; and parameter of interest: proportion of orange M&M’S. 
The graph below shows the population distribution. 


25 


20 


Percent 
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Color 


2. The graph below shows a possible distribution of sample data. 


wi 
For this sample there are 11 orange M&M’S, so p = 50 = 0.22. 


Frequency 


T Uy i T T 
Blue Orange Green Yellow Red Brown 
Color 


3. The middle graph is the approximate sampling distribution of p 
because the center of the distribution should be at approximately 


0.20. The first graph shows the distribution of the colors for one 
sample and the third graph is centered at 0.40 rather than 0.20. 
page #34: 1. No. The mean of the approximate sampling distribu- 
tion of the sample median (73.5) is not equal to the median of the 
population (75). 2. Smaller. Larger samples provide more precise 
estimates because larger samples include more information about 
the population distribution. 3. Skewed to the left and unimodal. 


Answers to Odd-Numbered Section 7.1 Exercises 

7.1 (a) Population: all people who signed a card saying that they 
intend to quit smoking. Parameter: the proportion of the population 
who actually quit smoking. Sample: a random sample of 1000 peo- 
ple who signed the cards. Statistic: the proportion of the sample 
who actually quit smoking; p = 0.21. (b) Population: all the turkey 
meat. Parameter: minimum temperature in all of the turkey meat. 
Sample: four randomly chosen locations in the turkey. Statistic: 
minimum temperature in the sample of four locations; sample 
minimum = 170°F. 

7.3, = 2.5003 is a parameter and x = 2.5009 is a statistic. 

7.5 p = 0.48 isa statistic and p = 0.52 is a parameter. 

7.7 (a) 2 and 6 (x = 4), 2 and 8 (5), 2 and 10 (6), 2 and 10 (6), 2 
and 12 (7), 6 and 8 (7), 6 and 10 (8), 6 and 10 (8), 6 and 12 (9), 8 
and 10 (9), 8 and 10 (9), 8 and 12 (10), 10 and 10 (10), 10 and 12 
(11), 10 and 12 (11). (b) The sampling distribution of x is skewed to 
the left and unimodal. The mean of the sampling distribution is 8, 
which is equal to the mean of the population. The values of x vary 
from 4 to 11. 
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7.9 (a) In one simulated SRS of 100 students, there were 73 stu- 
dents who did all their assigned homework. (b) The distribution is 
reasonably symmetric and bell-shaped. It is centered at about 0.60. 
Values vary from about 0.47 to 0.74. There don’t appear to be any 
outliers. (c) Yes, because there were no values of p less than or 
equal to 0.45 in the simulation. (d) Because it would be very sur- 
prising to get a sample proportion of 0.45 or less in an SRS of size 
100 when p = 0.60, we should be skeptical of the newspaper’s 
claim. 
7.11 (a) A graph of the population distribution is shown below. 
0.60 
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(b) Answers will vary. An example bar graph is given. 
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7.13 (a) Skewed to the right with a center at 9(°F)?. The values 
vary from about 2 to 27.5(°F)*. (b) A sample variance of 25(°F)” 
provides convincing evidence that the manufacturer’s claim is false 
and that the thermostat actually has more variability than claimed 
because a value this large was rare in the simulation. 
7.15 Ifwe chose many SRSs and calculated the sample mean x for 
each sample, we will not consistently underestimate ju or consis- 
tently overestimate ju. 
7.17 A larger random sample will provide more information and, 
therefore, more precise results. 
7.19 (a) Statistics ii and iii, because the means of their sampling 
distributions appear to be equal to the population parameter. 
(b) Statistic ii, because it is unbiased and has very little variability. 
Wael 
ED a 
7.25 (a) We are looking for the percentage of values that are 2.5 
standard deviations or farther below the mean in a Normal distribu- 
tion. In other words, we are looking for P(Z =—2.5). Using Table 
A, P(Z =—2.5) = 0.0062. Using technology:normal cdf (lower: 
—1000,upper:—2.50,u:0,0:1) = 0.0062. Less than 1% of 
healthy young adults have osteoporosis. (b) Let X be the BMD for 
women aged 70 — 79 on the standard scale. Then X follows a 
N(—2, 1) distribution and we want to find P(X S—2.5). 
2 =75 = (2) 


Zz i =-0.5 and P(Z=s—0.5) = 0.3085. Using 


technology: 
p:-2,0:1) = 0.3085. About 31% of women aged 70 — 79 have 
osteoporosis. 

Section 7.2 


Answers to Check Your Understanding 
page 445: 1. p,=p =0.75. 2. The standard deviation of the 


‘ |= 0.75(0.25 
sampling distribution of p is of = ca 7 P) -/ aa De 


0.0137. There are more than 10(1000) = 10,000 young adult 


normalcdf (lower:—1000,upper:—2.5, 


Internet users, so the 10% condition has been met. 3. Yes. Both 
np = 1000(0.75) = 750 and n(1 — p) = 1000(0.25) = 250 are at 
least 10. 4. The sampling distribution would still be approximately 
Normal with mean 0.75. However, the standard deviation would be 


l= 0.75(0.25 
smaller by a factor of 3: 05 = na P) -/ a - 0.0046. 
n 


Answers to Odd-Numbered Section 7.2 Exercises 

7.27 (a) We would not be surprised to find 8 (32%) orange candies 
because values this small happened fairly often in the simulation. 
However, there were few samples in which there were 5 (20%) or 
fewer orange candies. So getting 5 orange candies would be surpris- 
ing. (b) A sample of 50, because we expect to be closer to p = 0.45 
in larger samples. 


(l—p)— /0.45(0.55 
7.29 (a) ug =p = 0.45. (b) 95 -,/? A ° = 7 


0.0995. The 10% condition is met because there are more than 
10(25) = 250 candies in the large machine. (c) Yes, because 
np = 25(0.45) = 11.25 and n(1 — p) = 25(0.55) = 13.75 are both 
at least 10. (d) The sampling distribution would still be approxi- 
mately Normal with a mean of jg = 0.45. However, the standard 


]- 0.45(0.55 
deviation decreases to of = via FA P) = ‘ . ) = 0.0497. 


Solutions S-31 


7.31 (a) No, because more than 10% of the population (10/76 = 13%) 
was selected. (b) No, because the sample size was onlyn = 10. Neither 
np nor n(1 — p) will be at least 10. 

7.33 The Large Counts condition is not met 
np = 15(0.3) = 4.5 < 10. 


/0.7(0.3 
7.35 (a) ug = p = 0.70. (b) of = = 0.0144. The 10% 


condition is met because the sample of size 1012 is less than 10% of 
the population of all US. adults. (c) Yes, because 
np = 1012(0.70) = 708.4 and n(1 — p) = 1012(0.30) = 303.6 are 
both at least 10. (d) We want to find P(p < 0.67).z= 
POT MWD 908 and FY 2 PHe)= WORE: Uanaeeinal 
—o01l44. § and P(Z =—2.08) = 0.0188. Using technol- 
ogy: normalcdf (lower:—1000,upper:0.67,»:0.70,0: 
0.0144) =0.0186. There is a 0.0186 probability of obtaining a 
sample in which 67% or fewer say they drink the milk. Because this is 
a small probability, there is convincing evidence against the claim. 
7.37 4048, because using 4n for the sample size halves the stan- 
dard deviation (V4n = 2Vn). 

7.39 jug = 0.70. Because 267 is less than 10% of the population of 


/0.7(0.3 
women, 0% = a ) = 0.0280. Because np = 


267(0.7) = 186.9 and n(1 — p) = 267(0.3) = 80.1 are both at least 
10, the sampling distribution of p can be approximated by a Normal 


because 


college 


a 0.75 — 0. 
distribution. We want to find P(p = 0.75). z = a = 1.79 
and P(Z = 1.79) = 0.0367. Using technology: normalcdf 


(lower:0.75,upper:1000,u:0.7,0:0.0280) =0.0371. 
There is a 0.0371 probability that 75% or more of the women in the 
sample have been on a diet within the last 12 months. 

7.41 (a) us = 0.90. Because 100 is less than 10% of the population 


/0.90(0.10 
of orders, of = — = 0.03. Because np = 100(0.90) = 90 


and n(1 — p) = 100(0.10) = 10 are both at least 10, the sampling 
distribution of f can be approximated by a Normal distri- 
. ; ny 0.86 — 0.90 

bution. We want to find P(p = 0.86). z= 003. > — 1.33 
and P(Z =—1.33) = 0.0918. Using technology: normalcdf 
(lower :—1000,upper:0.86,0:0.90,0:0.03) =0.0912. 
There is a 0.0912 probability that 86% or fewer of orders in an SRS 
of 100 were shipped within 3 working days. (b) Because the proba- 
bility isn’t very small, it is plausible that the 90% claim is correct 
and that the lower than expected percentage is due to chance alone. 
7.43 a 

7.45 b 


7.47 The Venn diagram is shown below. 


D S 


62% 


62% neither download nor share music files. 


S-32 Solutions 


Section 7.3 


Answers to Check Your Understanding 
page 456: 1. X = length of pregnancy follows a N(266, 16) distri- 
270 — 26 

bution and we want to find P(X > 270). z = a = 0.25 and 
P(Z > 0.25) = 0.4013. Using technology: normalcdf (lower: 
270,upper:1000,»:266,0:16) = 0.4013. There is a 0.4013 
probability of selecting a woman whose pregnancy lasts for more 
than 270 days. 2. juz = = 266 days 3. The sample of size 6 is 


16 
less than 10% of all pregnant women, so oz = 7 == 6.532 


Vn V6 
days. 4. x follows a N(266, 6.532) distribution and we want to find 
270 — 266 

P(x > 270).z= 6532 =0.61 and P(Z > 0.61) = 0.2709. 
Using technology: normalcdf (lower:270,upper:1000, 
p:266,0:6.532) = 0.2701. There is a 0.2701 probability of 
selecting a sample of 6 women whose mean pregnancy length 
exceeds 270 days. 


Answers to Odd-Numbered Section 7.3 Exercises 
7.A9 ju = 6 = 255 seconds. Because the sample size (10) is less 
than 10% of the population of songs on David’s iPod, 


oO 60 
O~ = —= = —= = 18.974 seconds. 
: Vn V10 
a she age oo ed 
Vn 30 


7.53 (a) Normal with 4, = ps = 188 mg/dl. Because the sample 
size (100) is less than 10% of all men aged 20 to 34, o-= 
0 4] 
—= = == = 4.1 mg/dl. (b) We want to find P(185 = x = 191). 
Vn V100 
_ 185 — 188 | 191 — 188 


coca. a — 0.73 andz= 4] 


P(—0.73 = Z = 0.73) = 0.5346. Using technology: normalcdf 
(lower:185,upper:191,:188,0:4.1) =0.5357. There 


is a 0.5357 probability that x estimates ys within +3 mg/dl. 
o 4] 


= Fa 1000 


= 0.73 


= 1.30 mg/dl. So x follows a N(188, 1.30) 


distribution and we want to find P(185 = x = 191).z= a = S 
=-231 and z= 231. PL 21272731) 
= 0.9792. Using technology: normalcd£ (lower:185, 


upper:191,:188,0:1.30) =0.9790. There is a 0.9790 
probability that x estimates jz within +3 mg/dl. The larger sample 
is better because it is more likely to produce a sample mean within 
3 mg/dl of the population mean. 

7.55 (a) Let X = amount of cola in a randomly selected bottle. 
X follows the N(298, 3) distribution and we want to find 


295 — 298 
P(X < 295). z= pee 2 —-l1 and P(Z< —1)=0.1587. 
Using technology: normalcdf (lower :—1000, upper: 295, 


p:298,0:3) =0.1587. There is a 0.1587 probability that a 
randomly selected bottle contains less than 295 ml. 
(b) uw = = 298 ml. Because 6 is less than 10% of all bottles pro- 


van = 1.2247 ml. We want to find P& < 295) 


Va V6 


duced, 0 = 


: ein te 295 — 298 
using the N(298, 1.2247) distribution. z = i247 2.45 
and P(Z < — 2.45) = 0.0071. Using technology: normalcdf 
(lower:-—1000,upper:295,»:298,0:1.2247) 
= 0.0072. There is a 0.0072 probability that the mean contents of 
six randomly selected bottles are less than 295 ml. 

7.57 No. The histogram of the sample values will look like the 
population distribution. The CLT says that the histogram of the 
sampling distribution of the sample mean will look more and more 
Normal as the sample size increases. 

7.59 (a) Because the distribution of the play times of the popula- 
tion of songs is heavily skewed to the right and n = 10 < 30. 
(b) Because n = 36 = 30, the CLT applies. w> = pe = 225 sec- 
onds. Because 36 is less than 10% of all songs on David’s iPod, 


oO 60 


05> FF FS 
“ Vn V6 


using the N(225, 10) distribution. z= 


= 10 seconds. We want to find P(x > 240) 


a = 1.50 and 
P(Z > 1.50) = 0.0668. Using technology: normalcd£ (lower: 
240,upper:1000,:225,0:10) = 0.0668. There is a 0.0668 
probability that the mean play time is more than 240 seconds. 

7.61 (a) We do not know the shape of the distribution of passenger 
weights. (b) We want to find P(x > 6000/30) = P(x > 200). 
Because the sample size is large (n = 30 = 30), the distribution of 
x is approximately Normal with == p= 190 pounds. Because 
n=30 is less than 10% of all 


possible _ passengers, 
200 — 
o>= = =e = 6.3901 pounds. z= a 1.56 and 
Vn V30 6.3901 


P(Z > 1.56) = 0.0594. Using technology: normalcd£ (lower: 
200, upper:1000,:190,0:6.3901) = 0.0588. There is a 
0.0588 probability that the mean weight exceeds 200 pounds. 
7.63 Because the sample size is large (n = 10,000 = 30), the sam- 
pling distribution of x is approximately Normal. y= = po = $250. 
Assuming 10,000 is less than 10% of all homeowners with fire insur- 
Oe = $i) Wewaik in ind F975) 
Vn 10,000 sei 
using the N(250, 10) distribution. z= a = 2.50 and 
P(Z = 2.50) = 0.9938. Using technology: normalcdf (lower: 


—1000,upper:275,y:250,0:10) =0.9938. There is a 
0.9938 probability that the mean annual loss from a sample of 


ance, 0 


10,000 policies is no greater than $275. 


7.65 b 
7.67 b 
. 7 ! 1062 ; 
7.69 Didn’t finish high school: 12.470 = 0.0852; high school but 
1977 : 
se ei 7 ; , . 
no college: 37,834 0.0523, less than a bachelor’s degree: 


1462 1097 
34,439 = 0.0425, college graduate: 40,300 = 0.0272. The unem- 
ployment rate decreases with additional education. 
40,390 
7.71 P(inlabor force | college graduate) = 51.582 = 0.7830. 


Answers to Chapter 7 Review Exercises 


R7.1 The population is the set of all eggs shipped in one day. The 
sample consists of the 200 eggs examined. ‘The parameter is the 
proportion p = 0.03 of eggs shipped that day that had salmonella. 


7 9 
The statistic is the proportion p = 500 = 0.045 of eggs in the sam- 


ple that had salmonella. 


R7.2 (a) A sketch of the population distribution is given below. 


N(3668, 511) 


T T T T T 
2135 2646 3157 3668 4179 4690 5201 
Birth weight (grams) 


(b) Answers will vary. An example dotplot is given. (c) The dot at 
2750 represents one SRS of size 5 from this population where the 
sample range was 2750 grams. 


e ee e e 
es 


2600 3000 3400 3800 4200 
Birth weight (grams) 


R7.3 (a) No, because sample range is always less than the actual 
range (3417). Ifit were unbiased, the distribution would be centered 
at 3417. (b) ‘Take larger samples. 

R74 (a) us = p = 0.15. (b) Because the sample size of n = 1540 
is less than 10% of the population of all adults, 


/0.15(0.85 
of = ee 0.0091. (c) Yes, because np = 1540(0.15) 


= 231 and n(1 — p) = 1540(0.85) = 1309 are both at least 10. 


‘ ” OR. 
(d) We want to find P(0.13 = p = 0.17). z= 00091 2.20 
0.17 — 0.15 ; a ; 
and z= 90001 2.20. The desired probability _ is 


P(—2.20 = Z = 2.20) = 0.9722. Using technology: normalcdf 
(lower:0.13,upper:0.17,p:0.15,0:0.0091) 
= 0.9720. There is a 0.9720 probability of obtaining a sample in 
which between 13% and 17% are joggers. 

R7.5 (a) ug = p = 0.30. Because 100 is less than 10% of the popu- 


—— /0.30(0.70) _ 
op = 100. 0.0458. Because 


np = 100(0.30) = 30 and n(1 — p) = 100(0.70) = 70 are both at 
least 10, the sampling distribution of f can be approximated by 


lation of — travelers, 


a Normal distribution. We want to find P(p <= 0.20). 
2 ae and P(Z S —2.18) = 0.0146. Usi 
Zz 0.0458 : an = =2, ; . Using 
technology: normalcdf (lower:—1000,upper:0.20,p: 


0.30,0:0.0458) = 0.0145. There is a 0.0145 probability that 
20% or fewer of the travelers get a red light. (b) Because this is a 
small probability, there is convincing evidence against the agents’ 
claim —it isn’t plausible to get a sample proportion of travelers with 
ared light this small by chance alone. 

R7.6 (a) X = WAIS score for a randomly selected individual fol- 
N(100, 15) distribution and we want to find 
me =0.33 and P(Z = 0.33) = 0.3707. 
Using technology: normalcdf (lower:105,upper:1000, 
p:100,0:15) = 0.3694. There is a 0.3694 probability of select- 
ing an individual with a WAIS score of at least 105. 
(b) x= == 100. Because the sample of size 60 is less than 


15 
10% of all adults, op=— —= = 1.9365. (c) x follows a 


Van Veo 


lows a 
P(X = 105). z= 


Solutions S-33 


N(100, 1.9365) distribution and we want to find P(x = 105). 
_ 105 — 100 


z= 71.9365 = 2.58 and P(Z = 2.58) = 0.0049. Using technol- 


ogy: normalcdf (lower:105,upper:1000,u:100,0: 
1.9365) = 0.0049. There is a 0.0049 probability of selecting a 
sample of 60 adults whose mean WAIS score is at least 105. (d) The 
answer to part (a) could be quite different depending on the shape 
of the population distribution. The answer to part (b) would be the 
same because the mean and standard deviation do not depend on 
the shape of the population distribution. Because of the large sam- 
ple size (60 = 30), the answer for part (c) would still be fairly reli- 
able due to the central limit theorem. 

R7.7 (a) Because n = 50 = 30. (b) p> = = 0.5. Because 50 is 
less than 10% of all traps, the standard deviation is 


o- = aT oes 0.0990. Thus, x follows a N(0.5, 0.0990) 


. Vn : v50 0.6 — 0.5 
Ses . _ _ 9.6 = 0.5 _ 
distribution and we want to find P(x = 0.6). z = 9.0990 1.01 


and P(Z= 1.01) =0.1562. Using technology: normalcdf 
(lower:0.6,upper:1000,0:0.5,0:0.0990) =0.1562. 
There is a 0.1562 probability that the mean number of moths is 
greater than or equal to 0.6. (c) No. Because this probability is not 
small, it is plausible that the sample mean number of moths is this 
high by chance alone. 


Answers to Chapter 7 AP® Statistics Practice Test 


T/lc 

Tic 

Tia ¢ 

Ts a 

7S b 

T76-b 

Tia b 

T78 ¢ 

179 ¢ 

17.10 e 

T7.11 A. Both A and B appear to be unbiased, and A has less vari- 

ability than B. 

17.12 (a) We do not know the shape of the population distribution 

of monthly fees. (b) x= = 6 = $38. Because the sample of size 500 

is less than 10% of all households with Internet access, 
_ a — 10 

* Vn W500 

(n = 500 = 30), the distribution of x will be approximately 


39 — 38 
: 1 x> ye = 2; 
Normal. (d) We want to find P(x > 39) 0.4472 2.24 and 


P(Z > 2.24) = 0.0125. Using technology: normalcd£ (lower: 
39,upper:1000,1:38,0:0.4472) =0.0127. There is a 
0.0127 probability that the mean monthly fee exceeds $39. 

T7.13 ug = Pp = 0.22. Because 300 is less than 10% of children 


/0.22(0.78) 
under the age of 6, of = — 309 2.0239. Because 


np = 300(0.22) = 66 and n(1 — p) = 300(0.78) = 234 are both at 
least 10, the sampling distribution of f can be approximated 
by a Normal distribution. We want to find P(p > 0.20). z= 
“a = — 0.84 and P(Z > — 0.84) = 0.7995. Using tech- 
nology: normalcdf (lower:0.20,upper:1000,»:0.22,0: 
0.0239) = 0.7987. There is a 0.7987 probability that more than 
20% of the sample are from poverty-level households. 


o 


= 0.4472. (c) Because the sample size is large 


S-34 Solutions 


Answers to Cumulative AP® Practice Test 2 


AP2.1 
AP2.2 
AP2.3 
AP2.4 
AP2.5 
AP2.6 
AP2.7 
AP2.8 
AP2.9 d 
AP2.10 
AP? A 
AP2.12 
AP2.13 
AP2.14 
AP2.15 
AP2.16 
AP2.17 
AP2.18 
AP2.19 
AP2.20 
AP2.21 
AP2.22 (a) Observational study, because no treatments were 
imposed on the subjects. (b) ‘Two variables are confounded when 
their effects on the cholesterol level cannot be distinguished from 
one another. For example, people who take omega-3 fish oil might 
also exercise more. Researchers would not know whether it was the 


soondg¢goasn 


ooo . oa ca CO GO oO 


omega-3 fish oil or the exercise that was the real explanation for 
lower cholesterol. (c) No. Even though the difference was statisti- 
cally significant, this wasn’t an experiment and taking fish oil is pos- 
sibly confounded with exercise. 
AP2.23 (a) P(type O or Hawaiian-Chinese) = 65,516/145,057 = 
0.452. (b) P(type AB| Hawaiian) = 99/4670 = 0.021. 
(c) P(Hawaiian) = 4670/145,057 = 0.032; P(Hawaiian|type B) = 
178/17,604 = 0.010. Because these probabilities are not equal, the 
two events are not independent. (d) P(type A and white) = 
50,008/145,057 = 0.345. P(at least one type A and white) = 1 
— P(neither are type A and white) = 1 — (1 — 0.345)? = 0.571. 
AP2.24 (a) The distribution of seed mass for the cicada plants is 
roughly symmetric, while the distribution for the control plants is 
skewed to the left. The median seed mass is the same for both 
groups. ‘The cicada plants had a bigger range in seed mass, but the 
control plants had a bigger IOR. Neither group had any outliers. 
(b) The cicada plants. The distribution of seed mass for the cicada 
plants is roughly symmetric, which suggests that the mean should 
be about the same as the median. However, the distribution of seed 
mass for the control plants is skewed to the left, which will pull the 
mean of this distribution below its median toward the lower values. 
Because the medians of both distributions are equal, the mean for 
the cicada plants is greater than the mean for the control plants. 
(c) The purpose of the random assignment is to create two groups 
of plants that are roughly equivalent at the beginning of the experi- 
ment. (d) Benefit: controlling a source of variability. Different types 
of flowers will have different seed masses, making the response 
more variable if other types of plants were used. Drawback: we can’t 
make inferences about the effect of cicadas on other types of plants, 
because other plants might respond differently to cicadas. 
AP2.25 (a) We want to find P(x < 25,000/50) = P(x < 500). 
Because the sample size is large (n = 50 = 30), the distribution of 
x is approximately Normal with z= 4 = 525 pages. Because 


n=50 is less than 10% of all novels in the library, 


a 200 500 — 525 
oz= Ue = eh = 28.28 pages. z= 78.28 
P(Z < —0.88) = 0.1894. Using technology:normalcd£ (lower: 
—1000,upper:500,1:525,0:28.28) = 0.1883. There is a 
0.1883 probability that the total number of pages in 50 novels is 
fewer than 25,000. (b) Let X be the number of novels that have 
fewer than 400 pages. X is a binomial random variable with n = 50 
and p = 0.30. We want to find P(X = 20). Using technology: 
P(X = 20) =1-— PX = 19)=1 — binomcdf (trials:50, 
p:0.30, x value:19) = 0.0848. There is a 0.0848 probabil- 
ity of selecting at least 20 novels that have fewer than 400 pages. 
Note: Using the Normal approximation, P(X = 20) = 0.0614. 


Chapter 8 
Section 8.1 


Answers to Check Your Understanding 
page 485: 1. We are 95% confident that the interval from 2.84 to 
7.55 g captures the population standard deviation of the fat content 


= — 0.88 and 


of Brand X hot dogs. 2. If this sampling process were repeated 
many times, approximately 95% of the resulting confidence inter- 
vals would capture the population standard deviation of the fat con- 
tent of Brand X hot dogs. 3. False. Once the interval is calculated, 
it either contains o or it does not contain o. 


Answers to Odd-Numbered Section 8.1 Exercises 
8.1 Sample mean, x = 30.35. 


~ 36 
8.3 Sample proportion, p = =~ = 0.72. 


50 
8.5 (a) Approximately Normal with mean y= = 280 and standard 
60 
deviation ¢. = —— = 2.1. (b) See graph below. (c) About 95% 


840 
of the x values will be within 2 standard deviations of the mean. 
Therefore, m = 2(2.1) = 4.2. (d) About 95%. 


273.7 275.8 277.9 280.0 282.1 284.2 286.3 
x 

8.7 The sketch is given below. The interval with the value of x in 

the shaded region will contain the population mean (280), while 

the interval with the value of x outside the shaded region will not 

contain the population mean (280). 


273.7 275.8 277.9 280.0 282.1 284.2 286.3 


8.9 (a) We are 95% confident that the interval from 0.63 to 0.69 
captures the true proportion of those who favor an amendment to 
the Constitution that would permit organized prayer in public 
: .63 + 0.69 
schools. (b) Point estimate = p = = 5 

of error = 0.69 — 0.66 = 0.03. (c) Because the value 2/3 = 0.667 


(and values less than 2/3) are in the interval of plausible values, 


= 0.66 and margin 


there is not convincing evidence that more than two-thirds of U.S. 
adults favor such an amendment. 

8.11 Because only 84% of the intervals actually contained the true 
parameter, these were probably 80% or 90% confidence intervals. 
§.13 Answers will vary. One practical difficulty is response bias: 
people might answer “yes” because they think they should, even if 
they don’t really support the amendment. 

8.15 Interval: We are 95% confident that the interval from 10.9 to 
26.5 captures the true difference (girls — boys) in the mean num- 
ber of pairs of shoes owned by girls and boys. Level: If this sampling 
process were repeated many times, approximately 95% of the result- 
ing confidence intervals would capture the true difference (girls — 
boys) in the mean number of pairs of shoes owned by girls and boys. 
8.17 Yes. Because the interval does not include 0 as a plausible 
value, there is convincing evidence of a difference in the mean 
number of shoes for boys and girls. 

8.19 (a) Incorrect. The interval provides plausible values for the 
mean BMI of all women, not plausible values for individual BMI 
measurements. (b) Incorrect. We shouldn’t use the results of one 
sample to predict the results for future samples. (c) Correct. A con- 
fidence interval provides an interval of plausible values for a param- 
eter. (d) Incorrect. The population mean doesn’t change and will 
either be a value between 26.2 and 27.4 100% of the time or 0% of 
the time. (e) Incorrect. We are 95% confident that the population 
mean is between 26.2 and 27.4, but that does not absolutely rule 
out any other possibility. 

8.21 b 

8.23 e 

8.25 (a) Observational study, because there was no treatment 
imposed on the pregnant women or the children. (b) No. We can- 
not make any conclusions about cause and effect because this was 
not an experiment. 


Section 8.2 


Answers to Check Your Understanding 

page 496: 1. Random: not met because this was a convenience 
sample. 10%: met because the sample of 100 is less than 10% of the 
population at a large high school. Large Counts: met because 17 
successes and 83 failures are both at least 10. 2. Random: met 
because the inspector chose an SRS of bags. 10%: met because the 
sample of 25 is less than 10% of the thousands of bags filled in an 
hour. Large Counts: not met because there were only 3 successes, 
which is less than 10. 

page #99: 1. p = the true proportion of all U.S. college students 
who are classified as frequent binge drinkers. 2. Random: met 
because the statement says that the students were chosen randomly. 
10%: met because the sample of 10,904 is less than 10% of all U.S. 
college students. Large Counts: met because 2486 successes and 


a = 0.005 and the 


8418 failures are both at least 10. 3. 


Solutions S-35 


closest area in ‘Table A is 0.0051 (or 0.0049), corresponding to a 
critical value of z*=2.57 (or 2.58). Using technology: 
invNorm(area:0.005, w:0, o:1) = —2.576, so z*= 


+ “—=()? + = 


(0.218, 0.238). 4. We are 99% confident that the interval from 
0.218 and 0.238 captures the true proportion of all U.S. college 
students who are classified as frequent binge drinkers. 


/0.80(0.20 
page 503: 1. Solving 1.96 ORY gs for n_ gives 
n 


n = 682.95. We should select a sample of at least 683 custom- 
ers. 2. The required sample size will be larger because the critical 
value is larger for 99% confidence (2.576) versus 95% confidence 
(1.96). The company would need to select at least 1180 
customers. 


Answers to Odd-Numbered Section 8.2 Exercises 
8.27 Random: met because Latoya selected an SRS of students. 
10%: not met because the sample size (50) is more than 10% of the 
population of seniors in the dormitory (175). Large Counts: met 
because np = 14 = 10 and n(1 — f) = 36 = 10. 
§.29 Random: may not be met because we do not know if the 
people who were contacted were a random sample. 10%: met 
because the sample size (2673) is less than 10% of the population 
of adult heterosexuals. Large Counts: not met because np = 
2673(0.002) = 5 is not at least 10. 

1 — 0.98 
8.31 5 


= 0.01, and the closest area is 0.0099, correspond- 


ing to a critical value of z* = 2.33. Using technology: invNorm 
(area:0.01, :0, o:1) = —2.326,soz* = 2.326. 

8.33 (a) Population: seniors at ‘Tonya’s high school. Parameter: 
true proportion of all seniors who plan to attend the prom. 
(b) Random: the sample is a simple random sample. 10%: The 
sample size (50) is less than 10% of the population size (750). Large 
Counts: np = 36 = 10 and n(1 — f) = 14 = 10. 


/0.72(0.2 
(c) 0.72 = 1.645 eed = 0.72 + 0.10 = (0.62, 0.82). 


(d) We are 90% confident that the interval from 0.62 to 0.82 cap- 
tures the true proportion of all seniors at Tonya’s high school who 
plan to attend the prom. 

8.35 (a) S: p = the true proportion of all full-time U.S. college 
students who are binge drinkers. P: One-sample z interval for p. 
Random: the students were selected randomly. 10%: the sample 
size (5914) is less than 10% of the population of all college stu- 
dents. Large Counts: np = 2312 = 10 and n(1 — f) = 3602 = 

10. D: (0.375, 0.407). C: We are 99% confident that the interval 
from 0.375 to 0.407 captures the true proportion of full-time U.S. 
college students who are binge drinkers. (b) Because the value 
0.45 does not appear in our 99% confidence interval, it isn’t plau- 
sible that 45% of full-time U.S. college students are binge 
drinkers. 

8.37 Answers will vary. Response bias is one possibility. 

8.39 S: p = the true proportion of all students retaking the SAT 
who receive coaching. P: One-sample z interval for p. Random: the 
students were selected randomly. 10%: the sample size (3160) is less 


S-36 Solutions 


than 10% of the population of all students taking the SAT twice. 
Large Counts: np =427210 and n(1- p) = 2733 = 10. 
D: (0.119, 0.151). C: We are 99% confident that the interval from 
0.119 to 0.151 captures the true proportion of students retaking the 
SAT who receive coaching. 

8.41 (a) We do not know the sample sizes for the men and for the 
women. (b) The margin of error for women alone would be greater 
than 0.03 because the sample size for women alone is smaller than 
1019. 


/0. 2 
8.43 (a) Solving 1.645 yaa) = 0.04 gives n = 318. 
n 


0.5(0.5) 
n 


(b) Solving 1.645 = 0.04 gives n = 423. In this case, the 


sample size needed is 105 people larger. 


0.5(0.5 
8.45 Solving 1.9 i) = 0.03 gives n = 1068. 
n 


- /0.64(0.36 
8.47 (a) Solving 0.03 =z a ) gives z* = 2.00. ‘The con- 


fidence level is likely 95%, because 2.00 is very close to 1.96. 


(b) ‘Teens are hard to reach and often unwilling to participate in 
surveys, so nonresponse is a major “practical difficulty” for this type 


of poll. 
S49 a 
$.51 ¢ 
8.53 (a) A histogram of the number of accidents per hour is given 
below. 
7 
6 
B 5 
| 
g4 
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m2 
1 
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(b) A graph of the number of accidents is given below. 


35 


Number of accidents 


[—i—. a i—I—1 
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Hour of the day 


(c) The histogram in part (a) shows that the number of accidents 
has a distribution that is skewed to the right. (d) The graph in part 
(b) shows that there is a cyclical nature to the number of accidents. 


Section 8.3 


Answers to Check Your Understanding 

page 514: 1. df=21, t* = 2.189. Using technology: invT 
(area:0.02, df = 21) = —2.189,sot* = 2.189. 2. df= 70, 
t* = 2.660 (using df= 60). Using technology: invT (area: 
0.005, df = 70) = —2.648, so t* = 2.648. 

page 522: S: We are trying to estimate jz = the true mean healing 
rate at a 95% confidence level. P: One-sample ¢ interval for pu. 


Random: The description says that the newts were randomly cho- 
sen. 10%: The sample size (18) is less than 10% of the population 
of newts. Normal/Large Sample: The histogram below shows no 
strong skewness or outliers, so this condition is met. 


Frequency 
N 
i 


0 T i T T t T T 
10 15 20 25 30 35 40 


Healing rate (micrometers/hr) 


2 
D;: 25.67 + 20 5) = 25.67 * 4.14 = (21.53,29.81). C: We 


18 
are 95% confident that the interval from 21.53 to 29.81 micro- 
meters per hour captures the true mean healing time for newts. 


page 524: Using o = 154 and z* = 1.645 for 90% confidence, 


1.645(154)\2 
(Se) = 71.3, so take a sam- 


154 
30 = 1.645—. Thus, n = 
n 
ple of 72 students. 


Answers to Odd-Numbered Section 8.3 Exercises 

8.55 (a) t* = 2.262. (b) t* = 2.861. (c) t* = 1.671 (using technol- 
ogy: t* = 1.665). 

8.57 Because the sample size is small (n = 20 < 30) and there are 
outliers in the data. 

8.59 (a) No, because we are trying to estimate a population propor- 
tion, nota population mean. (b) No, because the 15 team members 
are not a random sample from the population. (c) No, because the 
sample size is small (n = 25 < 30) and there are outliers in the 
sample. 


9. 
8.61 SEZ= a= = 1.7898. If we take many samples of size 27, 


V27 
the sample mean blood pressure will typically vary by about 1.7898 
from the population mean blood pressure. 


8.63 (a) Because 19.03 = —— sy = 91.26 cm. (b) They are using 


V23 
a critical value of t* = 1. With df = 22, the area between t = 
—1 and t = 1 is approximately tcedf£ (lower: —1, 
df: 22) = 0.67. So, the confidence level is 67%. 
8.65 (a) S: = the true mean percent change in BMC for breast- 
feeding mothers. P: One-sample t interval forjz. Random: the moth- 
ers were randomly selected. 10%: 47 is less than 10% of all 
breast-feeding mothers. Normal/Large Sample: n = 47 = 30. D: 
Using df = 40, (—4.575, —2.605). Using technology: (—4.569, 
—2.605) with df = 46. C: We are 99% confident that the interval 
from —4.569 to —2.605 captures the true mean percent change in 
BMC for breast-feeding mothers. (b) Because all of the plausible 
values in the interval are negative (indicating bone loss), the data 
give convincing evidence that breast-feeding mothers lose bone 
mineral, on average. 

8.67 (a) S: = the true mean size of the muscle gap for the popu- 
lation of American and European young men. P: One-sample t 
interval for yx. Random: the young men were randomly selected. 
10%: 200 is less than 10% of young men in America and Europe. 
Normal/Large Sample: n = 200 = 30. D: Using df = 100, (1.999, 
2.701). Using technology: (2.001, 2.699) with df = 199. C: We are 


upper: 1, 


95% confident that the interval from 2.001 to 2.699 captures the 
true mean size of the muscle gap for the population of American 
and European young men. (b) The large sample size (n = 200 = 30) 
allows us to use a t interval for ju. 

8.69 S: = the true mean fuel efficiency for this vehicle. P: One- 
sample t interval for js. Random: the records were selected at ran- 
dom. 10%: it is reasonable to assume that 20 is less than 10% of all 
records for this vehicle. Normal/Large Sample: the histogram does 
not show any strong skewness or outliers. 


Frequency 


T T r T T 1 
13.5 15.55 175 19.5 21.5 23.5 
Mpg 


D: Using df = 19, (17.022, 19.938). C: We are 95% confident that 
the interval from 17.022 to 19.938 captures the true mean fuel ef- 
ficiency for this vehicle. 

8.71 (a) S: «= the true mean difference in the estimates from 
these two methods in the population of tires. P: One-sample t inter- 
val for jz. Random: A random sample of tires was selected. 10%: the 
sample size (16) is less than 10% of all tires. Normal/Large Sample: 
The histogram of differences shows no strong skewness or outliers. 


Frequency 
~) 


0 T T T T T T 
0 2 4 6 8 10 


Difference (weight — groove) 


D: Using df = 15, (2.837, 6.275). C: We are 95% confident that 
the interval from 2.837 to 6.275 thousands of miles captures the 
true mean difference in the estimates from these two methods in 
the population of tires. (b) Because 0 is not included in the confi- 
dence interval, there is convincing evidence of a difference in the 
two methods of estimating tire wear. 


8.73 Solving 2.576 — = | gives n = 374. 


n 

8.75 b 

8.77 b 

8.79 (a) Because the sum of the probabilities must be 1, P(X = 7) 
= 0.57. (b) uy = 5.44. If we were to randomly select many young 
people, the average number of days they watched television in the 
past 7 days would be about 5.44. (c) Because the sample size is large 
(n = 100 = 30), we expect the mean number of days x for 100 ran- 
domly selected young people (aged 19 to 25) to be approximately 
Normally distributed with mean po = ps = 5.44. Because the sam- 
ple size (100) is less than 10% of all young people aged 


La 


Wa V0 
4.96 — 5.44 _ 
0.214 


19 to 25, the standard deviation is 0; = 


We want to find P(x = 4.96). z= — 2.24 and 


Solutions S-37 


P(Z = —2.24) = 0.0125. Using technology:normalcdf£ (lower: 
—1000, upper:4.96, w:5.44, 0:0.214) = 0.0124. 
There is a 0.0124 probability of getting a sample mean of 4.96 or 
smaller. Because this probability is small, a sample mean of 4.96 or 
smaller would be surprising. 


Answers to Chapter 8 Review Exercises 
1 — 0.94 


R8.1 (a) = 0.03, and the closest area is 0.0301, corre- 


sponding to a critical value of z* = 1.88. Using technology: 
invNorm(area:0.03, p:0, o:1) = —1.881, so z* = 
1.881. (b) Using Table B and 50 degrees of freedom, t’ = 2.678. 
Using technology: invT (area:0.005, df£:57) = —2.665,so 


t* = 2.665. 

430 + 470 
R8.2 (a) x= ee 
— 450 = 20 minutes. Because n = 30, df = 29 and t* = 2.045. 


Sy Be 

Because 20 = 2.045——=, standard error = ——= = 9.780 minutes 
V30 V30 

and sy = 53.57 minutes. (b) The confidence interval provided 

gives an interval estimate for the mean lifetime of batteries pro- 


= 450 minutes. Margin of error = 470 


duced by this company, not individual lifetimes. (c) No. A confi- 
dence interval provides a statement about an unknown population 
mean, not another sample mean. (d) If we were to take many sam- 
ples of 30 batteries and compute 95% confidence intervals for the 
mean lifetime, about 95% of these intervals will capture the true 
mean lifetime of the batteries. 

R8.3 (a) p = the proportion of all adults aged 18 and older who 
would say that football is their favorite sport to watch on television. 
It may not equal 0.37 because the proportion who choose football 
will vary from sample to sample. (b) Random: The sample was ran- 
dom. 10%: The sample size (1000) is less than 10% of all 
adults. Large Counts: np = 370 = 10 and n(1 — f) = 630 = 10. 


(c) 0.37 + 1.96, / ee = (0.3401,0.3999). (d) We are 95% 


confident that the interval from 0.3401 to 0.3999 captures the true 
proportion of all adults who would say that football is their favorite 


sport to watch on television. 
R8.4 (a) «= mean IQ score for the 1000 students in the school. 
(b) Random: the data are from an SRS. 10%: the sample size 
(60) is less than 10% of the 1000 students at the school. 
Normal/Large Sample: n = 60 = 30. (c) Using df= 50,114.98 
14.8 
ote 1676( ==) = (111.778,118.182). Using technology: (111.79, 
60 
118.17) with df = 59. (d) We are 90% confident that the interval 
from 111.79 to 118.17 captures the true mean IQ score for the 1000 
students in the school. 


/0.5(0.5 
R8.5 Solving 2.576 P20) = 0.01 gives n = 16,590. 
n 


RS8.6 (a) S: p = the true proportion of all drivers who have run at 
least one red light in the last 10 intersections they have entered. P: 
One-sample z interval for p. Random: the drivers were selected at 
random. 10%: The sample size (880) is less than 10% of all drivers. 
Large Counts: np=171210 and n(1—f)=709= 10. 
D: (0.168,0.220). C: We are 95% confident that the interval from 
0.168 to 0.220 captures the true proportion of all drivers who have 
run at least one red light in the last 10 intersections they have 


S-38 Solutions 


entered. (b) It is likely that more than 171 respondents have run red 
lights because some people may lie and say they haven’t run a red 
light. The margin of error does not account for these sources of 
bias; it accounts only for sampling variability. 

R8.7 (a) S: = the true mean measurement of the critical dimen- 
sion for the engine crankshafts produced in one day. P: One-sample 
t interval for jz. Random: The data come from an SRS. 10%: the 
sample size (16) is less than 10% of all crankshafts produced in one 
day. Normal/Large Sample: the histogram shows no strong skew- 
ness or outliers. 


Frequency 


T T i] i T 
223.90 223.95 224.00 224.05 224.10 
Crankshaft measurement (mm) 


D: Using df = 15, (223.969, 224.035). C: We are 95% confident 
that the interval from 223.969 to 224.035 mm captures the true 
mean measurement of the critical dimension for engine crank- 
shafts produced on this day. (b) Because 224 is a plausible value 
in this interval, we don’t have convincing evidence that the process 
mean has drifted. 


R8.8 Solving 1 90 su 


") = 1000 gives n = 35. 
n 
R8.9 (a) The margin of error must get larger to increase the cap- 


ture rate of the intervals. (b) If we quadruple the sample size, the 
margin of error will decrease by a factor of 2. 

R8.10 (a) When we use the sample standard deviation s, to esti- 
mate the population standard deviation oc. (b) The ¢ distributions 
are wider than the standard Normal distribution and they have a 
slightly different shape with more area in the tails. (c) As the de- 
grees of freedom increase, the spread and shape of the t distribu- 
tions become morte like the standard Normal distribution. 


Answers to Chapter 8 AP® Statistics Practice Test 


T8.1 a 

T8.2 d 

T8.3 ¢ 

T8.4 d 

T8.5 b 

T8.6 a 

T8.7 c 

T8.8 d 

T8.9 e 

T8.10 d 

T8.11 (a) S: p = the true proportion of all visitors to Yellowstone 
who would say they favor the restrictions. P: One-sample z interval 
for p. Random: the visitors were selected randomly. 10%: the 
sample size (150) is less than 10% of all visitors to Yellowstone 
National Park. Large Counts: np = 89 = 10 and n(1 — p) = 
61 = 10. D: (0.490, 0.696). C: We are 99% confident that the 
interval from 0.490 to 0.696 captures the true proportion of all visi- 
tors who would say that they favor the restrictions. (b) Because there 
are values smaller than 0.50 in the confidence interval, the U.S. 


Forest Service cannot conclude that more than half of visitors to 
Yellowstone National Park favor the proposal. 
T8.12 (a) Because the sample size is large (n = 48 = 30), the 
Normal/Large Sample condition is met. (b) Maurice’s interval uses 
az critical value instead of a ¢ critical value. Also, Maurice used the 
wrong value in the square root—it should be n = 48. Correct: 
Using df= 40,6.208 + 2.001( 2722) = (5.457,6.959). Using 
V48 
technology: (5.46, 6.956) with df = 47. 
18.13 S: «2 = the true mean number of bacteria per milliliter of 
raw milk received at the factory. P: One-sample t interval for pu. 
Random: The data come from a random sample. 10%: the sample 
size (10) is less than 10% of all 1-ml specimens that arrive at the 
factory. Normal/Large Sample: the dotplot shows that there is no 
strong skewness or outliers. 


* T T *—* T a8 T T * * T T 
4560 4680 4800 4920 5040 5160 5280 5400 
Bacteria/ml 


D: Using df = 9, (4794.37,5105.63).C: We are 90% confident that 
the interval from 4794.37 to 5105.63 bacterial/ml captures the true 
mean number of bacteria in the milk received at this factory. 


Chapter 9 
Section 9.1 


Answers to Check Your Understanding 

page 541: 1. (a) p = proportion of all students at Jannie’s high 
school who get less than 8 hours of sleep at night. (b) Ho:p = 0.85 
and H,:p ¥ 0.85. 2. (a) 42 = true mean amount of time that it 
takes to complete the census form. (b) Ho: = 10 and H,: > 10. 

page 549: 1, Finding convincing evidence that the new batteries 
last longer than 30 hours on average, when in reality their true 
mean lifetime is 30 hours. 2. Not finding convincing evidence 
that the new batteries last longer than 30 hours on average, when in 
reality their true mean lifetime >30 hours. 3. Answers will vary. A 
consequence of a Type I error would be that the company spends 
the extra money to produce these new batteries when they aren’t 
any better than the older, cheaper type. A consequence of a Type II 
error would be that the company would not produce the new batter- 
ies, even though they were better. 


Answers to Odd-Numbered Section 9.1 Exercises 

9.1 Ho:w = 115; Hy: > 115, where p is the true mean score on 
the SSHA for all students at least 30 years of age at the teacher’s 
college. 

9.3 Ho:p = 0.12; H,:p 4 0.12, where p is the true proportion of 
lefties at his large community college. 

9.5 Ho: o = 3; H,: o > 3, where a is the true standard deviation of 
the temperature in the cabin. 

9.7 The null hypothesis is always that there is “no difference” or 
“no change” and the alternative hypothesis is what we suspect is 
true. Correct: Ho:p = 0.37; Hy:p > 0.37. 

9.9 Hypotheses are always about population parameters. Correct: 
Ho: = 1000 grams; H,:4 < 1000 grams. 

9.11 (a) The attitudes of older students do not differ from other 
students, on average. (b) Assuming the mean score on the SSHA for 
students at least 30 years of age at this school is really 115, there is a 
0.0101 probability of getting a sample mean of at least 125.7 just by 
chance in an SRS of 45 older students. 


9.13 a = 0.10: Because the P-value of 0.2184 > a = 0.10, we fail 
to reject Hp. We do not have convincing evidence that the propor- 
tion of left-handed students at Simon’s college is different from the 
national proportion. a=0.05: Because the P-value of 
0.2184 > a = 0.05, we fail to reject Hp. We do not have convinc- 
ing evidence that the proportion of left-handed students at Simon’s 
college is different from the national proportion. 

9.15 a=0.05: Because the P-value of 0.0101 < a= 0.05, we 
reject Hp. We have convincing evidence that the true mean 
score on the SSHA for all students at least 30 years of age at the 
teacher’s college >115. a=0.01: Because the P-value of 
0.0101 > a = 0.01, we fail to reject Hp. We do not have convinc- 
ing evidence that the true mean score on the SSHA for all students 
at least 30 years of age at the teacher’s college >115. 

9.17 Either Ho is true or Hp is false —it isn’t true some of the time 
and not true at other times. 

9.19 The P-value should be compared with a significance level 
(such as a = 0.05), not the hypothesized value of p. Also, the data 
never “prove” that a hypothesis is true, no matter how large or small 
the P-value. 

9.21 (a) Ho: = 6.7; Ha:p < 6.7, where ys represents the mean 
response time for all accidents involving life-threatening injuries in 
the city. (b) I: Finding convincing evidence that the mean response 
time has decreased when it really hasn’t. A consequence is that the 
city may not investigate other ways to reduce the mean response 
time and more people could die. II: Not finding convincing evi- 
dence that the mean response time has decreased when it really 
has. A consequence is that the city spends time and money investi- 
gating other methods to reduce the mean response time when they 
aren't necessary. (c) ‘Type I, because people may end up dying as a 
result. 

9.23 (a) Ho: = $85,000; Hy: > $85,000, where ys = the mean 
income of all residents near the restaurant. (b) I: Finding convinc- 
ing evidence that the mean income of all residents near the restau- 
rant exceeds $85,000 when in reality it does not. The consequence 
is that you will open your restaurant in a location where the resi- 
dents will not be able to support it. IH: Not finding convincing evi- 
dence that the mean income of all residents near the restaurant 
exceeds $85,000 when in reality it does. ‘The consequence of this 
error is that you will not open your restaurant in a location where 
the residents would have been able to support it and you lose poten- 
tial income. 

9.25 d 

O27 «© 

9.29 (a) P(woman) = 0.4168, so (24,611)(0.4168) = 10,258 
degrees were awarded to women. (b) No. P(woman) = 0.4168, 
which is not equal to P(woman | bachelors) = 0.43. 

(c) P(at least 1 of the 2 degrees earned by a woman) 

= | — P(neither degree is earned by a woman) = 


(Be) e) oe 
24,611/\24610/ 
Section 9.2 


Answers to Check Your Understanding 

page 560: S: Ho:p = 0.20 versus H,:p > 0.20, where p is the true 
proportion of all teens at the school who would say they have elec- 
tronically sent or posted sexually suggestive images of themselves. 
P: One-sample z test for p. Random: Random sample. 10%: 
The sample size (250) < 10% of the 2800 students. Large 


Solutions S-39 


Counts: 250(0.2)=50=10 and 250(0.8) = 200 = 10. D: 
252 — 0.2 
z= ee = 2.06 and P(Z = 2.06) = 0.0197. C: Because 
0.20(0.80) 


250 
the P-value of 0.0197 < a = 0.05, we reject Hp. We have convinc- 
ing evidence that more than 20% of the teens in her school would 
say they have electronically sent or posted sexually suggestive 
images of themselves. 
page 563: S: Ho:p = 0.75 versus H,:p # 0.75, where p is the true 
proportion of all restaurant employees at this chain who would say 
that work stress has a negative impact on their personal lives. P: 
One-sample z test for p. Random: Random sample. 10%: The sam- 
ple size (100) < 10% of all employees. 
100(0.75) = 75 = 10 and 100(0.25) = 25 = 10. 

— 0.68 — 0.75 _ 


/0.75(0.25) 
100 


Because the P-value of 0.1052 > a = 0.05, we fail to reject Hy. We 
do not have convincing evidence that the true proportion of all res- 


Large Counts: 


1.62 and 2P(Z =— 1.62) = 0.1052. C: 


taurant employees at this large restaurant chain who would say that 
work stress has a negative impact on their personal lives is different 
from 0.75. 

page 564: The confidence interval given in the output includes 
0.75, which means that 0.75 is a plausible value for the population 
proportion that we are seeking. So both the significance test (which 
didn’t rule out 0.75 as the proportion) and the confidence interval 
give the same conclusion. The confidence interval, however, gives 
a range of plausible values for the population proportion instead of 
only making a decision about a single value. 

page 569: 1. A'Type IL error. Ifa Type I error occurred, they would 
reject a good shipment of potatoes and have to wait to get a new 
delivery. However, if a ‘Type II error occurred, they would accept a 
bad batch and make potato chips with blemishes. ‘This might upset 
consumers and decrease sales. ‘To minimize the probability of a 
‘Type II error, choose a large significance level such as a = 0.10 
2. (a) Increase. Increasing a to 0.10 makes it easier to reject the 
null hypothesis, which increases power. (b) Decrease. Decreasing 
the sample size means we don’t have as much information to use 
when making the decision, which makes it less likely to correctly 
reject Hy. (c) Decrease. It is harder to detect a difference of 
0.02 (0.10 — 0.08) than a difference of 0.03 (0.11 — 0.08). 


Answers to Odd-Numbered Section 9.2 Exercises 

9.31 Random: Random sample. 10%: 'The sample size (60) < 10% 
of all students. Large 60 (0.80) =48 = 10 and 
60 (0.20) = 12 = 10. 

9.33 mpo = 10(0.5) = 5 and n(1 — po) = 10(0.5) =5 are both 
<10. 


Counts: 


_ 0.683 — 0.80 _ 


0.80(0.20) 


60 
Using technology: normalcdf£ (lower:—1000, upper: 


—2.27, w:0, 0:1) =0.0116. The graph is given below. 


9.35 (a)z 


2.27 (b) (Z = — 2.27) = 0.0116. 


S-40 Solutions 


NO, 1) 


2.27 0 


9.37 (a) P-value = 0.0143. 5%: P-value of 
0.0143 < a= 0.05, we reject Ho. There is convincing evidence 
that p > 0.5. 1%: Because the P-value of 0.0143 > a = 0.01, we 
fail to reject Ho. There is not convincing evidence that p > 0.5. 
(b) P-value = 0.0286. Because this P-value is still less than a = 0.05 
and greater than a = 0.01, we would again reject Hp at the 5% sig- 
nificance level and fail to reject Hp at the 1% significance level. 
9.39 S: Ho:p = 0.37 versus H,:p > 0.37, where p = true propor- 
tion of all students who are satisfied with the parking situation after 
the change. P: One-sample z test for p. Random: Random sample. 
10%: The sample size (200) < 10% of the population of size 2500. 
Large Counts: 200(0.37) = 74 = 10 and 200(0.63) = 126 = 10. 
D: z = 1.32, P-value = 0.0934. C: Because the P-value of 
0.0934 > a = 0.05, we fail to reject Hy. We do not have convinc- 
ing evidence that the true proportion of all students who are satis- 
fied with the parking situation after the change > 0.37. 

9.41 (a) S: Ho:p = 0.50 versus H,:p > 0.50, where p is the true 
proportion of boys among first-born children. P: One-sample z test 
for p. Random: Random sample. 10%: The sample size 
(25,468) < 10% of all _first-borns. Large Counts: 
25,468(0.50) = 12,734 = 10 and 25,468(0.50) = 12,734 = 10. 
D: z = 5.49, P-value~0. C: Because the P-value of approximately 
0 < a=0.05, we reject Hp. There is convincing evidence that 


Because the 


first-born children are more likely to be boys. (b) First-born chil- 
dren, because that is the group that we sampled from. 

9.43 Here are the corrections: H,: p > 0.75; p = the true propor- 
tion of middle school students who engage in bullying behavior; 
10%: the sample size (558) < 10% of the population of middle 
school students; mpp = 558(0.75) = 418.5 = 10 and n(1 — po) = 


558(0.25) = 139.5 = 10;z= PEE aUe 2. 
(0.75)(0.25) 


? 


558 
P-value = 0.0048. Because the P-value of 0.0048 < a = 0.05, we 
reject Hp. We have convincing evidence that more than three-quar- 
ters of middle school students engage in bullying behavior. 
9.45 S: Ho:p = 0.60 versus Hy:p ¥ 0.60, where p is the true pro- 
portion of teens who pass their driving test on the first attempt. 
P: One-sample z test for p. Random: Random sample. 10%: 
The sample size (125) < 10% of all teens. Large Counts: 
125(0.60) =75 210 and 125(0.40)=50210. D:z=2.01, 
P-value = 0.0444. C: Because our P-value of 0.0444 < a = 0.05, 
we reject Ho. There is convincing evidence that the true proportion 
of teens who pass the driving test on their first attempt is different 
from 0.60. 
9.47 (a) D: (0.607,0.769). C: We are 95% confident that the inter- 
val from 0.607 to 0.769 captures the true proportion of teens who 
pass the driving test on the first attempt. (b) Because 0.60 is not in 
the interval, we have convincing evidence that the true proportion 
of teens who pass the driving test on their first attempt is different 
from 0.60. 
9.49 No. Because the value 0.16 is included in the interval, we do 
not have convincing evidence that the true proportion of U.S. 
adults who would say they use ‘Twitter differs from 0.16. 


9.51 (a) p = the true proportion of U.S. teens aged 13 to 17 who 
think that young people should wait to have sex until marriage. 
(b) Random: Random sample. 10%: The sample size (439) < 10% 
of the population of all US. teens. Large Counts: 
439(0.5) = 219.5 = 10 and 439(0.5) = 219.5 = 10. (c) Assuming 
that the true proportion of U.S. teens aged 13 to 17 who think that 
young people should wait to have sex until marriage is 0.50, there is 
a 0.011 probability of getting a sample proportion that is at least as 
different from 0.5 as the proportion in the sample. (d) Yes. Because 
the P-value of 0.011 < a = 0.05, we reject Ho. There is convincing 
evidence that the true proportion of U.S. teens aged 13 to 17 who 
think that young people should wait to have sex until marriage dif- 
fers from 0.5. 

9.53 (a) I: Finding convincing evidence that more than 37% of 
students were satisfied with the new parking arrangement, when in 
reality only 37% were satisfied. Consequence: The principal believes 
that students are satisfied and takes no further action. II: Failing to 
find convincing evidence that more than 37% are satisfied with the 
new parking arrangement, when in reality more than 37% are satis- 
fied. Consequence: ‘The principal takes further action on parking 
when none is needed. (b) If the true proportion of students that are 
satisfied with the new arrangement is really 0.45, there is a 0.75 
probability that the survey provides convincing evidence that the 
true proportion > 0.37. (c) Increase the sample size or signifi- 
cance level. 

9.55 P(Type I) = a = 0.05 and P(T'ype II) = 0.22. 

9.57 (a) If the true proportion of Alzheimer’s patients who would 
experience nausea is really 0.08, there is a 0.29 probability that the 
results of the study would provide convincing evidence that the true 
proportion < 0.10. (b) Increase the number of measurements 
taken (m) to get more information. (c) Decrease. If @ is smaller, it 
becomes harder to reject the null hypothesis. ‘This makes it harder 
to correctly reject Ho. (d) Increase. Because 0.07 is further from the 
null hypothesis value of 0.10, it will be easier to detect a difference 
between the null value and actual value. 

9.59 ic 

9.61 b 

9.63 (a)X — Y has a Normal distribution with mean pyx_y = — 0.2 


and standard deviation ox_y = V0.1)? + (0.05)? = 0.112. To fit 
in a case, X — Y must take on a negative number. (b) We want 
to find PX—Y<0) using the N(— 0.2,0.112) distribution. 
0=— C= 02 

z= a ene 1.79 and P(Z < 1.79) = 0.9633. Using tech- 
nology: 0.9629. There is a 0.9629 probability that a randomly selected 
CD will fit in a randomly selected case. (c) P(all fit) = 
(0.9629)! = 0.0228. There is a 0.0228 probability that all 100 
CDs will fit in their cases. 


Section 9.3 


Answers to Check Your Understanding 

page 579: 1. Ho: = 320 versus Hy: ~ 320, where p= 
the true mean amount of active ingredient (in milligrams) in 
Aspro tablets from this batch of production. 2. Random: Random 
sample. 10%: The sample of size 36 < 10% of the population of 
all tablets in this batch. Normal/Large Sample: n = 36 = 30. 

319 — 320 
3/\/36 

and df = 30, the tail area is between 0.025 and 0.05. Thus, the 
P-value for the two-sided test is between 0.05 and 0.10. Using 
technology: 2tcdf (lower:—1000,upper:—2,d£:35) = 


—2 4. For this test, df = 35. Using Table B 


2(0.0267) = 0.0534. Because the P-value of 0.0534 > a = 0.05, we 
fail to reject Hp. There is not convincing evidence that the true 
mean amount of the active ingredient in Aspro tablets from this 
batch of production differs from 320 mg. 

page 583: 1. S: Ho: = 8 versus Hy: < 8, where pu is the true 
mean amount of sleep that students at the professor’s school get 
each night. P: One-sample t test for 44. Random: Random sample. 
10%: The sample size (28) < 10% of the population of students. 
Normal/Large Sample: The histogram below indicates that there is 
not much skewness and no outliers. 


N=) 


Frequency 


SCRPRNWAUDNIC 


T T T T T T T 
3.0 45 6.0 7.5 9.0 10.5 12.0 


Sleep (hours) 


D: x = 6.643 and s, = 1.981. t= —3.625 and the P-value is be- 
tween 0.0005 and 0.001. Using technology: P-value = 0.0006. 
C: Because our P-value of 0.0006 < a = 0.05, we reject Hy. There 
is convincing evidence that students at this university get less than 
8 hours of sleep, on average. 

page 586: 1. S: Ho: = 128 versus H,: # 128, where po is the 
true mean systolic blood pressure for the company’s middle-aged 
male employees. P: One-sample ¢ test for 4. Random: Random 
sample. 10%: ‘The sample size (72) < 10% of the population of 
middle-aged male employees. Normal/Large Sample: n = 72 = 30. 
D:t=1.10 and P-value = 0.275.C: Because our P-value of 
0.275 > a= 0.05, we fail to reject Hy. There is not convincing 
evidence that the mean systolic blood pressure for this company’s 
middle-aged male employees differs from the national average of 
128. 2. We are 95% confident that the interval from 126.43 to 
133.43 captures the true mean systolic blood pressure for the com- 
pany’s middle-aged male employees. ‘The value of 128 is in this 
interval and therefore is a plausible mean systolic blood pressure for 
the males 35 to 44 years of age. 

page 589: S: Ho:ug = 0 versus Hy:ug > 0, where jug is the true 
mean difference (air — nitrogen) in pressure lost. P: Paired t test 
for jig. Random: ‘Treatments were assigned at random to each pair 
of tires. Normal/Large Sample: n = 31 = 30. D: x = 1.252 and 
s, = 1.202. t= 5.80 and P-value ~ 0. C: Because the P-value of 
approximately 0 < a = 0.05, we reject Hp. We have convincing 
evidence that the true mean difference in pressure (air — nitrogen) 
> 0. In other words, we have convincing evidence that tires lose 
less pressure when filled with nitrogen than when filled with air, 
on average. 


Answers to Odd-Numbered Section 9.3 Exercises 

9.65 Random: Random sample. 10%: 'The sample size (45) < 10% 
of the population size of 1000. Normal/Large 
n=45 = 30. 

9.67 The Random condition may not be met, because we don’t 
know if this isa random sample of the atmosphere in the Cretaceous 
era. Also, the Normal/Large Sample condition is not met. The 
sample size < 30 and the histogram below shows that the data are 
strongly skewed to the left. 


Sample: 
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9.69 (a)t= "29.8/Vi5_ = 2.409. (b) For this test, df = 44. Using 


Table B and df= 40, we have 0.01 < P-value < 0.02. Using 
technology: tcdf(lower:2.409, upper:1000, df:44) 
= 0.0101. 
9.71 (a) Using Table B and df= 19, we have 0.025 < P-value 
< 0.05. Using technology: P-value = 0.043.5%: Because the 
P-value of 0.043 < a = 0.05, we reject Ho. There is convincing 
evidence that pp < 5. 1%: Because the P-value of 0.043 > a = 0.01, 
we fail to reject Hp. ‘There is not convincing evidence that pw < 5. 
(b) Using technology: P-value = 0.086. 5%: Because the P-value of 
0.086 > a = 0.05, we fail to reject Hp. There is not convincing 
evidence that js # 5. 1%: same as part (a). 
9.73 (a) S: Ho: pw = 25 versus Hy: ps > 25, where yu is the true mean 
speed of all drivers in a construction zone. P: One-sample t test for 
je. Random: Random sample. 10%: ‘The sample size (10) < 10% of 
all drivers. Normal/Large Sample: There is no strong skewness or 


outliers in the sample. 


20 22 24 26 28 30 32 34 36 


D: x = 28.8 and s, = 3.94. t= 3.05,df£=9, and the P-value is 
between 0.005 and 0.01 (0.0069). C: Because the P-value of 
0.0069 < a = 0.05, we reject Hy. We have convincing evidence 


that the true mean speed of all drivers in the construction zone 
> 25 mph. (b) Because we rejected Hp, it is possible we made a 
Type I error—finding convincing evidence that the true mean 
speed > 25 mph when it really isn’t. 
9.75 (a) S: Ho: = 1200 versus H,:~ < 1200, where yp is the true 
mean daily calcium intake of women 18 to 24 years of age. P: One- 
sample t test for 4. Random: Random sample. 10%: The sample 
size (36) < 10% of all women aged 18 to 24. Normal/Large 
Sample: n = 36 = 30. D:t = —6.73 and P-value = 0.000. C: 
Because the P-value of approximately 0 < a = 0.05, we reject Hp. 
There is convincing evidence that women aged 18 to 24 are getting 
less than 1200 mg of calcium daily, on average. (b) Assuming that 
women aged 18 to 24 get 1200 mg of calcium per day, on average, 
there is about a 0 probability that we would observe a sample mean 
= 856.2 mg by chance alone. 
9.77 S: Ho: = 11.5 versus Hy: # 11.5, where p is the true mean 
hardness of the tablets. P: One-sample ¢ test for jz. Random: The 
tablets were selected randomly. 10%: The sample size (20) < 10% 
of all tablets in the batch. Normal/Large Sample: There is no strong 


skewness or outliers in the sample. 
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D: x= 11.5164 and s,= 0.0950. t=0.77, df= 19, and the 
P-value is between 0.40 and 0.50 (0.4494). C: Because our P-value 
of 0.4494 > a = 0.05, we fail to reject Hp. We do not have con- 
vincing evidence that the true mean hardness of these tablets is 
different from 11.5. 

9.79 D: With df = 19, (11.472,11.561). C: We are 95% confident 
that the interval from 11.472 to 11.561 captures the true mean 
hardness measurement for this type of pill. The confidence interval 
gives 11.5 as a plausible value for the true mean hardness 1, but it 
gives other plausible values as well. 

9.81 S: Ho: = 200 versus H,:~4 # 200, where p is the true mean 
response time of European servers. P: One-sample t interval to help 
us perform a two-sided test for jz. Random: The servers were 
selected randomly. 10%: The sample size (14) < 10% of all servers 
in Europe. Normal/Large Sample: The sample size is small, but a 
graph of the data reveals no strong skewness or outliers. 
D: (158.22, 189.64). C: Because our 95% confidence interval does 
not contain 200 milliseconds, we reject Hp at the a = 0.05 signifi- 
cance level. We have convincing evidence that the mean response 
time of European servers is different from 200 milliseconds. 

9.83 (a) Yes. Because the P-value of 0.06 > a = 0.05, we fail to 
reject Ho: 4. = 10 at the 5% level of significance. Thus, the 95% 
confidence interval will include 10. (b) No. Because the P-value of 
0.06 < a = 0.10, we reject Ho: = 10 at the 10% level of signifi- 
cance. Thus, the 90% confidence interval would not include 10 as 
a plausible value. 

9.85 (a) If all the subjects used the right thread first and they were 
tired when they used the left thread, then we wouldn’t know if the 
difference in times was because of tiredness or because of the direc- 
tion of the thread. (b) S: Ho: ug = 0 versus H,: tg > 0, where pug is 
the true mean difference (left — right) in the time (in seconds) it 
takes to turn the knob with the left-hand thread and the right-hand 
thread. P: Paired t test for jug. Random: The order of treatments was 
determined at random. Normal/Large Sample: There is no strong 
skewness or outliers. 
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D:x = 13.32 and s, = 22.94. t = 2.903, df = 24, and the P-value 
is between 0.0025 and 0.005 (0.0039). C: Because the P-value of 
0.0039 < a= 0.05, we reject Hj. We have convincing evidence 
that the true mean difference (left — right) in time it takes to turn 
the knob >0. 


9.87 (a) Ho:tua = 9 versus Hy:ug > 0, where pug is the true mean 
difference in tomato yield (A — B). (b) df = 9. (c) Interpretation: 
Assuming that the average yield for both varieties is the same, there is 
a 0.1138 probability of getting a mean difference as large or larger 
than the one observed in this experiment. Conclusion: Because the 
P-value of 0.1138 > a = 0.05, we fail to reject Hy. We do not have 
convincing evidence that the true mean difference in tomato yield 
(A— B) > 0. (d) I: Finding convincing evidence that Variety A 
tomato plants have a greater mean yield, when in reality there is no 
difference. II: Not finding convincing evidence that Variety A tomato 
plants have a higher mean yield, when in reality Variety A does have 
a greater mean yield. ‘They might have made a Type II error. 

9.89 Increase the significance level a or increase the sample 
size n. 

9.91 When the sample size is very large, rejecting the null hypoth- 
esis is very likely, even if the actual parameter is only slightly differ- 
ent from the hypothesized value. 

9.93 (a) No, in a sample of size n = 500, we expect to see about 
(500)(0.01) = 5 people who do better than random guessing, with 
a significance level of 0.01. (b) The researcher should repeat the 
procedure on these four to see if they again perform well. 

9.95 b 

9.97 d 

999° 'c 

9.101 a 

9.103 (a) Not included. The margin of error does not account for 
undercoverage. (b) Not included. The margin of error does not 
account for nonresponse. (c) Included. The margin of error is cal- 
culated to account for sampling variability. 


Answers to Chapter 9 Review Exercises 


R9.1 (a) Ho: w = 64.2; Hy: pp ¥ 64.2, where ys = the true mean 
height of this year’s female graduates from the local high school. 
(b) Ho: p = 0.75; Ha: p < 0.75, where p = the true proportion of 
all students at Mr. Starnes’s school who completed their math 
homework last night. 

R9.2 Random: Random sample. 10%: The sample size (24) < 10% 
of the population of adults. Normal/Large Sample: The histogram 
below shows that the distribution is roughly symmetric with no 
outliers. 
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R9.3 (a) Ho: w = 300 versus Hy: pp < 300, where ys = the true 
mean breaking strength of these chairs. (b) I: Finding convincing 
evidence that the mean breaking strength <300 pounds, when in 
reality it is 300 pounds or higher. Consequence: falsely accusing the 
company of lying. II: Not finding convincing evidence that the 
mean breaking strength <300 pounds, when in reality it <300 
pounds. Consequence: allowing the company to continue to sell 
chairs that don’t work as well as advertised. (c) Because a ‘Type II 
error is more serious, increase the probability of a Type I error by 
using a= 0.10. (d) If the true mean breaking strength is 294 
pounds, there is a 0.71 probability that we will find convincing 


evidence that the true mean breaking strength < 300 pounds. 
(e) Increase the sample size or increase the significance level. 
R9.4 (a) S: Ho: p = 0.05 versus H,: p < 0.05, where p is the true 
proportion of adults who will get the flu after using the vaccine. P: 
One-sample z test for p. Random: Random sample. 10%: 
The sample size (1000) < 10% of the population of adults. Large 
Counts: 1000(0.05)=50=10 and  1000(0.95) = 950 = 10. 
D:z=—1.02 and P-value = 0.1539. C: Because the P-value of 
0.1539 > a = 0.05, we fail to reject Hp. We do not have convinc- 
ing evidence that fewer than 5% of adults who receive this vaccine 
will get the flu. (b) Because we failed to reject the null hypothesis, 
we could have made a Type II error—not finding convincing evi- 
dence that the true proportion of adults get the flu after using this 
vaccine <0.05, when in reality the true proportion <0.05. 
(c) Answers will vary. 
R9.5 (a) Assuming that the roulette wheel is fair, there is a 0.0384 
probability that we would get a sample proportion of reds at least 
this different from the expected proportion of reds (18/38) by 
chance alone. (b) Because the P-value of 0.0384 < a = 0.05, the 
results are statistically significant at the a = 0.05 level. ‘This means 
that we reject Hy and have convincing evidence that the true pro- 
portion of reds is different than p= 18/38. (c) Because 
18/38 = 0.474 is one of the plausible values in the interval, this 
interval does not provide convincing evidence that the wheel is 
unfair. It does not, however, prove that the wheel is fair as there are 
many other plausible values in the interval that are not equal to 
18/38. Also, the conclusion here is inconsistent with the conclusion 
in part (b) because the manager used a 99% confidence interval, 
which is equivalent to a test using a = 0.01. 
R9.6 (a) S: Ho: = 105 versus H,: # 105, where y is the true 
mean reading from radon detectors. P: One-sample t test for ju. 
Random: Random sample. 10%: ‘The sample size (11) < 10% ofall 
radon detectors. Normal/Large Sample: A graph of the data shows 
no strong skewness or outliers. D: t= — 0.06, df= 10, and 
and P-value > 0.50 (0.9513). C: Because the P-value of 
0.9513 > a = 0.10, we fail to reject Hp. We do not have convinc- 
ing evidence that the true mean reading from the radon detectors is 
different than 105. (b) Yes. Because 105 is in the interval from 
99.61 to 110.03, both the confidence interval and the significance 
test agree that 105 is a plausible value for the true mean reading 
from the radon detectors. 
R9.7 (a) The random condition can be satisfied by randomly allo- 
cating which plot got the regular barley seeds and which one got 
the kiln-dried seeds within each pair of adjacent plots. (b) S: 
Ho: fig = 0 versus Hy: fig < 0, where pug is the true mean difference 
(regular — kiln) in yield between regular barley seeds and kiln- 
dried barley seeds. P: Paired t test for fg. Random: Assumed. 
Normal/Large Sample: The histogram below shows no strong skew- 
ness or outliers. 

3.0 


Frequency 
NN 
[=] 


isd 
i) 


S 
o 


T T T T T 
-150 -100 -50 0 50 


Regular — Kiln (yield) 


D: x= — 33.7 and s,= 66.2. t= —1.690, df= 10, and the 
P-value is between 0.05 and 0.10 (0.0609). C: Because the P-value 


Solutions S-43 


of 0.0609 > a = 0.05, we fail to reject Hy. We do not have con- 
vincing evidence that the true mean difference (regular — kiln) in 
yield <0. 


Answers to Chapter 9 AP® Statistics Practice Test 


T9.1 b 
T9.2 e 
19.3 
19.4 
T9.5 
T9.6 
19.7 
T9.8 
19.9 
T9.10 c 

T9.11 (a) S: Ho: p = 0.20 versus Hy: p > 0.20, where p is the true 
proportion of customers who would pay $100 for the upgrade. P: 
One-sample z test for p. Random: Random sample. 10%: The sam- 
ple size (60) < 10% of this company’s customers. Large Counts: 
60(0.20) =12=10 and 60(0.8)=48=10. D:z=1.29, 
P-value = 0.0984. C: Because the P-value of 0.0984 > a = 0.05, 
we fail to reject Ho. We do not have convincing evidence that more 
than 20% of customers would pay $100 for the upgrade. (b) I: 
Finding convincing evidence that more than 20% of customers 
would pay for the upgrade, when in reality they would not. II: Not 
finding convincing evidence that more than 20% of customers 
would pay for the upgrade, when in reality more than 20% would. 


epaagdctceaoawoanon 


For the company, a ‘Type I error is worse because they would go 
ahead with the upgrade and lose money. (c) Increase the sample 
size or increase the significance level. 

19.12 (a) Students may improve from Monday to Wednesday just 
because they have already done the task once. Then we wouldn't 
know ifthe experience with the test or the caffeine is the cause of the 
difference in scores. A better way to run the experiment would be to 
randomly assign half the students to get | cup of coffee on Monday 
and the other half to get no coffee on Monday. Then have each per- 
son do the opposite treatment on Wednesday. (b) S: Ho: fig = 0 
versus Ha: fig< 0, where jig is the true mean difference 
(no coffee — coffee) in the number of words recalled without coffee 
and with coffee. P: Paired t test for zg. Random: The treatments were 
assigned at random. Normal/Large Sample: The histogram below 
shows a symmetric distribution with no outliers. 
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D:x = — 1 and s,= 0.816. t = —3.873, df = 9, and the P-value 
is between 0.001 and 0.0025 (0.0019). C: Because the P-value of 
0.0019 < a = 0.05, we reject Hp. We have convincing evidence 
that the mean difference (no coffee — coffee) in word recall < 0. 

T9.13 S: Ho: w = $158 versus H,: pp ¥ $158, where yu is the true 
mean amount spent on food by households in this city. P: One- 
sample t test for 4. Random: Random sample. 10%: The sample 
size (50) < 10% of households in this small city. Normal/Large 


S-44 Solutions 


Sample: n = 50 = 30. D: t = 2.47; using df = 40, the P-value is 
between 0.01 and 0.02 (using df = 49, 0.0168). C: Because the 
P-value of 0.0168 < a = 0.05, we reject Hy. We have convincing 
evidence that the true mean amount spent on food per household 
in this city is different from the national average of $158. 


Chapter 10 
Section 10.1 


Answers to Check Your Understanding 

page 619: S: p; = true proportion of teens who go online every day 
and p2 = true proportion of adults who go online every day. P: 'Two- 
sample z interval for p; — p2. Random: Independent random sam- 
ples. 10%: nj = 799 < 10% of teens and nz = 2253 < 10% of 
adults. Large Counts: 503, 296, 1532, and 721 are all = 10. D: 

0.63(0.37) | 0.68(0.32) _ 

(0.63 — 0.68) + 1.6454] 799 + 5953 
(—0.0824,—0.0176). C: We are 90% confident that the interval 
from —0.0824 to —0.0176 captures the true difference in the pro- 


portion of U.S. adults and teens who go online every day. 

page 628: S: Ho: p) — p2 = 0 versus H,:p; — p2 > 0, where pj, is 
the true proportion of children like the ones in the study who do not 
attend preschool that use social services later and pz is the true propor- 
tion of children like the ones in the study who attend preschool that 
use social services later. P: ‘Iwo-sample z test for p, — p2. Random: 
‘Two groups in a randomized experiment. Large Counts: 49, 12, 38, 


(0.8033 — 0.6129) —0 _ 
pe _ 9.7073(0.2927) 


24 are all = 10.D:z = 2.32 


61 62 
and P-value = 0.0102. C: Because the P-value of 0.0102 < a= 
0.05, we reject Hp. There is convincing evidence that the true pro- 
portion of children like the ones in the study who do not attend 
preschool that use social services later is greater than the true pro- 
portion of children like the ones in the study who attend preschool 
that use social services later. 


Answers to Odd-Numbered Section 10.1 Exercises 

10.1 (a) Approximately Normal because — 100(0.25) = 25, 
100(0.75) = 75, 100(0.35) = 35, and 100(0.65) = 65 are all at least 
10. (b) Mp, —p, = 0.25 — 0.35 = —0.10. (ec) ~~ Because 
n, = 100 < 10% of the first bag and n; = 100 < 10% of the sec- 


0.25(0.75) . 0.35(0.65 
ond bag, a5 -p =f a Ta) EO? 2 eee 


100 100 
10.3 (a) Approximately Normal because 50(0.30) = 15, 50(0.7) = 
35, 100(0.15) = 15, and 100(0.85)=85 are all at least 10. 
(b) pug, -p, = 9.30 — 0.15 = 0.15. (c) Because ng = 50 < 10% of 
the jelly beans in the Child mix and ng = 100 < 10% of the jelly 
beans in the Adult mix, o%, —,, = (202 —_ = 
0.0740. 


10.5 The data do not come from independent random samples or 


two groups in a randomized experiment. Also, there were less than 
10 successes (3) in the group from the west side of Woburn. 

10.7 There were less than 10 failures (0) in the treatment group, 
less than 10 successes (8) in the control group, and less than 10 
failures in the control group (4). 


0.26(1 — 0.26) _ 0.14(1 — 0.14) 
10.9 (a) SEs», r 316 537 
If we were to take many random samples of 316 young adults and 
532 older adults, the difference in the sample proportions of young 
adults and older adults who use ‘Twitter will typically be 0.0289 
from the true difference. (b) S: p; = true proportion of young adults 
who use ‘Twitter and p2 = true proportion of older adults who use 
‘Twitter. P: Two-sample <z interval for p; — pz. Random: ‘Two 
independent random samples. 10%: nj = 316 < 10% of all young 
adults and nz = 532 < 10% of all older adults. Large Counts: 82, 
234, 74, 458 are all at least 10. D: (0.072,0.168). C: We are 90% 
confident that the interval from 0.072 to 0.168 captures the true 
difference in the proportions of young adults and older adults who 


= 0.0289. 


use ‘Twitter. 

10.11 (a) S: p; = true proportion of young men who live in their 
parents’ home and 2 = true proportion of young women who 
live in their parents’ home. P: ‘Two-sample z interval for p; — po. 
Random: Reasonable to consider these independent random 
samples. 10%: ny = 2253 < 10% of the population of young men 
and nz = 2629 < 10% of the population of young women. Large 
Counts: 986, 1267, 923, 1706 are all at least 10. D: (0.051,0.123). 
C: We are 99% confident that the interval from 0.051 to 0.123 
captures the true difference in the proportions of young men and 
young women who live in their parents’ home. (b) Because the 
interval does not contain 0, there is convincing evidence that the 
true proportion of young men who live in their parents’ home is 
different from the true proportion of young women who live in 
their parents’ home. 

10.13 Ho: Pp) — p2=0 versus H,: Pp; — p2 #0, where p, is the 
true proportion of all teens who would say that they own an iPod or 
MP3 player and 3 is the true proportion of all young adults who 
would say that they own an iPod or MP3 player. 

10.15 P:'Two-sample z test for p; — pz. Random: Independent ran- 
dom samples. 10%: nj =800<10% of all teens and 
nz = 400 < 10% of all young adults. Large Counts: 632, 168, 268, 
and 132 are all at least 10. D:z=4.53 and P-value ~ 0. C: 
Because the P-value of close to 0 < a = 0.05, we reject Hp. There 
is convincing evidence that the true proportion of teens who would 
say that they own an iPod or MP3 player is different from the true 
proportion of young adults who would say that they own an iPod or 
MP3 player. 

10.17 D : (0.066,0.174). C : We are 95% confident that the inter- 
val from 0.066 to 0.174 captures the true difference in proportions 
of teens and young adults who own iPods or MP3 players. Because 
0 is not included in the interval, it is consistent with the results of 
Exercise 15. 

10.19 S: Ho: pi — p2 = 0 versus Hy: pi — p2 > 0, where py is the 
true proportion of 6- to 7-year-olds who would sort correctly and p2 
is the true proportion of 4- to 5-year-olds who would sort correctly. 
P: ‘Two-sample < test for p; — p2. Random: Independent random 
samples. 10%: nj =53 < 10% of all 6 to 7-year olds and 
nz = 50 < 10% ofall 4- to 5-year-olds. Large Counts: 28, 25, 10, 40 
are all = 10. D: z= 3.45 and P-value = 0.0003. C: Because the 
P-value of 0.0003 < a = 0.05, we reject Hp. We have convincing 
evidence that the true proportion of 6- to 7-year-olds who would sort 
correctly is greater than the true proportion of 4- to 5-year-olds who 
would sort correctly. 

10.21 (a) S: Ho: pa — pp = 0 versus H,: pa — pg > 0, where pag is 
the true proportion of students like these who would pass the driv- 
er’s license exam when taught by instructor A and pg is the true 


proportion of students like these who would pass the driver’s license 
exam when taught by instructor B. P: ‘Two-sample z test for p, — pp. 
Random: Two groups in a randomized experiment. Large Counts: 
30, 20, 22, 28 are all = 10. D: z = 1.60 and P-value = 0.0547. C: 
Because the P-value of 0.0547 > a = 0.05, we fail to reject Ho. 
There is not convincing evidence that the true proportion of stu- 
dents like these who would pass using instructor A is greater than 
the true proportion who would pass using instructor B. (b) I: 
Finding convincing evidence that instructor A is more effective 
than instructor B, when in reality the instructors are equally 
effective. II: Not finding convincing evidence that instructor A is 
better, when in reality instructor A is more effective. It is possible we 
made a ‘Type II error. 

10.23 (a) Two-sample z test for pj — pz. Random: ‘Two groups in a 
randomized experiment. Large Counts: 44, 44, 21, 60 are all = 10. 
(b) If no difference exists in the true pregnancy rates of women who 
are being prayed for and those who are not, there is a 0.0007 prob- 
ability of getting a difference in pregnancy rates as large or larger 
than the one observed in the experiment by chance alone. 
(c) Because the P-value of 0.0007 < a = 0.05, we reject Hy. There 
is convincing evidence that the pregnancy rates among women like 
these who are prayed for are higher than the pregnancy rates for 
those who are not prayed for. (d) Knowing they were being prayed 
for might have affected their behavior in some way that would have 
affected whether they became pregnant or not. Then we wouldn't 
know if it was the prayer or the other behaviors that caused the 
higher pregnancy rate. 

10.25 a 

10.27 c 

10.29 (a) y = —13,832 + 14,954x, where p = the predicted mile- 
age and x = the age in years of the cars. (b) For each year older 
the car is, the predicted mileage will increase by 14,954 miles. (c) 
Residual = —25,708. The student’s car had 25,708 fewer miles 
than expected, based on its age. 


Section 10.2 


Answers to Check Your Understanding 

page 644: S:p, = the true mean price of wheat in July and paz = 
the true mean price of wheat in September. P: ‘Two-sample t 
interval for ju; — 2. Random: Independent random samples. 10%: 
n, = 90 < 10% of all wheat producers in July and nz = 45 < 10% 
of all wheat producers in September. Normal/Large Sample: 
n, = 90 = 30 and nn, =45 = 30. D: Using df= 40, (—0.759, 
—0.561). Using df = 100.45, (—0.756, —0.564). C: We are 99% 
confident that the interval from —0.756 to —0.564 captures the 
true difference in mean wheat prices in July and September. 

page 649: S: Ho:j4; — W2 = 0 versus Hy:/4, — fur > 0, where py is 
the true mean breaking strength for polyester fabric buried for 2 
weeks and ju; is the true mean breaking strength for polyester fabric 
buried for 16 weeks. P: Two-sample t test. Random: ‘Two groups in 
a randomized experiment. Normal/Large Sample: The dotplots 
below show no strong skewness or outliers in either group. 
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D: x, = 123.8, s; = 4.60, x. = 116.4, s2 = 16.09. t= 0.989. Us- 
ing df = 4, the P-value is between 0.15 and 0.20. Using df = 4.65, 
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P-value = 0.1857. C: Because the P-value of 0.1857 > a = 0.05, 
we fail to reject Hy. We do not have convincing evidence that the 
true mean breaking strength of polyester fabric that is buried for 2 
weeks is greater than the true mean breaking strength for polyester 
fabric that is buried for 16 weeks. 


Answers to Odd-Numbered Section 10.2 Exercises 

10.31 (a) Because the distributions of M and B are Normal, the 
distribution of xy; — xg is also Normal. (b) #xz,-z = 188 — 170 
= 18 mg/dl. (c) Because 25 < 10% of all 20- to 34-year-old 
males and 36< 10% of all 

a + = 9.60 mg/dl. 

10.33 Random: ‘Two independent random samples. 10%: 
20 < 10% ofall males at the school and 20 < 10% of all females at 
the school. Normal/Large Sample: not met because there are fewer 


I4-year-old _ boys, oe 


x, 


than 30 observations in each group and the stemplot for Males 
shows several outliers. 

10.35 Random: not met because these data are not from two inde- 
pendent random samples. Knowing the literacy percent for females 
in a country helps us predict the literacy percent for males in that 
country. 10%: not met because 24 is more than 10% of Islamic 
countries. Normal/Large Sample: not met because the samples 
sizes are both small and both distributions are skewed to the left and 
have an outlier (see boxplots below). 
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10.37 (a) The distributions of percent change are both slightly 
skewed to the left. People drinking red wine generally have more 
polyphenols in their blood, on average. The distribution of percent 
change for the white wine drinkers is a little bit more variable. 
(b) S: y; = the true mean change in polyphenol level in the blood 
of people like those in the study who drink red wine and pz = the 
true mean polyphenol level in the blood of people like those in the 
study who drink white wine. P: ‘Two-sample t interval for 4 — /12. 
Random: ‘Two groups in a randomized experiment. Normal/Large 
Sample: The dotplots given in the problem do not show strong 
skewness or outliers. D: x; = 5.5, 8) = 2.517, x2 = 0.23, s2 = 3.292. 
Using df = 8, (2.701, 7.839). Using df = 14.97, (2.845, 7.689). C: 
We are 90% confident that the interval from 2.845 to 7.689 cap- 
tures the true difference in mean change in polyphenol level for 
men like these who drink red wine and men like these who drink 
white wine. (c) Because all of the plausible values in the interval are 
positive, this interval supports the researcher’s belief that red wine 
is more effective than white wine. 

10.39 (a) Earnings amounts cannot be negative, yet the standard 
deviation is almost as large as the distance between the mean 
and 0. However, the sample sizes are both very large 
(675 = 30and621 = 30). (b) S: j; = the true mean summer earn- 
ings of male students and ju2 = the true mean summer earnings of 
female students. P: ‘Two-sample ¢ interval for ju) — 2. Random: 
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Reasonable to consider these independent random samples. 10%: 
ny = 675 < 10% of male students at a large university and 
nz = 621 <10% of female students at a large university. 
Normal/Large Sample: nj, = 675 = 30 and n= 621 = 30. 
D: Using df = 100, (412.68, 635.58). Using df = 1249.21, 
(413.62, 634.64). C: We are 90% confident that the interval from 
$413.62 to $634.64 captures the true difference in mean summer 
earnings of male students and female students at this large univer- 
sity. (c) If we took many random samples of 675 males and 621 
females from this university and each time constructed a 90% con- 
fidence interval in this same way, about 90% of the resulting 
intervals would capture the true difference in mean earnings for 
males and females. 

10.41 (a) S: Ho:~) — 2 = 0 versus Hg:f4) — paz < 0, where 1) is 
the true mean time to breeding for the birds relying on natural food 
supply and pz is the true mean time to breeding for birds with food 
supplementation. P: ‘Two-sample t test. Random: Two groups ina 
randomized experiment. Normal/Large Sample: Neither distribu- 
tion displays strong skewness or outliers. 
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D: x, = 4.0, 8; = 3.11, x2 = 11.3, s. = 3.93. t= —3.74. Using 
df = 5, the P-value is between 0.005 and 0.01. Using df = 10.95, 
P-value = 0.0016. C: Because the P-value of 0.0016 < a = 0.05, 
we reject Hy. We have convincing evidence that the true mean 
time to breeding is less for birds relying on natural food supply 
than for birds with food supplements. (b) Assuming that the true 
mean time to breeding is the same for birds relying on natural 
food supply and birds with food supplements, there is a 0.0016 
probability that we would observe a difference in sample means of 
—7.3 or smaller by chance alone. 

10.43 S: Ho: 4, — pr = 0 versus Hg: (4) — pr ¥ 0, where 1; is the 
true mean number of words spoken per day by female students and 
}lz is the true mean number of words spoken per day by male stu- 
dents. P: Two-sample t test. Random: Independent random sam- 
ples. 10%: nj = 56 < 10% of females at a large university and 
nz =56 < 10% of males at a large university. Normal/Large 
Sample: nj = 56 = 30 and nz = 56 = 30. D: t = —0.248. Using 
df = 50, P-value > 0.50. Using df = 106.20, P-value = 0.8043. C: 
Because the P-value of 0.8043 > a = 0.05, we fail to reject Hp. We 
do not have convincing evidence that the true mean number of 
words spoken per day by female students is different than the true 
mean number of words spoken per day by male students at this 
university. 

10.45 (a) The distribution for the activities group is slightly skewed 
to the left, while the distribution for the control group is slightly 
skewed to the right. The center of the activities group is higher than 
the center of the control group. The scores in the activities group 
are less variable than the scores in the control group. (b) S: 
Ho: p41 — f2 = 0 versus Hy: (4) — pz > 0, where j1 is the true mean 
DRP score for third-grade students like the ones in the experiment 
who do the activities and jz2 is the true mean DRP score for third- 
grade students like the ones in the experiment who don’t do the 
activities. P: ‘T'wo-sample t test. Random: Two groups in a random- 
ized experiment. Normal/Large Sample: No strong skewness or 
outliers in either boxplot. D: t = 2.311. Using df = 20, the P-value 


is between 0.01 and 0.02. Using df = 37.86, P-value = 0.0132. C: 
Because the P-value of 0.0132 < a = 0.05, we reject Ho. We have 
convincing evidence that the true mean DRP score for third-grade 
students like the ones in the experiment who do the activities is 
greater than the true mean DRP score for third-grade students like 
the ones in the experiment who don’t do the activities. (c) Because 
this was a randomized controlled experiment, we can conclude that 
the activities caused the increase in the mean DRP score. 
10.47 D: Using df= 50, (—3563, 2779). Using df= 106.2, 
(—3521, 2737). C: We are 95% confident that the interval from 
—3521 to 2737 captures the true difference between mean number 
of words spoken per day by female students and the mean number of 
words spoken per day by male students. This interval allows us to 
determine if 0 is a plausible value for the difference in means and also 
provides other plausible values for the difference in mean words spo- 
ken per day. 
10.49 (a) S: Ho: 4) — 2 = 10 versus Hy: 4) — 2 > 10, where py 
is the true mean cholesterol reduction for people like the ones in 
the study when using the new drug and ju; is the true mean choles- 
terol reduction for people like the ones in the study when using the 
current drug. P: Two-sample t test. Random: Two groups in a ran- 
domized experiment. Normal/Large Sample: No strong skewness 
or outliers. D: t = 0.982. Using df = 13, the P-value is between 
0.15 and 0.20. Using df = 26.96, P-value = 0.1675. C: Because the 
P-value of 0.1675 > a = 0.05, we fail to reject Hp. We do not have 
convincing evidence that the true mean cholesterol reduction is 
more than 10 mg/dl greater for the new drug than for the current 
drug. (b) Type II error. It is possible that the difference in mean 
cholesterol reduction is more than 10 mg/dl greater for the new 
drug than the current drug, but we didn’t find convincing evidence 
that it was. 
10.51 (a) The researchers randomly assigned the subjects to create 
two groups that were roughly equivalent at the beginning of the 
experiment. (b) Only about 5 out of the 1000 differences were 
= 4.15, Pvalue ~ 0.005. Because the P-value of 0.005 < a = 0.05, 
we have convincing evidence that the true mean rating for students 
like these that are provided with internal reasons is higher than the 
true mean rating for students like these that are provided with exter- 
nal reasons. (c) Because we found convincing evidence that the 
mean is higher for students with internal reasons when it is possible 
that there is no difference in the means, we could have made a 
‘Type I error. 
10.53 (a) ‘Two-sample. Two distinct groups of cars in a randomized 
experiment. (b) Paired. Both treatments are applied to each sub- 
ject. (c) Two-sample. ‘Two distinct groups of women. 
10.55 (a) Paired, because we have two scores for each student. 
(b) S: Ho: 4g = 0 versus H,: fg > 0, where jug is the true mean 
increase in SAT verbal scores of students who were coached. P: 
Paired ¢ test for wg. Random: Random sample. 10%: ng = 427 < 10% 
of students who are coached. Normal/Large Sample: 427 = 30. 
D: t = 10.16. Using df = 426, P-value ~ 0. C: Because the P-value 
of approximately 0 < a = 0.05, we reject Ho. There is convincing 
evidence that students who are coached increase their scores on the 
SAT verbal test, on average. 
10,57 a 
10.59 b 
10.61 (a) One-sample z interval for a proportion. (b) Paired t test 
for the mean difference. (c) ‘Two-sample z interval for the differ- 
ence in proportions. (d) Two-sample t test for a difference in means. 


10.63 (a) P(at least one mean outside interval) = 1 — P(neither 
mean outside interval) = 1—(0.95)* = 1—0.9025 = 0.0975. (b) Let 
X = the number of samples that must be taken to observe one fall- 
ing above z+ 20;. Then X is a geometric random variable with 
p = 0.025. P(X = 4) = (1 — 0.025)3(0.025) = 0.0232. (ec) Let X = 
the number of sample means out of 5 that fall outside this interval. 
X is a binomial random variable with n = 5 and p = 0.32. We 
want P(X = 4) = 1— P(X S 3)=1-—binomedf (trials:5, 
p:0.32,x value:3) =1-— 0.961 = 0.039. This is a reason- 
able criterion because when the process is under control, we would 
only get a “false alarm” about 4% of the time. 

10.65 (a) Perhaps the people who responded are prouder of their 


improvements and are more willing to share. This could lead to an 
overestimate of the true mean improvement. (b) This was an obser- 
vational study, not an experiment. The students (or their parents) 
chose whether or not to be coached; students who choose coaching 
might have other motivating factors that help them do better the 
second time. 


Answers to Chapter 10 Review Exercises 


R10.1 (a) Paired t test for the mean difference. (b) ‘Two-sample z 
interval for the difference in proportions. (c) One-sample ¢ interval 
for the mean. (d) ‘Two-sample t interval for the difference between 
two means. 


R102 SE [1 — 0.832) | 0.581(1 — 0.581) 
ote) SERA 220 | 117 

= 0.0521. If we were to take many random samples of 220 Hispanic 
female drivers in New York and 117 Hispanic female drivers in 
Boston, the difference in the sample proportions who wear seatbelts 
will typically be 0.0521 from the true difference in proportions of 
all Hispanic female drivers in New York and Boston who wear seat 


belts. (b) S: p; = proportion of all Hispanic female drivers in New 
York who wear seat belts and p2 = proportion of all Hispanic female 
drivers in Boston who wear seat belts. P: T'wo-sample z interval for 
pPi—p2. Random: Independent random samples. 10%: 
n, = 220 < 10% of all Hispanic female drivers in New York and 
nz = 117 < 10% of all Hispanic female drivers in Boston. Large 
Counts: 183, 37, 68, 49 are all = 10. D: (0.149, 0.353). C: We are 
95% confident that the interval from 0.149 to 0.353 captures the 
true difference in the proportions of Hispanic women drivers in 
New York and Boston who wear their seat belts. 

R10.3 (a) The women in the study were randomly assigned to one 
of the two treatments. (b) Because both groups are large 
(ng = 45 = 30 and ng = 45 = 30), the sampling distribution of 
Xc — xq should be approximately Normal. (c) Assuming no differ- 
ence exists in the true mean ratings of the product for women like 
these who read or don’t read the news story, there is less than a 0.01 
probability of observing a difference as large as or larger than 0.49 
by chance alone. 

R10.4 (a) S:) = the true mean NAEP quantitative skills test score 
for young men and py = the true mean NAEP quantitative skills 
test score for young women. P: ‘T'wo-sample ¢ interval for ju) — fu. 
Random: Reasonable to consider these independent random sam- 
ples. 10%: n, = 840 < 10% ofall young menand nz = 1077 < 10% 
of all young women. Normal/Large Sample: n; = 840 = 30 and 
n2 = 1077 = 30. -D: Using df= 100, = (—6.80,2.14). Using 
df = 1777.52, (—6.76,2.10). C: We are 90% confident that the 
interval from —6.76 to 2.10 captures the true difference in the 
mean NAEP quantitative skills test score for young men and the 
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mean NAEP quantitative skills test score for young women. 
(b) Because 0 is in the interval, we do not have convincing evidence 
of a difference in mean score for male and female young adults. 
R10.5 (a) S: Ho: pi — p2 = 0 versus Hy: pi — p2 < 0, where py is 
the true proportion of patients like these who take AZT and develop 
AIDS and > is the true proportion of patients like these who take 
placebo and develop AIDS. P: Two-sample z test for p, — po. 
Random: ‘Two groups in a randomized experiment. Large Counts: 
17, 418, 38, 397 are all = 10. D: z = —2.91, P-value = 0.0018. C: 
Because the P-value of 0.0018 < a = 0.05, we reject Hp. We have 
convincing evidence that taking AZT lowers the proportion of 
patients like these who develop AIDS compared to a placebo. (b) |: 
Finding convincing evidence that AZT lowers the risk of developing 
AIDS, when in reality it does not. Consequence: patients will pay for 
a drug that doesn’t help. II: Not finding convincing evidence that 
AZT lowers the risk of developing AIDS, when in reality it does. 
Consequence: patients won't take the drug when it could actually 
delay the onset of AIDS. It is possible that we made a ‘Type I error. 
R10.6 (a) The Large Counts condition is not met because there 
are only 7 failures in the control area. (b) The Normal/Large 
Sample condition is not met because both sample sizes are small 
and there are outliers in the male distribution. 

R10.7 (a) Even though each subject has two scores (before and 
after), the two groups of students are independent. (b) ‘The distribu- 
tion for the control group is slightly skewed to the right, while the 
distribution for the treatment group is roughly symmetric. The center 
for the treatment group is greater than the center for the control 
group. ‘The differences in the control group are more variable than 
the differences in the treatment group. (c) S: Ho: /4) — juz = 0 versus 
Hg: [41 — 2 > 0, where ju; = the true mean difference in test scores 
for students like these who get the treatment message and juz = the 
true mean difference in test scores for students like these who get the 
neutral message. P: ‘T'wo-sample ¢ test for ju; — 2. Random: ‘Two 
groups in a randomized experiment. Normal/Large Sample: Neither 
boxplot showed strong skewness or any outliers. D: Using the differ- 
ences, x; = 11.4, s; = 3.169, x) = 8.25, s) = 3.69. t = 1.91. Using 
df = 7, the P-value is between 0.025 and 0.05. Using df= 13.92, 
P-value = 0.0382. C: Because the P-value of 0.0382 < a = 0.05, 
we reject Hp. There is convincing evidence that the true mean differ- 
ence in test scores for students like these who get the treatment mes- 
sage is greater than the true mean difference in test scores for students 
like these who get the neutral message. (d) We cannot generalize to 
all students who failed the test because our sample was not a random 
sample of all students who failed the test. 


Answers to Chapter 10 AP® Statistics Practice Test 


T10.1 
T10.2 
T10.3 
T10.4 
T10.5 
T10.6 
TH 
T10.8 
T10.9 b 

T10.10 a 

T10.11 (a) S: 4) = the true mean hospital stay for patients like 
these who get heating blankets during surgery and juz = the true 
mean hospital stay for patients like these who have core tempera- 
tures reduced during surgery. P: ‘Two-sample t interval for 44) — [12. 
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Random: Two groups in a randomized experiment. Normal/Large 
Sample: n, = 104 = 30 and nz = 96 = 30. D: Using df= 80, 
(—4.17, —1.03). Using df= 165.12, (—4.16, —1.04). C: We are 
95% confident that the interval from —4.16 to —1.04 captures the 
true difference in mean length of hospital stay for patients like these 
who get heating blankets during surgery and those who have their 
core temperatures reduced during surgery. (b) Yes. Because 0 is not 
in the interval, we have convincing evidence that the true mean 
hospital stay for patients like these who get heating blankets during 
surgery is different than the true mean hospital stay for patients like 
these who have core temperatures reduced during surgery. (c) If we 
were to repeat this experiment many times and calculate 95% confi- 
dence intervals for the difference in means each time, about 95% of 
the intervals would capture the true difference in mean hospital stay 
for patients like these who get heating blankets during surgery and 
mean hospital stay for patients like these who have core tempera- 
tures reduced during surgery. 

T10.12 (a) S: Ho: p) — p2 = 0 versus H,: p; — p2 > 0, where py is 
the true proportion of cars that have the brake defect in last year’s 
model and 2 is the true proportion of cars that have the brake 
defect in this year’s model. P: Two-sample z test for p) — p2. 
Random: Independent random samples. 10%: nj = 100 < 10% of 
last year’s model and nz = 350 < 10% of this year’s model. Large 
Counts: 20, 80, 50, 300 areall = 10. D: z = 1.39, P-value = 0.0822. 
C: Because the P-value of 0.0822 > a = 0.05, we fail to reject Hp. 
We do not have convincing evidence that the true proportion of 
brake defects is smaller in this year’s model compared to last year’s 
model. (b) I: Finding convincing evidence that there is a smaller 
proportion of brake defects in this year’s car model, when in reality 
there is not. This might result in more accidents because people 
think that their brakes are safe. II: Not finding convincing evidence 
that there is a smaller proportion of brake defects in this year’s 
model, when in reality there is a smaller proportion. ‘This might 
result in reduced sales of this year’s model. 

T10.13 (a) Ho: 4) — pz = 0 versus Hg: 4) — 2 < 0, where py = 

the true mean rent for one-bedroom apartments in the area of her 
college campus and jz = the true mean rent for two-bedroom 
apartments in the area of her college campus. (b) Two-sample t test 
for {1; — f2. Random: Independent random samples. 10%: 
n, = 10 < 10% of all one-bedroom apartments in this area and 
nz = 10 < 10% of all two-bedroom apartments in this area. 
Normal/Large Sample: The dotplots below show no strong skew- 
ness or outliers in either distribution. 
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(c) Assuming the true mean rent of the two types of apartments is 
really the same, there is a 0.029 probability of getting an observed 
difference in mean rents as large as or larger than the one in this 
study. (d) Because the P-value of 0.029 < a = 0.05, Pat should re- 
ject Hp. She has convincing evidence that the true mean rent of 
two-bedroom apartments is greater than the true mean rent of one- 
bedroom apartments in the area of her college campus. 
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AP3.1 e 
AP3.2 e 
AP3.3 d 


AP3.4 c 
AP3.5 d 
AP3.6 d 
AP3.7 c 
AP3.8 a 
AP3.9 d 
AP3.10 

AP3.11 

AP3.12 

AP3.13 

AP3.14 

AP3.15 

AP3.16 

AP3.17 

AP3.18 

AP3.19 

AP3.20 

AP3.21 

AP3.22 

AP3.23 

AP3.24 

AP3.25 

AP3.26 

AP3.27 

AP3.28 

AP3.29 a 

AP3.30 b 

AP3.31 S: Ho: ug = 0 versus Hy: ug <0, where pig is the true 
mean change in weight (after — before) in pounds for people like 
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these who follow a five-week crash diet. P: Paired t test for jug. 
Random: Random sample. 10%: ng = 15 is less than 10% of all 
dieters. Normal/Large Sample: There is no strong skewness or 
outliers. 
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D:x = —3.6 and s,= 11.53. t=—1.21. Using df= 14, the 
P-value is between 0.10 and 0.15 (0.1232). C: Because the P-value 
of 0.1232 is greater than a = 0.05, we fail to reject Hy. We do not 
have convincing evidence that the true mean change in weight 
(after — before) for people like these who follow a five-week crash 
diet is less than 0. 

AP3.32 (a) Observational study. No treatments were imposed on 
the individuals in the study. (b) Ho: fp; —p2=0_ versus 
Hq: p1 — p2 < 0, where py is the true proportion of VLBW babies 
who graduate from high school by age 20 and pz is the true propor- 
tion of non-VLBW babies who graduate from high school by age 
20. P: Two-sample z test for p; — p2. Random: Independent ran- 
dom samples. 10%: ny = 242 is less than 10% of all VLBW babies 
and nz = 233 is less than 10% of all non-VLBW babies. Large 
Counts: 179, 63, 193, 40 are all = 10. Do: z= —2.34 and 
P-value = 0.0095. Conclude: Because the P-value of 0.0095 is less 
than a = 0.05, we reject Hp. We have convincing evidence that the 
true proportion of VLBW babies who graduate from high school by 


age 20 is less than the true proportion of non-VLBW babies who 
graduate from high school by age 20. 
AP3.33 (a) y = —73.64 + 5.7188x, where » = predicted distance 
and x = temperature (degrees Celsius) (b) For each increase of 
1°C in the water discharge temperature, the predicted distance 
from the nearest fish to the outflow pipe increases by about 5.7188 
meters. (c) Yes. The residual plot shows no leftover pattern. (d) 
residual = 78 — 92.21 = —14.21 meters. The actual distance on 
this afternoon was 14.21 meters closer than expected, based on the 
temperature of the water. 
AP3.34 (a) Define W= the weight of a randomly selected 
gift box. Then pw = 8(2) + 2(4) + 3 = 27 ounces and ow= 
V/8(0.52) + 2(12) + 0.22 = 2.01 ounces. (b) We want to find 
30 — 27 

2.01 
and P(W > 30) = 0.0681. Using technology: 0.0678. There is a 
0.0678 probability of randomly selecting a box that weighs more 


= 1.49 


P(W > 30) using the N(27, 2.01) distribution. z = 


than 30 ounces. (c) P(at least one box is greater than 30 ounces) 
=1-—P(none of the boxes is greater than 30 ounces) = 
1 — (1 — 0.0678)’ = 1 — (0.9322)* = 0.2960. (d) Because the dis- 
tribution of W is Normal, the distribution of W will also be Normal, 
with mean pay = 27 ounces and standard deviation oF = 


2.01 -_ 30 — 27 
—— = 0.899. We want to find P(W > 30). z= 


WA 0.899 
and P(Z > 3.34) = 0.0004. There is a 0.0004 probability of ran- 


domly selecting 5 boxes that have a mean weight of more than 30 


= 3.34 


ounces. 
AP3.35 (a) S: Ho: fa — fp = O versus Hy: a — pep ¥ 0, where pa 
is the true mean annualized return for stock A and ig is the true 
mean annualized return for stock B. P: Two-sample t test. 
Random: Independent random samples. 10%: ng = 50 is less than 
10% of all days in the past 5 years and ng = 50 is less than 10% of 
all days in the past 5 years. Normal/Large Sample: ny = 50 = 30 
and ng = 50 = 30. D:t = 2.07. Using df= 40, the P-value is 
between 0.04 and 0.05. Using df= 90.53, P-value = 0.0416. C: 
Because the P-value of 0.0416 is less than a = 0.05, we reject Ho. 
We have convincing evidence that the true mean annualized 
return for stock A is different than the true mean annualized 
return for stock B. (b) H,: a4 — og = 0 vs. Hg: 0, — og > 0, where 
a is the true standard deviation of returns for stock A and op is 
the true standard deviation of returns for stock B. (c) When the 
standard deviation of stock A is greater than the standard deviation 
of stock B, the variance of stock A will be bigger than the variance 
of stock B. Thus, values of F that are significantly greater than | 
would indicate that the price volatility for stock A is higher than 
_ (12.9) 
(9.6) 
test statistic of 1.806 or greater occurred in only 6 out of the 200 
trials. Thus, the approximate P-value is 6/200 = 0.03. Because the 
approximate P-value of 0.03 is less than a = 0.05, we reject Hp. 


that for stock B. (d)F 


= 1.806. (e) In the simulation, a 


‘There is convincing evidence that the true standard deviation of 
returns for stock A is greater than the true standard deviation of 
returns for stock B. 


Solutions S-49 


Chapter 11 
Section 11.1 


Answers to Check Your Understanding 
page 684: 1. Ho: The company’s claimed color distribution for its 
Peanut M&M'°S is correct versus H,: The company’s claimed color 
distribution is not correct. 2. ‘The expected count of both blue and 
orange candies is 46(0.23) = 10.58, for green and yellow is 
46(0.15) = 6.9, and for red and brown is 46(0.12) = 5.52. 
Had (12 — 10.58) A le 10.58)? _ G3 — 6.9) (4 - 6.9) 
x 1058~——t—“<«~S2SB Gt D:C«D 
—552 (2-5.52) 
eed A Y= 11.3724 


5.52 5.52 
page 687: 1. The expected counts are all at least 5. df = 6-1 = 5. 
2 


P-value 
T T T 
10 \ 15-20 
11.3724 
Chi-square distribution with 5 df 


0 > 


3. The P-value is between 0.025 and 0.05 (0.0445). 4. Because 
the P-value of 0.0445 < a = 0.05, we reject Ho. There is convinc- 
ing evidence that the color distribution of M&M’S® Peanut 
Chocolate Candies is different from what the company claims. 
page 691: S: Ho: The distribution of eye color and wing shape is 
the same as what the biologists predict versus H,: The distribution 
of eye color and wing shape is not what the biologists predict. P: 
Chi-square test for goodness of fit. Random: Random sample. 10%: 
n = 200 < 10% of all fruit flies. Large Counts: 112.5, 37.5, 37.5, 
12.5 all = 5. D: x? = 6.1867, df = 3, the P-value is between 0.10 
and 0.15 (0.1029). C: Because the P-value of 0.1029 > a = 0.01, 
we fail to reject Hy. We do not have convincing evidence that the 
distribution of eye color and wing shape is different from what the 
biologists predict. 


Answers to Odd-Numbered Section 11.1 Exercises 
11.1 (a) Ho: The company’s claimed distribution for its deluxe 
mixed nuts is correct versus H,: The company’s claimed distribu- 
tion is not correct. (b) Cashews: 150(0.52) = 78, almonds: 
150(0.27) = 40.5, macadamia nuts: 150(0.13) = 19.5, brazil nuts: 
150(0.08) = 12. 

, (83-78) (29 — 40.5)? _ 20- 19.5)? . (Ue = 12) 


ace 2 «4405 «424195 2 


= 6.599 
11.5 (a) Expected counts are all at least 5 and df = 3. 


(b) 


P-value 


0 2 465998 10 12 14 
Chi-square distribution with 3 df 
(c) The P-value is between 0.05 and 0.10 (0.0858). (d) Because the 
P-value of 0.0858 > a = 0.05, we fail to reject Hy. We do not have 
convincing evidence that the company’s claimed distribution for its 
deluxe mixed nuts is incorrect. 


S-50 Solutions 


11.7 S: Ho: Nuthatches do not prefer particular types of trees 
when searching for seeds and insects versus Hy: Nuthatches do 
prefer particular types of trees when searching for seeds and insects. 
P: Chi-square test for goodness of fit. Random: Random sample. 
10%:n = 156 < 10% of all nuthatches. Large Counts: 84.24, 
62.4, 9.36 all = 5. D: x7 =7.418. With df= 2, the P-value is 
between 0.02 and 0.025 (0.0245). C: Because the P-value of 
0.0245 < a= 0.05, we reject Hy. There is convincing evidence 
that nuthatches prefer particular types of trees when they are 
searching for seeds and insects. 

11.9 Time spent doing homework is quantitative. Chi-square tests 
for goodness of fit should be used only for distributions of categori- 
cal data. 

11.11 (a) S: Ho: The first digit of invoices from this company follow 
Benford’s law versus H,: The first digit of invoices from this com- 
pany do not follow Benford’s law. P: Chi-square test for goodness 
of fit. Random: Random sample. 10%: Assume n = 250 < 10% 
of all invoices from this company. Large Counts: 75.25, 44, 31.25, 
24.25, 19.75, 16.75, 14.5, 12.75, 11.5 all = 5. D: x? = 21.563. 
With df= 8, the P-value is between 0.005 and 0.01 (0.0058). C: 
Because the P-value of 0.0058 < a = 0.05, we reject Ho. There is 
convincing evidence that the first digit of invoices from this com- 
pany do not follow Benford’s law. Follow-up analysis: The largest 
contributors to the statistic are amounts with first digit 3, + and 7. 
There are more invoices that start with 3 or 4 than expected and 
fewer invoices that start with 7 than expected. (b) I: Finding con- 
vincing evidence that the company’s invoices do not follow 
Benford’s law (suggesting fraud), when in reality they are consis- 
tent with Benford’s law. A consequence is falsely accusing this 
company of fraud. II: Not finding convincing evidence that the 
invoices do not follow Benford’s law (suggesting fraud), when in 
reality they do not. A consequence is allowing this company to 
continue committing fraud. A ‘Type I error would be more serious 
for the accountant. 

11.13 (a) Ho: The true distribution of flavors for Skittles candies is 
the same as the company’s claim versus H,: The true distribution of 
flavors for Skittles candies is not the same as the company’s claim. 
(b) Expected counts all = 12. (c) Using df = 4, y? statistics greater 
than 9.49 would provide significant evidence at the a = 0.05 level 
and x? values greater than 13.28 would provide significant evi- 
dence at the a = 0.01 level. (d) Answers will vary. 

11.15 S: Hy: All 12 astrological signs are equally likely versus H,: 
All 12 astrological signs are not equally likely. P: Chi-square test for 
goodness of fit. Random: Random sample. 10%: n = 4344 < 10% 
of all people in the United States. Large Counts: All expected 
counts = 362, which are = 5. D: y? = 19.76. With df= 11, the 
P-value is between 0.025 and 0.05 (0.0487). C: Because the P-value 
of 0.0487 < a = 0.05, we reject Ho. There is convincing evidence 
that the 12 astrological signs are not equally likely. Follow-up analy- 
sis: The largest contributors to the statistic are Aries and Virgo. 
There are fewer Aries (321 — 362 = —41) and more Virgos 
(402 — 362 = 40) than we would expect. 

11.17 S: Ho: Mendel’s 3:1 genetic model is correct versus H,: 
Mendel’s 3:1 genetic model is not correct. P: Chi-square test for 
goodness of fit. Conditions are met. D: y? = 0.3453. With df = 1, 
the P-value > 0.25 (0.5568). C: Because the P-value of 
0.5568 > a = 0.05, we fail to reject Hy. We do not have convinc- 
ing evidence that Mendel’s 3:1 genetic model is wrong. 

11.19 d 

11,21 ¢ 


11.23 The distribution of English grades for the heavy readers is 
skewed to the left, while the distribution of English grades for the 
light readers is roughly symmetric. The center of the distribution of 
English grades is greater for the heavy readers than for the light 
readers. ‘The English grades are more variable for the light readers. 
There is one low outlier in the heavy reading group but no outliers 
in the light reading group. 

11.25 (a) For each additional book read, the predicted English 
GPA increases by about 0.024. The predicted English grade for a 
student who has read 0 books is about 3.42. (b) residual 
=2.85 — 3.828 = —0.978. This student’s English GPA is 0.978 
less than predicted, based on the number of books this student has 
read. (c) Not very strong. On the scatterplot, the points are quite 
spread out from the line. Also, the value of r’ is 0.083, which 
means that only 8.3% of the variation in English grades is 
accounted for by the linear model relating English GPA to num- 
ber of books read. 


Section 11.2 


Answers to Check Your Understanding 

page 699: 1. Main: 0.060 several times a month or less, 0.236 at 
least once a week, 0.703 at least once a day. Commonwealth: 0.121 
several times a month or less, 0.250 at least once a week, 0.628 at 
least once a day. 2. Because there was such a big difference in the 
sample size from the two different types of campuses. 3. Students 
on the main campus are more likely to be everyday users of 
Facebook. Also, those on the commonwealth campuses are more 
likely to use Facebook several times a month or less. 


105 
2 co 
a 
—E 50-4 
§ 
5 40-4 
~ 305 
| 
8 20-4 
o 
0 [a ee Lr 
RS £ >) s x 
SoS os? 
Facebook o & ¢ a) é 
use Se OF SOF SS 
es ~ s 
Fes Fes 
oe F ro 
Campus Main Commonwealth 


type 


page 705: 1. Ho: There is no difference in the distributions of 
Facebook use among students at the main campus and students at 
the commonwealth campuses versus H,: There is a difference in 
the distributions of Facebook use among students at the main cam- 


pus and students at the commonwealth campuses. 
oe = 151.75, a = 612.19, oe 421.81 
55 — 77.56)? 94 — 421.81) 
=! os Pete asl Baus 


4. With df=2, the P-value < 0.0005 (0.000059). 5. Assuming 
that no difference exists in the distributions of Facebook use between 
students on Penn State’s main campus and students at Penn State’s 
commonwealth campuses, there is a 0.000059 probability of obsery- 
ing samples that show a difference in the distributions of Facebook 


use among students at the main campus and the commonwealth 
campuses as large or larger than the one found in this study. 
6. Because the P-value of 0.000059 < a=0.05, we reject Hp. 
There is convincing evidence that the distribution of Facebook use 
is different among students at Penn State’s main campus and stu- 
dents at Penn State’s commonwealth campuses. 


page 711: 1. 
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2. S: Ho: There is no difference in the distribution of quality of life 
for patients who have suffered a heart attack in Canada and the U.S. 
versus H,: There is a difference. . . . P: Chi-square test for homoge- 
neity. Random: Independent random samples. 10%: n, = 311 < 10 
% of all Canadian heart attack patients and nz = 2165 < 10% ofall 
US. heart attack patients. Large Counts: 77.37, 538.63, 71.47, 
497.53, 109.91, 765.09, 41.70, 290.30, 10.55, 73.45 all = 5. D: 
x? = 11.725. With df= 4, the P-value is between 0.01 and 0.02 
(0.0195). C: Because the P-value of 0.0195 > a = 0.01, we fail to 
reject Ho. There is not convincing evidence that a difference exists 
in the distribution of quality of life for heart attack patients in 
Canada and the United States. 

page 717: S: Ho: There is no association between an exclusive terti- 
tory clause and business survival versus H,: There is an association. . . . 
P: Chi-square test for independence. Random: Random sample. 
10%: We assume that n = 170 < 10% of all new franchise firms. 
Large Counts: 102.74, 20.26, 39.26, 7.74 all = 5. D: y7 = 5.911. 
Using df= 1, the P-value is between 0.01 and 0.02 (0.0150). C: 
Because the P-value of 0.0150 > a= 0.01, we fail to reject Ho. 
‘There is not convincing evidence of an association between exclusive 
territory clause and business survival. 


Answers to Odd-Numbered Section 11.2 Exercises 
11.27 (a) Female: 0.209, 0.104, 0.313, 0.373. Male: 0.463, 0.269, 
0.075, 0.194. 


(b) 
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(c) In general, it appears that females were classified mostly as low 
social comparison, whereas males were classified mostly as high so- 
cial comparison. However, about an equal percentage of males and 
females were classified as high mastery. 


Solutions S-51 


11.29 (a) Ho: There is no difference in the distribution of sports 
goals for male and female undergraduates at this university versus 
H,: There is a difference. . . . (b) 22.5, 12.5, 13, 19, 22.5, 12.5, 13, 
19. (c) x? = 24.898. 
11.31 (a) Random: Independent random samples. 10%: 
n, = 67 < 10% of all males and n2 = 67 < 10% of all females at 
the university. Large Counts: All expected counts = 5. (b) With 
df = 3, the P-value < 0.0005 (0.000016). (c) Assuming that no dif- 
ference exists in the distributions of goals for playing sports among 
males and females, there is a 0.000016 probability of observing 
independent random samples that show a difference in the distribu- 
tions of goals for playing sports among males and females as large or 
larger than the one found in this study. (d) Because the P-value of 
0.000016 < a = 0.05, we reject Hp. There is convincing evidence 
of a difference in the distribution of goals for playing sports among 
male and female undergraduates at this university. 
11.33 (a) Cold: 0.593 hatched. Neutral: 0.679 hatched. Hot: 0.721 
hatched. As the temperature warms up from cold to neutral to hot, 
the proportion of eggs that hatch appears to increase. (b) S: Ho: 
There is no difference in the true proportion of eggs that hatch in 
cold, neutral, or hot water versus H,: There is a difference. .. . P: 
Chi-square test for homogeneity. Random: 3 groups in a random- 
ized experiment. Large Counts: 18.63, 38.63, 71.74, 8.37, 17.37, 
32.26 all =5. D: y?=1.703. With df=2, the P-value 
> 0.25 (0.4267). C: Because the P-value of 0.4267 > a = 0.05, 
we fail to reject Ho. We do not have convincing evidence that there 
is a difference in the true proportions of eggs that hatch in cold, 
neutral, or hot water. 
11.35 We do not have the actual counts of the travelers in each 
category. We also do not know if the sample was taken randomly or 
if the samples are independent. 
11.37 (a) The data are given in the table below. The best success 
rate is for the patch plus the drug (0.355), followed by the drug 
alone (0.303). The patch alone (0.164) is just a little better than the 
placebo (0.156). 


Nicotine Drug Patch Placebo Total 
Patch plus drug 
Success 40 74 87 25 226 
Failure 204 170 158 135 667 
Total 244 244 245 160 893 


(b) Each of the four treatments has the same probability of success 
for smokers like these. (c) S: Hp: The true proportions of smokers 
like these who are able to quit for a year are the same for each 
of the four treatments versus H,: The true proportions are not the 
same. ... P: Chi-square test for homogeneity. Random: 4 groups in 
a randomized experiment. Large Counts: 61.75, 61.75, 62, 40.49, 
182.25, 182.25, 183, 119.5] all = 5. D:y? = 34.937. With df = 3, 
the P-value < 0.0005. C: Because the P-value of approximately 
0 < a=0.05, we reject Ho. There is convincing evidence that the 
true proportions of smokers like these who are able to quit for a year 
are not the same for each of the four treatments. 

11.39 The largest component comes from those who had success 
using both the patch and the drug (25 more than expected). The 
next largest component comes from those who had success using 
just the patch (21.75 less than expected). 

11.41 Buyers are much more likely to think the quality of recycled 
coffee filters is higher, while nonbuyers are more likely to think the 
quality is the same or lower. 


S-52 Solutions 
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11.43 (a) Ho: There is no association between beliefs about the 
quality of recycled products and whether or not a person buys recy- 
cled products in the population of adults versus H,: There is an 
association. . . . (b)13.26, 35.74, 8.66, 23.34, 14.08, 37.92 
(c) y? = 7.64. With df = 2, the P-value is between 0.02 and 0.025 
(0.022). (d) Because the P-value of 0.022 < a = 0.05, we reject Ho. 
There is convincing evidence of an association between beliefs 
about the quality of recycled products and whether or not a person 
buys recycled products in the population of adults. 

11.45 S: Ho: There is no association between education level and 
opinion about a handgun ban in the adult population versus H,: 
There is an association. . . . P: Chi-square test for independence. 
Random: Random sample. 10%:n = 1201 < 10% of all adults. 
Large Counts: 46.94, 86.19, 187.36, 94.29, 71.22, 69.06, 126.81, 
275.64, 138.71, 104.78 all = 5. D: y? = 8.525. With df= 4, the 
P-value is between 0.05 and 0.10 (0.0741). C: Because the P-value 
of 0.0741 > a = 0.05, we fail to reject Hp. We do not have con- 
vincing evidence that there is an association between educational 
level and opinion about a handgun ban in the adult population. 
11.47 (a) Independence, because the data come from a single ran- 
dom sample. (b) Ho: There is no association between gender and 
where people live in the population of young adults versus H,: 
There is an association. . . . (ec) Random: Random sample. 
10% :n = 4854 < 10% of all young adults. Large Counts: The 
expected counts are all at least 5. (d) P-value: If no association exists 
between gender and where people live in the population of young 
adults, there is a 0.012 probability of getting a random sample of 
4854 young adults with an association as strong or even stronger 
than the one found in this study. Conclusion: Because the P-value 
of 0.012 < a= 0.05, we reject Ho. There is convincing evidence 
that an association exists between gender and where people live in 
the population of young adults. 

11.49 (a) Hypotheses: Ho: There is no difference in the improve- 
ment rates for patients like these who receive gastric freezing and 
those who receive the placebo versus H,: There is a difference. . . . 
P-value: Assuming that no difference exists in the improvement 
rates between those receiving gastric freezing and those receiving 
the placebo, there is a 0.570 probability of observing a difference 
in improvement rates as large or larger than the difference 
observed in the study by chance alone. Conclusion: Because the 
P-value of 0.570 is larger than a = 0.05, we fail to reject Ho. 
There is not convincing evidence that a difference exists in the 
improvement rates for patients like these who receive gastric 
freezing and those who receive the placebo. (b) The P-values are 
equal and z7 = (—0.57)? = 0.3249 = x? = 0.322. 

11.51 d 

11.53 d 

11.55 a 

11.57 (a) One-sample t interval for a mean. (b) ‘Two-sample z test 
for the difference between two proportions. 


11.59 (a) Experiment, because a treatment (type of rating scale) 
was deliberately imposed on the students who took part in the study. 
(b) Several of the expected counts are less than 5. 


Answers to Chapter 11 Review Exercises 


R11.1 S: Ho: The proposed 1:2:1 genetic model is correct versus 
H,: The proposed 1:2:1 genetic model is not correct. P: Chi-square 
test for goodness of fit. Random: Random _ sample. 
10% :n = 84 < 10% of all yellow-green parent plants. Large 
Counts: 21,42, 21all =5. D: y?= 6.476. Using df= 2, the 
P-value is between 0.025 and 0.05 (0.0392). C: Because the P-value 
of 0.0392 > a = 0.01, we fail to reject Hy. We do not have con- 
vincing evidence that the proposed 1:2:1 genetic model is not 
correct. 

R11.2 Several of the expected counts are less than 5. 


R113 (a) 


Stress Exercise Usual Total 
management care 
Suffered cardiac event 3 7 12 22 
No cardiac event 30 27 28 85 
Total 33 34 40 107 


(b) The success rate was highest for stress management (0.909), 
followed by exercise (0.794) and usual care (0.70). (c) S: Hp: The 
true success rates for patients like these are the same for all three 
treatments versus H,: The true success rates are not all the same. 
... P: Chi-square test for homogeneity. Random: 3 groups in a ran- 
domized experiment. Large Counts: 6.79, 6.99, 8.22, 26.21, 27.01, 
31.78 all = 5. D: x? = 4.840. With df = 2, the P-value is between 
0.05 and 0.10 (0.0889). C: Because the P-value > a = 0.05, we 
fail to reject Hy. We do not have convincing evidence that the true 


success rates for patients like these are not the same for all three 
treatments. 

R11.4 (a) The data could have been collected from 3 independent 
random samples—a random sample of ads from magazines aimed at 
young men, a random sample of ads from magazines aimed at young 
women, and a random sample of ads aimed at young adults in gen- 
eral. In each sample, the ads would be classified as sexual or not 
sexual. (b) The data could have been collected from a single ran- 
dom sample of ads from magazines aimed at young adults. Then 
each ad in the sample would be classified as sexual or not sexual, 
and the magazine that the ad was from would be classified as aimed 
at young men, young women, or young adults in general. 


35] (1113)(576) (351 — 424.8 
2?* = 9.6094 = 60.94%. = 
iVggg  CeOet Olen ae09 424.8 


12.82. (‘The difference is due to rounding error.) (d) ‘The “sexual, 


= 424.8. 


Women” cell. There were 225 observed ads in this cell, which was 
73.8 more than expected. 
R11.5 (a) 


Percent 


Both groups of children have the largest percentage reporting 
grades as the goal. But after that, boys were more likely to pick 
sports, whereas girls were more likely to pick being popular. 

(b) S: Ho: There is no association between gender and goals for 4th, 
5th, and 6th grade students versus H,: There is an association. . . . 
P: Chi-square test for independence. Random: Random sample. 
10% :n =478 < 10% of all 4th, 5th, and 6th grade students. 
Large Counts: 129.70, 117.30, 74.04, 66.96, 47.26, 42.74 all = 5. 
D: x? = 21.455. With df = 2, the P-value < 0.0005 (0.00002). C: 
Because the P-value of 0.00002 < a = 0.05, we reject Hp. There is 
convincing evidence that an association exists between gender and 
goals for 4th, 5th, and 6th grade students. 


Answers to Chapter 11 AP® Statistics Practice Test 


T1l1.1 b 
T11.2 
T11.3 
T11.4 
TALS 
T11.6 
T11.7 b 

T118 a 

T11.9 d 

T11.10 d 

T11.11 S: Ho: The distribution of gas types is the same as the dis- 
tributor’s claim versus H,: The distribution of gas types is not the 
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same as the distributor’s claim. P: Chi-square test for goodness of fit. 
Random: Random sample. 10% :n = 400 < 10% of all customers 
at this distributor’s service stations. Large Counts: 240, 80, 80 all = 5. 
D: y? = 13.15. With df = 2, the P-value is between 0.001 and 0.0025 
(0.0014). C: Because the P-value of 0.0014 < a = 0.05, we reject 
Ho. There is convincing evidence that the distribution of gas type is 
not the same as the distributor claims. 

T11.12 (a) Random assignment was used to create three roughly 
equivalent groups at the beginning of the study. 


(b) 
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(c) Ho: The true proportion of spouse abusers like the ones in the 
study who will be arrested within 6 months is the same for all three 
police responses versus H,: ‘The true proportions are not all the 
same. (d) P-value: If the true proportion of spouse abusers like the 
ones in the study who will be arrested within 6 months is the same 
for all three police responses, there is a 0.0796 probability of getting 
differences between the three groups as large as or larger than the 
ones observed by chance alone. Conclusion: Because the P-value 
of 0.0796 is larger than a = 0.05, we fail to reject Hy. There is not 
convincing evidence that true proportion of spouse abusers like the 
ones in the study who will be arrested within 6 months is not the 
same for all three police responses. 


Solutions S-53 


T11.13 (a) S: Ho: There is no association between smoking status 
and educational level among French men aged 20 to 60 years versus 
H,: There is an association. . . . P: Chi-square test for independence. 
Random: Random sample. 10% :n = 459 < 10% of all French 
men aged 20 to 60 years. Large Counts: 59.48, 44.21, 42.31, 50.93, 
37.85, 36.22, 42.37, 31.49, 30.14, 34.22, 25.44, 24.34 all = 5. 
D: y? = 13.305. With df = 6, the P-value is between 0.025 and 0.05 
(0.0384). C: Because the P-value of 0.0384 < a = 0.05, we reject 
Ho. There is convincing evidence of an association between smok- 
ing status and educational level among French men aged 20 to 60 
years. 


Chapter 12 
Section 12.1 


Answers to Check Your Understanding 

page 752: S: 8 = slope of the population regression line relating 
fat gain to change in NEA. P: t interval for the slope. Linear: ‘There 
is no leftover pattern in the residual plot. Independent: ‘The sample 
size (n = 16) is less than 10% of all healthy young adults. Normal: 
The histogram of the residuals shows no strong skewness or outliers. 
Equal SD: Other than one point with a large positive residual, the 
residual plot shows roughly equal scatter for all x values. Random: 
Random sample. D: With df= 14, (—0.005032, —0.001852). C: 
We are 95% confident that the interval from —0.005032 to 
—0.001852 captures the slope of the population regression line 
relating fat gain to change in NEA. 

page 757: S: Ho: 6 = 0 versus H,: 6 < 0, where @ is the slope of 
the true regression line relating fat gain to NEA change. P: t test for 
the slope 3. D: t = —4.64. P-value ~ 0.000/2 ~ 0. C: Because the 
P-value of approximately 0 is less than a = 0.05, we reject Ho. 
There is convincing evidence that the slope of the true regression 
line relating fat gain to NEA change is negative. 


Answers to Odd-Numbered Section 12.1 Exercises 

12.1 The Equal SD condition is not met because the SD of the 
residuals clearly increases as the laboratory measurement (x) 
increases. 

12.3 Linear: There is no leftover pattern in the residual plot. 
Independent: Knowing the BAC for one subject should not help us 
predict the BAC for another subject. Normal: The histogram of the 
residuals shows no strong skewness or outliers. Equal SD: The 
residual plot shows roughly equal scatter for all x values. Random: 
These data come from a randomized experiment. 

12.5 qa is the true y intercept, which measures the true mean BAC 
level if no beers had been drunk (a = —0.012701). @ is the true 
slope, which measures how much the true mean BAC changes with 
the drinking of one additional beer (b = 0.018). Finally, o is the 
true standard deviation of the residuals, which measures how much 
the observed values of BAC typically vary from the population 
regression line (s = 0.0204). 

12.7 (a) SE, = 0.0024. If we repeated the experiment many times, 
the slope of the sample regression line would typically vary by 
about 0.0024 from the slope of the true regression line for predict- 
ing BAC from the number of beers consumed. (b) With 
df = 14, 0.018 + 2.977(0.0024) = (0.011,0.025). (ce) We are 99% 
confident that the interval from 0.011 to 0.025 captures the slope 
of the true regression line for predicting BAC from the number of 
beers consumed. (d) If we repeated the experiment many times 
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and computed a confidence interval for the slope each time, about 
99% of the resulting intervals would contain the slope of the true 
regression line for predicting BAC from the number of beers 
consumed. 

12.9 S: 6 = the slope of the population regression line relating 
number of clusters of beetle larvae to number of stumps. P: t inter- 
val for B. D: With df = 21, (8.678, 15.11). C: We are 99% confi- 
dent that the interval from 8.678 to 15.11 captures the slope of the 
population regression line relating number of clusters of beetle lar- 
vae to number of stumps. 

12.11 (a) y = —1.286 + 11.894(5) = 58.184 clusters. (b) s = 6.419, 
so we would expect our prediction to be off from the actual number 
of clusters by about 6.419 clusters. 

12.13 (a) y = 166.483 — 1.0987x, where y is the predicted corn 
yield and x is the number of weeds per meter. Slope: for each addi- 
tional weed per meter, the predicted corn yield will decrease by 
about 1.0987 bushels/acre. y intercept: if there are no weeds per 
meter, we would predict a corn yield of 166.483 bushels/acre. 
(b) When using weeds per meter to predict corn yield, the actual 
yield will typically vary from the predicted yield by about 7.98 bushels/ 
acre. (c) S: Ho: 6 = 0 versus H,: 6 < 0, where (2 is the slope of the 
true regression line relating corn yield to weeds per meter. P: t test 
for GB. D: t = —1.92. P-value = 0.075/2 = 0.0375. C: Because the 
P-value of 0.0375 is less than a = 0.05, we reject Hy. There is con- 
vincing evidence that the slope of the true regression line relating 
corn yield to weeds per meter is negative. 

12.15 S: Ho: 6 = 0 versus H,: 8 < 0, where @ is the slope of the 
population regression line relating heart disease death rate to 
wine consumption in the population of countries. P: t test for @. 
Linear: There is no leftover pattern in the residual plot. 
Independent: The sample size (n = 19) is less than 10% of all 
countries. Normal: The histogram of residuals shows no strong 
skewness or outliers. Equal SD: The residual plot shows that the 
standard deviation of the death rates might be a little smaller for 
large values of wine consumption, x, but it is hard to tell with so 
few data values. Random: Random sample. 
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D: t = —6.46, df = 17, and P-value ~ 0. C: Because the P-value 
of approximately 0 is less than a = 0.05, we reject Ho. There is 
convincing evidence of a negative linear relationship between 
wine consumption and heart disease death rate in the population 
of countries. 

12.17 (a) With df= 19, 11,630.6 + 2.093(1249) = (9016.4, 
14,244.8). (b) Because the automotive group claims that people 
drive 15,000 miles per year, this says that for every increase of | 
year, the mileage would increase by 15,000 miles. 
(c) t= —2.70. With df = 19, the P-value is between 0.01 and 0.02 
(0.0142). Because the P-value of 0.0142 is less than a = 0.05, we 
reject Hp. We have convincing evidence that the slope of the 
population regression line relating miles to years is not equal to 
15,000. (d) Yes. Because the interval in part (a) does not include 
the value 15,000, the interval also provides convincing evidence 
that the slope of the population regression line relating miles to 
years is not equal to 15,000. 

12.19 c 

12.21 a 

12.23 b 

12.25 (a) The two treatments (say the color, read the word) were 
deliberately assigned to the students. (b) He used a randomized 
block design where each student was a block. He did this to help 
account for the different abilities of students to read the words or to 
say the color they were printed in. (c) To help average out the 
effects of the order in which people did the two treatments. If every 
subject said the color of the printed word first and were frustrated by 
this task, the times for the second treatment might be worse. ‘Then 
we wouldn’t know the reason the times were longer for the second 
treatment—because of frustration or because the second method 
actually takes longer. 

12.27 There is a small number of differences (ng = 16 < 30) and 


there is an outlier. 


ie 299) = ag 295-4 77 + 212. 
12.29: (a) Ga) 1526> 0.1933. (ii) ~1526.~—O 0.3827. 
212 
(iii) 305 0.6951. (b) No. The probability that a person is a snow- 


mobile owner (295/1526 = 0.1933) is different from the probabil- 
ity that the person is a snowmobile owner given that he or she 
belongs to an environmental organization (16/305 = 0.0525). 


: 295 \/ 294 
(c) (i) P(both are owners) = (Se )(ass) = 0.0373 


(ii) P(at least one belongs to an environmental organization) 


; = 1221\f1220 
= | — P(neither belong) = | GelGca) = 0.3599 


Section 12.2 
Answers to Check Your Understanding 


—™. 
page 782: 1. Option 1: premium = —343 + 8.63(58) = $157.54 


— ~~. 
Option 2: In(premium) = —12.98 + 4.416(In 58) = 4.9509 
> y= oF PY = $141.30 

a ~~ 


Option 3: In(premium) = —0.063 + 0.0859(58) = 4.9192 

> y= oF? = $136.89 

2. Exponential (Option 3), because the scatterplot showing 
In(premium) versus age was the most linear and this model had the 
most randomly scattered residual plot. 


Answers to Odd-Numbered Section 12.2 Exercises 

12.31 (a) The scatterplot shows a fairly strong, positive, slightly 
curved association between length and period with one very 
unusual point (106.5, 2.115) in the top right corner. 
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(b) The class used the square root of x = length. (c) The class used 
the square of y = period. - 
12.33 (a) 1: = —0.08594 + 0.21Vx, where y is the period and x 


is the length. 2: - —0.15465 + 0.0428x, where y is the period and 
xisthe length. (b) 1: » = —0.08594 + 0.2180 = 1.792 seconds. 2: 


— 


y? = —0.15465 + 0.0428(80) = 3.269, so py = _V 3.269 = 1.808 
seconds. 

12.35 (a) The scatterplot of log(period) versus log(length) is 
roughly linear and the residual plot shows no obvious leftover pat- 


terns. (b) log = —0.73675 + 0.51701 log(x), where y is the period 
and x is the length. 


——™~ 
12.37 log y = —0.73675 + 0.51701 log(80) = 0.24717. Thus, y = 
10°747!7 = 1.77 seconds. 


— 
12.39 log y = 1.01 + 0.72 log(127) = 2.525. Thus, 

y = 10799 = 334.97 grams. 

12.41 (a) The relationship between bacteria count and time is 
strong, negative, and curved with a possible outlier in the top left- 
hand corner. 
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(b) Because the scatterplot of In(count) versus time is fairly 
linear. (c) In y = 5.97316 — 0.218425x, where y is the count of 
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surviving bacteria and x is time in minutes. (d) Iny = 5.97316 — 

0.218425(17) = 2.26, so py = e°6 = 9.58 or 958 bacteria. 

12.43 (a) Exponential, because the scatterplot of log(height) versus 
bounce number is more linear. (b) log y = 0.45374 — 0.1171 6x, 
where y = height in feet and x = bounce number. 

(c) log y = 0.45374 — 0.11716(7) = — 0.36638, so 

y = 10793665 = 0.43 feet. (d) The trend in the residual plot sug- 
gests that the residual for x = 7 would be positive, meaning that the 
predicted height will be less than the actual height. 

12.45 (a) There is a strong, positive curved relationship between 
heart weight and length of left ventricle for mammals. 
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(b) Two scatterplots are given below. Because the relationship 
between In(weight) and In(length) is roughly linear, heart weight 
and length seem to follow a power model. 
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(c) Iny = —0.314 + 3.1387 In x, where y is the weight of the heart 
and x is the length of the cavity of the left ventricle. (d) Iny = 
—0.314 + 3.1387 In (6.8) = 5.703, so y =e? = 299.77 grams. 
12.47 c 

1249 ¢ 

12.51 (a) For Marcela, X = the length of her shower on a ran- 
domly selected day follows a Normal distribution with mean 4.5 
minutes and standard deviation 0.9 minutes. We want to find 


3-45 6-45 
P3B<X<6).z= 09 = — 1.67 and z= 09 = 1.67, so 


PB <X <6) = 0.9050. Using technology: 0.9044. There is a 
0.9044 probability that Marcela’s shower lasts between 3 and 6 min- 
— 4.5 
utes. (b) Solving — 0.67 = a 
QO; — 4.5 
0.9 
ogy: Q; = 3.893 minutes and Q; = 5.107 minutes. Thus, an outlier 
is any value above 5.107 + 1.5(5.107 — 3.893) = 6.928. Because 
7 > 6.928, a shower of 7 minutes would be considered an outlier 
for Marcela. (c) P(X > 7) = 0.0027. Let Y = the number of days 
that Marcela’s shower is 7 minutes or higher. Y is a binomial random 
variable with n = 10 and p = 0.0027. P(Y = 2)=1-— PY S)l)= 
1—binomcdf (trials: 10, p: 0.0027, x value: 1) 
= 0.0003. (d) x follows a N(4.5, 0.285) distribution and we want to 
_ 5-45 
find Px > 5).z= 0285 1.75 and P(Z > 1.75) =. Using tech- 
nology: 0.0397. There is a 0.0397 probability that the mean length 
of Marcela’s showers on these 10 days exceeds 5 minutes. 


gives Q) = 3.897 minutes. 


Solving 0.67 = gives Q; = 5.103 minutes. Using technol- 


8 10 12 14 16 18 =I 0 1 2 3 
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12.53 (a) S: p = true proportion of all AP® teachers attending this 
workshop who have tattoos. P: One-sample z interval for p. Random: 
Random sample. 10%: The sample size (n = 98) is less than 10% of 
the population of teachers at this workshop (1100). Large Counts: 
23 and 75 are both = 10. D: (0.151, 0.319). C: We are 95% confi- 
dent that the interval from 0.151 to 0.319 captures the true propor- 
tion of AP® teachers at this workshop who have tattoos. (b) Yes. 
Because the value 0.14 is not included in the interval, we have con- 
vincing evidence that the true proportion of teachers at the work- 
shop who have a tattoo is not 0.14. (c) If we had two more failures, 
the interval will shift to lower values and might include the value 
0.14. However, the new interval is (0.148, 0.312), which does not 
include the value 0.14. So the answer would not change if we got 
responses from the 2 nonresponders. 


Answers to Chapter 12 Review Exercises 


R12.1 (a) There is a moderately strong, positive linear relationship 
between the thickness and the velocity. (b) y = 70.44 + 274.78x, 
where y is the velocity and x is the thickness. (c) Residual = 
104.8 — 180.352 = — 75.552, so the line overpredicts the velocity 
by 75.552 ft/sec. (d) The linear model is appropriate. ‘The scatter- 
plot shows a linear relationship and the residual plot has no leftover 
patterns. (e) Slope: For each increase of an inch in thickness, the 
predicted velocity increases by 274.78 feet/second. s: When using 
the least-squares regression line with x = thickness to predict y = 
velocity, we will typically be off by about 56.36 feet per second. 7”: 
About 49.3% of the variation in velocity is accounted for by the 
linear relationship relating velocity to thickness. SEy: If we take 
many different random samples of 12 pistons and compute the 
least-squares regression line for each sample, the estimated slope 
will typically vary from the slope of the population regression line 
for predicting velocity from thickness by about 88.18. 

R12.2 S: Ho: = 0 versus H,:3 # 0, where (2 is the slope of the pop- 
ulation regression line relating thickness to velocity. P: t test for . 
Linear: The residual plot shows no leftover patterns. Independent: 
Knowing the velocity for one piston should not help us predict the 
velocity for another piston. Also, the sample size (n = 12) is less than 
10% of the pistons in the population. Normal: We are told that the 
Normal probability plot of the residuals is roughly linear. Equal SD: 
The residual plot shows roughly equal scatter for all x values. Random: 
The data come from a random sample. D: t = 3.116. With df= 10, 
the P-value is between 0.01 and 0.02 (0.0109). C: Because the P-value 
of 0.0109 is less than a = 0.05, we reject Ho. There is convincing 
evidence of a linear relationship between thickness and gate velocity 
in the population of pistons formed from this alloy of metal. 

R12.3 D: With df= 12 — 2 = 10, (78.315, 471.245). C: We are 
95% confident that the interval from 78.315 to 471.245 captures the 
slope of the population regression line for predicting velocity from 
thickness for the population of pistons formed from this alloy of 
metal. Because 0 is not in the interval, we reject 0 as a plausible 
value for the slope of the population regression line, as in R12.2. 
R12.4 The Linear condition is violated because there is clear cur- 
vature to the scatterplot and an obvious curved pattern in the resid- 
ual plot. The Random condition may not be met because we 
weren't told if the sample was selected at random. 

R12.5 (a) Yes, because there is no leftover pattern in the residual 


plot. (b) y = — 0.000595 + 03(4). Here, y = intensity and x = 
2 


P 1 
distance. (c) y = — 0.000595 + 03( 5) = 0.0674 candelas. 


R12.6 (a) 


-2.5 4 
5.0 5 


Residual 


Time 


(b) There is a leftover pattern in the residual plot, so the relation- 
ship between practice time and percent of words recalled is not 
linear. (c) Power, because the scatterplot showing In(recall) versus 
In(time) is more linear than the scatterplot showing In(recall) ver- 


sus time. (d) Power: Iny = 3.48 + 0.293 In (25) =4.423 and 


ee recalled. Exponential: 


y = et? = 83.35 percent of words 
jn y = 3.69 + 0.0304(25) = 4.45 and j =e = 85.63 percent of 
words recalled. Based on my answer to part (c), I think the power 


model will give a better prediction. 


Answers to Chapter 12 AP® Statistics Practice Test 


TI2.0 ic 

T12.2 b 

T12.3 d 

T12.4 a 

BIZ id 

T12.6 d 

Wize € 

T12.8 d 

T12.9 d 

T12.10 c 

T12.11 (a) » = 4.546 + 4.832x, where y is the weight gain and x is 
the dose of growth hormone. (b) (i) For each 1-mg increase in 
growth hormone, the predicted weight gain increases by about 
4.832 ounces. (ii) Ifa chicken is given no growth hormone (x = 0), 
the predicted weight gain is 4.546 ounces. (iii) When using the 
least-squares regression line with x = dose of growth hormone to 
predict y = weight gain, we will typically be off by about 3.135 
ounces. (iv) If we repeated this experiment many times, the sample 
slope will typically vary by about 1.0164 from the true slope of the 
least-squares regression line with y = weight gain and x = dose of 
growth hormone. (v) About 38.4% of the variation in weight gain is 
accounted for by the linear model relating weight gain to the dose 
of growth hormone. (c) S: Ho:3 = 0 versus H,:3 4 0, where (3 is 
the slope of the true regression line relating y = weight gain to x = 

dose of growth hormone. P: t test for 6. D: t = 4.75, df = 13, and 
P-value = 0.0004. C: Because the P-value of 0.0004 is less than 
a = 0.05, we reject Hp. There is convincing evidence of a linear 
relationship between the dose of growth hormone and weight gain 
for chickens like these. (d) D: With df = 13, (2.6373, 7.0273). C: 
We are 95% confident that the interval from 2.6373 to 7.0273 cap- 
tures the slope of the true regression line relating y = weight gain 
to x = dose of growth hormone for chickens like these. 

112.12 (a) There is clear curvature evident in both the scatterplot 
and the residual plot. (b) 1: p = 2.078 + 0.0042597(30)? = 117.09 
board feet. 2: Iny = 1.2319 + 0.113417(30) = 4.63441 and 
y = et 41 = 102.967 board feet. (c) The residual plot for Option 1 
is much more scattered, while the plot for Option 2 shows curva- 
ture, meaning that the model from Option 1 relating the amount of 
usable lumber to cube of the diameter is more appropriate. 


Answers to Cumulative AP® Practice Test 4 


AP4.1 
AP4.2 
AP4.3 
AP4.4 
AP4.5 
AP4.6 
AP4.7 
AP4.8 
AP4.9 e 
AP4.10 a 
AP4.11 b 
AP4.12 d 
AP4.13-¢ 
AP4.14 d 
AP4.15 b 
AP4.16 a 
AP4.17 d 
AP4.18 c 
AP4.19 a 
AP4.20 b 
AP4,21 ¢ 
AP4.22 e 
AP4.23 b 
AP4.24 c 
AP4.25 d 
€ 
c 
c 
b 
b 
d 
a 
e 
b 
a 
b 
d 
a 
d 
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AP4.26 

AP4.27 

AP4.28 

AP4.29 

AP4.30 

AP4.31 

AP4.32 

AP4.33 

AP4.34 

AP4.35 

AP4.36 

AP4.37 

AP4.38 

AP4.39 

AP4.40 d 

AP4.41 S: Ho:f) — 2 = 0 vs. Hy) — 2 #0, where ju; = true 
mean difference in electrical potential for diabetic mice and juz = 
true mean difference in electrical potential for normal mice. P: 
‘Two-sample t test for 4; — }12. Random: Independent random sam- 
ples. 10%: n, = 24 is less than 10% of all diabetic mice and nz = 18 
is less than 10% of all normal mice. Normal/Large Sample Size: No 
outliers or strong skewness. D: t = 2.55. Using df = 23, the P-value 
is between 0.01 and 0.02. Using df = 38.46, P-value = 0.0149. C: 
Because the P-value of 0.0149 is less than a = 0.05, we reject Ho. 
‘There is convincing evidence that the true mean difference in elec- 
tric potential for diabetic mice is different than for normal mice. 
AP4.42 (a) Ao:p1 = pr = 0 vs. Hap =p 0, where p= the 
true proportion of women like the ones in the study who were phys- 
ically active as teens that would suffer a cognitive decline and p2 = 
the true proportion of women like the ones in the study who were 
not physically active as teens that would suffer a cognitive decline. 
(b) A two-sample z test for p; — 2. (c) No. Because the participants 
were mostly white women from only four states, the findings may 
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not be generalizable to women in other racial and ethnic groups or 
who live in other states. (d) Two variables are confounded when 
their effects on the response variable cannot be distinguished from 
one another. For example, women who were physically active as 
teens might have also done other things differently as well, such as 
eating a healthier diet. We would be unable to determine if it was 
their physically active youth or their healthier diet that slowed their 
level of cognitive decline. 
AP4.43 (a) Because the first question called ita “fat tax,” people may 
have reacted negatively because they believe this is a tax on those 
who are overweight. ‘The second question provides extra information 
that gets people thinking about the obesity problem in the U.S. and 
the increased health care that could be provided as a benefit with the 
tax money. Better: “Would you support or oppose a tax on non-diet 
sugared soda?” (b) This method samples only people at fast-food res- 
taurants. They may go to these restaurants because they like the sug- 
ary drinks and wouldn’t want to pay a tax on their favorite beverages. 
‘Thus, it is likely that the proportion of those who would oppose such 
a tax will be overestimated with this method. Better: take a random 
sample of all New York State residents. (c) Use a stratified random 
sampling method in which each state is a stratum. 
AP4.44 (a) P(S) = (0.1)(0.3) + (0.4)(0.2) + (0.5)(0.1) = 0.16. 
(0.5)(0.1) 
Pye (0.1)(0.3) + (0.4)(0.2) + (0.5)(0.1) tala: 
AP4.45 (a) No. The scatterplot exhibits a strong curved pattern. (b) 
B, because the scatterplot shows a much more linear pattern and its 
residual plot shows no leftover patterns. 


a ~~ 
(c) In@weight) = 15.491 — 1.5222 In (3700) = 2.984, thus 


—™« 

weight = e7°5* = 19.77mg. (d) About 86.3% of the variation in 
In(seed weight) is accounted for by the linear model relating 
In(seed weight) to In(seed count). 

AP4.46 (a) Let X = diameter of a randomly selected lid. Because 
X follows a Normal distribution, the sampling distribution of x also 


0.02 
followsa Normal distribution. uz = 4inchesand og = ——= = 0.004 


V25 
0.004 inches. (b) We want to find P(x < 3.99 or x > 4.01) using the 
Sst cat es ae 401-4 _ 
N(4, 0.004) distribution. z = 0.004 2.50 andz= 0.004 > 


2.50. P(Z < —2.50 or Z > 2.50) = 0.0124. Assuming that the 


machine is working properly, there is a 0.0124 probability that the 
mean diameter of a sample of 25 lids is less than 3.99 inches or 
greater than 4.01 inches. (c) We want to find P44 <x < 4.01) 


using the N(4, 0.004) distribution. z= ae and 
4.01 —4 auld 
i 0 004. 2.50. P(O < Z < 2.50) = 0.4938. Assuming that 


the machine is working properly, there is a 0.4938 probability that 
the mean diameter of a sample of 25 lids is between 4.00 and 4.01 
inches. (d) Let Y= the number of samples (out of 5) in which 
the sample mean is between 4.00 and 4.01. The random variable 
Y has a binomial distribution with n=5 and p =0.4938. 
Using technology: P(X = 4)=1— P(X S$ 3)=1 — binomcdf£ 
(trials:5, p:0.4938, x value:3) =0.1798. (e) 
Because the probability found in part (b) is less than the probabil- 
ity found in part (d), getting a sample mean below 3.99 or above 
4.01 is more convincing evidence that the machine needs to be 
shut down. This event is much less likely to happen by chance 
when the machine is working correctly. (f) Answers will vary. 
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Appendix A 
About the AP® Exam 


Chapter 1 


¢ If you learn to distinguish categorical from quantitative variables 
now, it will pay big rewards later. You will be expected to analyze 
categorical and quantitative variables correctly on the AP® exam. 


¢ When comparing distributions of quantitative data, it’s not 
enough just to list values for the center and spread of each distribu- 
tion. You have to explicitly compare these values, using words like 
“greater than,” “less than,” or “about the same as.” 


¢ If you're asked to make a graph on a free-response question, be 
sure to label and scale your axes. Unless your calculator shows la- 
bels and scaling, don’t just transfer a calculator screen shot to your 
paper. 

* You may be asked to determine whether a quantitative data set 
has any outliers. Be prepared to state and use the rule for identify- 
ing outliers. 


* Use statistical terms carefully and correctly on the AP® exam. 
Don’t say “mean” if you really mean “median.” Range is a single 
number; so are Q), Q3, and JOR. Avoid colloquial use of language 
such as “the outlier skews the mean.” Skewed is a shape. If you 
misuse a term, expect to lose some credit. 


Chapter 2 

* Normal probability plots are not included on the AP® Statistics 
topic outline. However, these graphs are very useful for assessing 
Normality. You may use them on the AP® exam if you wish—just 
be sure that you know what you're looking for (a linear pattern). 


Chapter 3 


° If you are asked to make a scatterplot for a free-response question, 
be sure to label and scale both axes. Don’t just copy an unlabeled 
calculator graph directly onto your paper. 


* If you're asked to interpret a correlation, start by looking at a scat- 
terplot of the data. Then be sure to address direction, form, strength, 
and outliers (sound familiar?) and put your answer in context. 


¢ When displaying the equation of a least-squares regression line, 
the calculator will report the slope and intercept with much more 
precision than is needed. However, there is no firm rule for how 
many decimal places to show for answers on the AP® exam. Our 
advice: Decide how much to round based on the context of the 
problem you are working on. 


° Students often have a hard time interpreting the value of 7° on 
AP® exam questions. They frequently leave out key words in the 
definition. Our advice: ‘Treat this as a fill-in-the-blank exercise. 
Write “ % of the variation in [response variable name] 
is accounted for by the linear model relating [response variable 
name] to [explanatory variable name].” 


¢ The formula sheet for the AP® exam uses different notation for 


Sy 
these equations: b; =r * and by =y — byx. That’s because the 
Sx 
least-squares line is written as y = bo + byx. We prefer our simpler 
versions without the subscripts! 
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* If you're asked to describe how the design of a study leads to bias, 
you're expected to do two things: (1) identify a problem with the 
design, and (2) explain how this problem would lead to an under- 
estimate or overestimate. Suppose you were asked, “Explain how 
using your statistics class as a sample to estimate the proportion 
of all high school students who own a graphing calculator could 
result in bias.” You might respond, “This is a convenience sample. 
It would probably include a much higher proportion of students 
with a graphing calculator than would the population at large be- 
cause a graphing calculator is required for the statistics class. So 
this method would probably lead to an overestimate of the actual 
population proportion.” 


e If you are asked to identify a possible confounding variable in 
a given setting, you are expected to explain how the variable you 
choose (1) is associated with the explanatory variable and (2) affects 
the response variable. 


e If you are asked to describe the design of an experiment on the 
AP® exam, you won't get full credit for a diagram like Figure 4.5 
(page 246). You are expected to describe how the treatments are 
assigned to the experimental units and to clearly state what will be 
measured or compared. Some students prefer to start with a dia- 
gram and then add a few sentences. Others choose to skip the dia- 
gram and put their entire response in narrative form. 


¢ Don’t mix the language of experiments and the language of sam- 
ple surveys or other observational studies. You will lose credit for 
saying things like “Use a randomized block design to select the sam- 
ple for this survey” or “This experiment suffers from nonresponse 
since some subjects dropped out during the study.” 
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* On the AP® exam, you may be asked to describe how you will 
perform a simulation using rows of random digits. If so, provide 
a clear enough description of your simulation process for the 
reader to get the same results you did from only your written 
explanation. 


¢ Many probability problems involve simple computations that you 
can do on your calculator. It may be tempting to write down just 
your final answer without showing the supporting work. Don’t do it! 
A “naked answer,” even if it’s correct, will usually earn you no credit 
on a free-response question. 
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¢ You can write statements like P(B | A) if events A and B are de- 
fined clearly, or you can use a verbal equivalent, such as P(reads 
New York Times | reads USA Today). Use the approach that makes 
the most sense to you. 
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e If the mean of a random variable has a noninteger value but you 
report it as an integer, your answer will not get full credit. 


¢ When showing your work on a free-response question, you 
must include more than a calculator command. Writing 
normalcdf (68,70, 64, 2.7) will not earn you full credit for 
a Normal calculation. Ata minimum, you must indicate what each 
of those calculator inputs represents. Better yet, sketch and label a 
Normal curve to show what you're finding. 


¢ Don’t rely on “calculator speak” when showing your work on free- 
response questions. Writing binompdf (5, 0.25, 3) = 0.08789 
will not earn you full credit for a binomial probability calculation. 
At the very least, you must indicate what each calculator input rep- 
resents. For example, “I used binompdf (trials 5,p:0.25, 
x value:3).” 
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¢ ‘Terminology matters. Don’t say “sample distribution” when you 
mean sampling distribution. You will lose credit on free-response 
questions for misusing statistical terms. 


* Notation matters. The symbols p, x, p, [, 7, Li» Op» He, and 
oz all have specific and different meanings. Either use notation cor- 
rectly—or don’t use it at all. You can expect to lose credit if you use 
incorrect notation. 
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¢ On a given problem, you may be asked to interpret the confi- 
dence interval, the confidence level, or both. Be sure you under- 
stand the difference: the confidence interval gives a set of plausible 
values for the parameter and the confidence level describes the 
long-run capture rate of the method. 


¢ If a free-response question asks you to construct and interpret a 
confidence interval, you are expected to do the entire four-step pro- 
cess. That includes clearly defining the parameter, identifying the 
procedure, and checking conditions. 


¢ You may use your calculator to compute a confidence interval on 
the AP® exam. But there’s a risk involved. If you give just the calcu- 
lator answer with no work, you'll get either full credit for the “Do” 
step (if the interval is correct) or no credit (if it’s wrong). We recom- 
mend showing the calculation with the appropriate formula and 
then checking with your calculator. If you opt for the calculator- 
only method, be sure to name the procedure (e.g., one-proportion z 
interval) and to give the interval (e.g., 0.514 to 0.607). 

° If a question of the AP® exam asks you to calculate a confidence 
interval, all the conditions should be met. However, you are still re- 
quired to state the conditions and show evidence that they are met. 
¢ It is not enough just to make a graph of the data on your calcula- 
tor when assessing Normality. You must sketch the graph on your 


paper to receive credit. You don’t have to draw multiple graphs — 
any appropriate graph will do. 
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¢ The conclusion to a significance test should always include three 
components: (1) an explicit comparison of the P-value to a stated 
significance level, (2) a decision about the null hypothesis: reject or 
fail to reject Hp, and (3) a statement in the context of the problem 
about whether or not there is convincing evidence for H,. 


¢ When a significance test leads to a fail to reject Hg decision, be 
sure to interpret the results as “We don’t have enough evidence to 
conclude H,.” Saying anything that sounds like you believe Hp is 
(or might be) true will lead to a loss of credit. And don’t write text- 
message-type responses, like “FTR the Ho.” 


¢ You can use your calculator to carry out the mechanics of a sig- 
nificance test on the AP® exam. But there’s a risk involved. If you 
give just the calculator answer with no work, and one or more of 
your values are incorrect, you will probably get no credit for the 
“Do” step. We recommend doing the calculation with the appro- 
priate formula and then checking with your calculator. If you opt 
for the calculator-only method, be sure to name the procedure 
(one-proportion z test) and to report the test statistic (¢ = 1.15) and 
P-value (0.1243). 


¢ It is not enough just to make a graph of the data on your calcula- 
tor when assessing Normality. You must sketch the graph on your 
paper to receive credit. You don’t have to draw multiple graphs — 
any appropriate graph will do. 

¢ Remember: If you give just calculator results with no work and 
one or more values are wrong, you probably won’t get any credit for 
the “Do” step. If you opt for the calculator-only method, name the 
procedure (¢ test) and report the test statistic (t = —0.94), degrees 
of freedom (df = 14), and P-value (0.1809). 


Chapter 10 


¢ The formula for the two-sample z interval for p; — pz often leads 
to calculation errors by students. As a result, we recommend using 
the calculator’s 2-PropZInt feature to compute the confidence 
interval on the AP® exam. Be sure to name the procedure (two- 
proportion z interval) and to give the interval (0.076, 0.143) as part 
of the “Do” step. 


¢ The formula for the two-sample z statistic for a test about p; — p2 
often leads to calculation errors by students. As a result, we recom- 
mend using the calculator’s 2-PropZTest feature to perform 
calculations on the AP® exam. Be sure to name the procedure (two- 
proportion z test) and to report the test statistic (¢ = 1.17) and P- 
value (0.2427) as part of the “Do” step. 


¢ The formula for the two-sample t interval for ju; — p42 often leads 
to calculation errors by students. As a result, we recommend using 
the calculator’s 2-SampTInt feature to compute the confidence 
interval on the AP® exam. Be sure to name the procedure (two- 
sample t interval) and to give the interval (3.9362, 17.724) and df 
(55.728) as part of the “Do” step. 


¢ When checking the Normal condition on an AP® exam question 
involving inference about means, be sure to include graphs. Don’t 
expect to receive credit for describing a graph that you made on 
your calculator but didn’t put on paper. 


¢ The formula for the two-sample ¢ statistic for uy — 2 often leads 
to calculation errors by students. As a result, we recommend us- 
ing the calculator’s 2-SampTTest feature to perform calculations 
on the AP® exam. Be sure to name the procedure (two-sample t 
test) and to report the test statistic (t = 1.60), P-value (0.0644), and 
df (15.59) as part of the “Do” step. 
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¢ You can use your calculator to carry out the mechanics of a signif- 
icance test on the AP® exam. But there’s a risk involved. If you give 
just the calculator answer with no work, and one or more of your 
values is incorrect, you will probably get no credit for the “Do” step. 
We recommend writing out the first few terms of the chi-square 
calculation followed by “. . .”. This approach might help you earn 
partial credit if you enter a number incorrectly. Be sure to name the 
procedure (v7 GOF-Test) and to report the test statistic (y? = 11.2), 
degrees of freedom (df = 3), and P-value (0.011). 


¢ In the “Do” step, you aren’t required to show every term in the 
chi-square statistic. Writing the first few terms of the sum followed 
by “.. .” is considered as “showing work.” We suggest that you do 
this and then let your calculator tackle the computations. 


¢ You can use your calculator to carry out the mechanics of a signif 
icance test on the AP® exam. But there’s a risk involved. If you give 
just the calculator answer with no work and one or more of your 
values is incorrect, you will probably get no credit for the “Do” step. 
We recommend writing out the first few terms of the chi-square 
calculation followed by “. . .”. This approach might help you earn 
partial credit if you enter a number incorrectly. Be sure to name the 
procedure (x7-Test for homogeneity) and to report the test statistic 


(x? = 18.279), degrees of freedom (df = 4), and P-value (0.0011). 


* If you have trouble distinguishing the two types of chi-square tests 
for two-way tables, you’re better off just saying “chi-square test” than 
choosing the wrong type. Better yet, learn to tell the difference! 
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¢ The AP® exam formula sheet gives y = by + b,x for the equation 
of the sample (estimated) regression line. We will stick with our 
simpler notation, y = a + bx, which is also used by TT calculators. 
Just remember: The coefficient of x is always the slope, no matter 
what symbol is used. 


¢ The AP® exam formula sheet gives the formula for the standard 
error of the slope as 


Di — Hi) 
n-2 


5b, = See _ x 


‘The numerator is just a fancy way of writing the standard deviation 
of the residuals s. Can you show that the denominator of this for- 
mula is the same as ours? 

¢ The formula for the ¢ interval for the slope of a population (true) 
regression line often leads to calculation errors by students. As a 
result, we recommend using the calculator’s LinRegTInt feature 
to compute the confidence interval on the AP® exam. Be sure to 
name the procedure (t interval for slope) and to give the interval 


(—0.217, —0.108) and df (14) as part of the “Do” step. 


¢ When you see a list of data values on an exam question, don’t just 
start typing the data into your calculator. Read the question first. 
Often, additional information is provided that makes it unnecessary 
for you to enter the data at all. This can save you valuable time on 
the AP® exam. 


¢ The formula for the test statistic in a t test for the slope of a 
population (true) regression line often leads to calculation errors 
by students. As a result, we recommend using the calculator’s 
LinRegTTest feature to perform calculations on the AP® exam. 
Be sure to name the procedure (¢ test for slope) and to report the 
test statistic (t = 3.065), P-value (0.002), and df (36) as part of the 
“Do” step. 
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Texas Instruments released the new ‘Tl-Nspire CX in March 
2011. The new handheld no longer has an interchangeable 'T1-84 
faceplate; however, the body of the Nspire CX is much slimmer 
and its display is in full color. When you click (ctr ) and ar- 
row down to Color, there are several options available: Line Color, 
Fill Color, and Text Color. If you choose the Fill Color option, a 
color palette will appear. Then you can select the colors you want 
for your graphs. This feature is quite useful when displaying mul- 
tiple graphs. When creating a bar graph, the CX even allows you to 
change the color of each bar! 
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The keystrokes used for the new CX are the same as for the T'l- 
Nspire ‘Touchpad. The keystrokes for the older Nspire “clickpad” 
are still different in some ways; therefore they are still shown in 
parentheses when needed. 

Start by updating your device’s OS to ensure that your hand- 
held has full capabilities. Go to 1 and search under 
Downloads — Software, Apps, Operating Systems... to download 
the latest version of the OS. If you have the Tl-Nspire computer 
link software, you should be asked automatically to update your 


handheld’s OS. 
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Chapter 1 TI-Nspire Technology Corners You can now position the pointer (&) over each bar to examine 


2. Histograms on the calculator the classes. ‘The pointer will become an open hand ‘Sy and the 
class size will be displayed along with the number of data values 
1. Insert a New Document by pressing (ctrl) (n ). 


in the class. 
2. Insert a Lists G Spreadsheet page by arrowing down to Add Lists 


& Spreadsheet. 5. Adjust the classes to match those in Figure 1.16 (page 34). 

¢ Name column A foreignbrn. ¢ Arrow into an empty space inside the histogram. 

e ‘Type the data for the percent of state residents born outside the ° Press and select Bin Settings, Equal Bin Width. 
United States into the list. The data can be found on page 33. Enter the values shown. (tab ) to [ox] and press (enter). The 


new histogram should be displayed. 


‘Be 
Bi rein. 


2,16 20 24 28 
3. Insert a Data G Statistics page: press (et) (1 }, arrow to Add La deal 
Data & Statistics, and press (enter). 
e Press and Click to Add Variable on the horizontal axis Notice how the first bar is “off the page.” To adjust this, arrow 


will show the variables available. Select foreignbrn. over until the k becomes $. 


Frequency 


12 16 20 24 2 
foreignbrn 


¢ ‘The data should now move into a dotplot. Notice the organi- 
zation of the graph. Even though the data look “lopsided” in © Press and hold [©] until becomes Sy. 
some places, you should consider the dots as being directly 
above each other in each column. 


e Use the Navpad and, with the down arrow, “pull” the vertical 
axis down. Keep arrowing down until the top of the tallest 
histogram class is visible. 


: ; : 8 42.16 20 24 2 
4. ‘To make a better graphical display, let’s move the data into a foreignbrn 


histogram. Use the Navpad to position the pointer in an empty 
space within the graph. Press (tn ) and select Histogram. 


You will now see the data move into a histogram. 


6. See if you can match the histogram in Figure 1.17 (page 35). 


3. Making calculator boxplots 


One of the added benefits of the T'l-Nspire is its ability to plot 
more than three boxplots at a time in the viewing window. Let’s 
use the calculator to make parallel boxplots of the travel data for 
the samples from North Carolina and New York. 

1. Insert a Lists @ Spreadsheet page: press (et )(1) , arrow to Add 
ey ae Lists G Spreadsheet, and press (enter _). 


12 ‘ 
foreignbrn ¢ Name column A nearolina and column B newyork. 


[8.00, 10.00) 4 points 
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e Enter the travel time data from page 52. 


2. Insert a Data & Statistics page: press (ett) (1 }, arrow to Add 

Data & Statistics, and press (enter_}. 

Press and Click to Add Variable on the horizontal axis 
will show the variables available. Select ncarolina. 

¢ The data will move into a dotplot. Use the Navpad to position 
the pointer in an empty space within the graph. Press 
and select Box Plot. You will now see the data move 
into a boxplot. 

e Using the Navpad, arrow over the plot. You will see the values 
in the five-number summary display one by one as you move 
across the boxplot. 


0 5 10 15 20 25 30 35 40 45 SO 55 606 
ncarolina i 


3. To add the boxplot of newyork travel times, arrow over “ncaroli- 


na” on the horizontal axis, press (menu), and choose Add X 


Variable. Select newyork and the second boxplot will be added 
to the page. 


ncarolina 


newyork 


10 


20 30 40 50 60 70 80 90 


@Ncarolina @newyork 


4. Computing numerical summaries with technology 


Let’s find numerical summaries for the travel times of North Caro- 
lina and New York workers from the previous ‘Technology Corner. 
If you haven't already done so, enter the North Carolina and New 
York data. 

* Insert a Lists © Spreadsheet page: press (et) (1 |, arrow to 
Add Lists & Spreadsheet, and press (enter). Name column A 
nearolina and column B newyork. 

e Arrow down to the first empty cell in column A and type in 
the data. Repeat the process for newyork in column B. 

1. The Nspire can calculate one-variable statistics for several lists at 
the same time (unlike the T1-84 or TI-89). 
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e Press — Statistics — Stat Calculations — One- 
Variable Statistics. A dialogue box should appear asking for 
the number of lists. Press the up arrow (4) to 2 or type “2.” 


tab) to [0K] and press (enter), 


¢ Another dialogue box should appear. Select the lists in the 
drop-down boxes: X1 list: nearolina and X2 list: newyork. 
between the entry boxes to enter the next list and the 


“co” 


column where you want the one-variable stats listed: type “c’”, 


(tab }to [ok], and press (enter). The numerical summaries for 


both states should now be displayed. 


2. You can resize the columns to see which column contains val- 
ues for which state: D has summary statistics for nearolina, and 
E has summary statistics for newyork. 
e Use the Navpad to place the arrow between the columns. Press 
and hold [%] until 44 appears. Use the arrow keys to increase 
the column width. Press [77] again to release the column. 


¢ Repeat the same process to resize the column with one-vari- 
able statistics for newyork. 
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5. From z-scores to areas, and vice versa 


Finding areas: The normedf command on the Nspire can be used to 
find areas under the Normal curve. ‘The syntax is normalcdf (lower 
bound, upper bound, mean, standard deviation). Let’s use this com- 
mand to confirm our answers to the examples on pages 116-118. 

1. On the Home screen, select the Calculate scratchpad. This will 
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take you to a calculator page that is “outside” your current docu- 
ment. Therefore, you do not have to worry about losing/saving 
a document you are working on or about adding an unneeded 
page to that document. 


What proportion of observations from the standard Normal 

distribution are greater than —1.78? 

Recall that the standard Normal distribution has mean 0 and 
standard deviation 1. 

2. On the Calculate scratchpad, press — Statistics > 
Distributions — Normal Cdf. In the dialogue box that appears, 
type the numbers shown. To move between the drop-down box- 
es, press after typing each number. When the last number 


is entered, to [ox] and press (enter_). 


Lower Bound: | -1 78 } 
Upper Bound: | 10000 } 


The proportion should now be displayed on the main screen. 
Note: We chose 10,000 as the upper bound because it is many 
standard deviations above the mean. These results agree with our 
previous answer using ‘Table A: 0.9625. 


0.962462 5 


normCadf{-1.78,10000,0,1) 
| 


What proportion of observations from the standard Normal 
distribution are between —1.25 and 0.81? 


The following screen shot confirms our earlier result of 0.6854 
using Table A. 


normCadf{-1.25,0.81,0,1) 0.68538 = 


Working backward: The Nspire invNorm function calculates the 
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value corresponding to a given percentile in a Normal distribution. 

For this command, the syntax is inv Norm(area to the left, 1, 0). 

3. Let’s start with a “clean slate” by clearing the entries on our 
page. To do this, press — Actions — Clear History. 


Your scratchpad should now be blank. 
What is the 90th percentile of the standard Normal distribution? 


e Press — Statistics > Distributions > Inverse Normal. 
A dialogue box will appear. Type the numbers in the dialogue 
box as shown. To enter the numbers, between the entry 
boxes. When the last number is entered, ( tab_) to [0K] and press 


invNorm(0.9,0, 1) 
| 


6. Normal probability plots 


We will use the state unemployment rates data from page 122 to 
demonstrate how to make a Normal probability plot for a set of 
quantitative data. 

1. Insert a New Document by pressing (ett ) (J. 

2. Insert a Lists @ Spreadsheet page by arrowing down to Add Lists 
@ Spreadsheet. 
¢ Name column A unemploy. 

e Arrow down to the first cell and type in the 50 data values. 

3. Insert a Data G Statistics page by pressing (etn ) 0) and use the 
Navpad to arrow to Add Data © Statistics. Press (enter ). 

4. Press to select the “Click to add variable” for the horizontal 
axis. Arrow to unemploy and press to select it. The data 
will now move into a dotplot. 

5. Arrow up into an empty region of the dotplot and press (etn ) 
(menu ). Select Normal Probability Plot and press [%). 


ob plata 


‘Expected 
o 
Oo 


4567 8 910111213 1415 
eb aed 


Interpretation: ‘The Normal probability plot is quite linear, so it is 
reasonable to believe that the data follow a Normal distribution. 
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Chapter 3 TI-Nspire Technology Corners ees 70,583 129,484 29,932 29,953 24,495 75,678 8359 4447 
7. Scatterplots on the calculator oe 
b: Tnsetha New Docu by pressing (at) (). (ini dela 21,994 9500 29,875 41,995 41,995 28,986 31,891 37,991 
2. Insert a Lists G Spreadsheet page by arrowing down to Add Lists Miles 34,077 58,023 44,447 68,474 144,162 140,776 29,397 131,385 
& Spreadsheet. driven 
¢ Name column A points and column B wins. nee 34,995 29,988 22,896 33,961 16,883 20,897 27,495 13,997 
e ‘Type the corresponding values into each column. The data Coo 
list follows. 1. Insert a New Document by pressing (ett) [ N }. 
oo 2. Insert a Lists @ Spreadsheet page by arrowing down to Add Lists 
Points: 34.8 36.8 25.7 25.5 320 15.8 © Spreadsheet. 
Tine: 12 ll 8 7 10 5 e¢ Name column A miles and column B price. 
e ‘Type the corresponding values into each column. 
Points: 35.7 16.1 25.3 30.1 = 20.3. 26.7 3. Graph the data in a scatterplot putting miles on the horizon- 
Wins: B 3 7 i 5 6 tal axis and price on the vertical axis. Refer to the previous 
—— SSS SSS 'TL-Nspire ‘Technology Corner. 
4. To add a least-squares regression line, first (etn) 4 back to the 
Lists @ Spreadsheet page. 
5. Press (menu), and arrow to Statistics > Stat Calculations, Lin- 
ear Regression (a + bx), (enter ). You should then see a dialogue 
box. In the drop-down boxes, arrow down to miles for the X 
List:, then press and arrow down to price for the Y List:. 
to [ox] and press (enter). 
fies __ 
vist 
3. Press (ctr) (J and use the Navpad to arrow to Add Data G ==: 


Statistics. Press (enter ), frequency List 


4. Press to select the “Click to add variable” for the horizontal Sadie ae | | 
: . Include Categories |v 
axis. Arrow to points and press to select it. 
[ox] {cance 
el | 
Caption: points The linear regression information, a, b, 7’, r, and resid will be 
@155 @20! j displayed in another column within the Lists G Spreadsheet page. 
21994 Title Linear Re. 
9500 RegEqn atb*x 
29875 a 38257.1 
41995 b 0.162919 
5. Press again and the box will move to the vertical axis. Select 41995 1? 0.664248 
wins. ‘The data will now move into a scatterplot. T'l-Nspire la- rp [LinRagialiniien price 1 Gopyvel <1 >| 
bels the x and y axes with the list names, making a well-labeled 
graph to insert into documents. 6. Press (ci )> to return to the Data G Statistics page. Press (menu ). 


8. Least-squares regression lines on the calculator 


arrow to Analyze — Regression — Show Linear (a + bx), and 
press (enter). The least-squares regression line along with the 
equation will appear. If you arrow over the equation, the = 
will appear. Click and hold [%]. When the hand closes, 4, you 


can move the equation using the arrow keys. 


price 


+-0.162919'x 


Let’s use the Ford F-150 data to show how to find the equation 
of the least-squares regression line on the 'T'l-Nspire. Here are the 
data. 
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7. Save the document for later use. Press (ete ) (s}. Name your 
document Truck prices. 


9. Residual plots on the calculator 


Let’s continue the analysis of the Ford F-150 miles driven and 

price data from the previous Technology Corner. You should have 

already made a scatterplot, calculated the equation of the least- 

squares regression line, and graphed the line on your plot. Now 

we want to make a residual plot. 

1. Open the document Truck prices. Press (ttt ) (J, arrow through 
My Documents — Truck prices and press (enter ). 

2. Press (ett_)> to go to the Data & Statistics page. 

3. Press (menu ); arrow to Analyze — Residuals — Show Residual 
Plot. This will split the screen and the residual plot will be dis- 
played below the graph of the least-squares regression line. 
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10. Choosing an SRS 


The TI-Nspire has a function called randSamp that will randomly 
select individuals for a sample with or without replacement from 
a population. 
1. Check that your calculator’s random number generator is 
working properly. 
¢ Open the Calculator Scratchpad by pressing (or (a) 
(@) on the keypad). 
¢ Type randint(1,1750) and press (enter _). 
¢ Compare results with your classmates. If several students have 
the same number, you'll need to seed your calculator’s random 
number generator with different numbers before you proceed. 
‘Type randSeed, press [44 ] (to insert a space), type < last four 
digits of your phone number> and press (enter). Done should 
appear. Now your calculator is ready to generate numbers that 
are different from those of your classmates. 


randInt{1,1750) 
RandSeed 4684 


2. Insert a Lists G Spreadsheet page. If you already have a docu- 
ment open, press (ete! } (1) and select Add Lists @ Spreadsheet. 
If you do not have a document already open, press ( 
on the clickpad), then (4). Press and select Add Lists G 
Spreadsheet. 


3. Name column A students. Arrow down to the formula cell and 


press (enter ). students:= should appear. Type seq(x,x,1,1750) 
and (enter ). ‘This will put the digits 1 through 1750 in this list. 


. Name column B sampstudents. Arrow down to the formula 


cell and press (enter ). sampstudents:= should appear. Type 
randSamp(students,10,1) and press (enter). This function will 
take a random sample of 10 students from the list. “1” lets the 
function know to do the sampling without replacement. 


[g=sea0ex frrendsamect] 


562 
412 


1527 
534 
1529 


Note: Sampling with replacement is the default setting for this 
function. You can use 0 as the third input in the randSamp com- 
mand or close the parentheses after the second input. 


Chapter 6 TI-Nspire Technology Corners 
11. Analyzing random variables on the calculator 


Let’s explore what the calculator can do using the random vari- 
able X = Apgar score of a randomly selected newborn from the 
example on page 349. 
1. Insert a Lists G Spreadsheet page. Press (et) (1), arrow to Add 
Lists G Spreadsheet, and press (enter _). 
¢ Name column A apgar and column B apgrprob. 
e Enter the values of the random variable (0 — 10) in the apgar 
list and the corresponding probabilities in apgrprob. 


2. Graph a histogram of the probability distribution. 


¢ Insert a Data © Statistics page. Press (etn ) (1), arrow to Add 
Data © Statistics, and press (st) 

© Press (ctrl ) and select Add X Variable with Summary 
List. Press and a dialogue box should appear. apgar 
should be in the X List and apgrprob should be in the 
Summary List. If they are not, use the drop-down boxes to 
select your variables. When your box looks like the one here, 


to [ok] and press (enter _). 


za 


[age _ T) 


apgiprob 
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The probability histogram should now be displayed. 


apgrprob 


3. To calculate the mean and standard deviation of the random 
variable, use one-variable statistics with apgar as the Data List 
and apgrprob as the Frequency List. 
© Press (ett) ¢ to go back to the Lists @ Spreadsheet page. 
© Press — Statistics — Stat Calculations > One- 
Variable Statistics. 

¢ Make sure your X1 List, Frequency List, and I‘ Result Column 
have the variables/values shown (you can press the down ar- 
row in the drop-down boxes to access the variable names and 


type C for 1* Result Column). (tab_) to [0K] and press (enter _). 


Frequency List | ‘apgrprob ] 


Category List 


Include Categones: ] 


1st Result Cotummn: | c 


fox}fconce 


e The statistics should now be displayed in your Lists @ Spread- 
sheet page. 


0.001 Title One-Var... 
0.006 & 8.128 


0.007 =x 8.128 
0.008 =x? 68.13 
0.012 sx := Sn-... HUNDEF 


12. Binomial coefficients on the calculator 


3) on the 'T'l-Nspire, 


proceed as follows. Open the Calculator Scratchpad by pressing 
(or (at) (a) on the clickpad). Press — Probability 
> Combinations, and then (enter), nCr( will appear. Complete 
the command nCr(5,2) and press (enter ). 


8 LW |Scrmtctood > Bae] 
ncr{5,2) h 
| 


To calculate a binomial coefficient like 


rk 
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13. Binomial probability on the calculator 


There are two handy commands on the ‘TI-Nspire for finding 
binomial probabilities: 


binomPdf£ (n,p,k) computes P(X = k) 
binomCdf£ (n,p,k) computes P(X = k) 


You will need to open the Calculator Scratchpad (press or 
@) on the clickpad). ‘These two commands can be found 
in the Distributions menu within the Statistics menu. You can 
access them by pressing (menu) —> Statistics > Distributions. A 
dialogue box will appear. Input n (the number of observations), p 
(probability of success), and k (number of successes). 


Ll — 


Prob Success. p: 
a 


For the parents having n = 5 children, each with probability 
p = 0.25 of type O blood: 


P(X = 3) = binomPd£ (5,0.25,3) = 0.08789 
To find P(X > 3), we used the complement rule: 


P(X > 3) =1- P(X S 3) = 1 — binomCd£ (5,0.25,3) 
= 0.01563 


Of course, we could also have done this as 


P(X > 3) = PX = 4) + P(X =5) 


= binomPdf (5,0.25,4) + 
binomPdf (5,0.25,5) 


= 0.01465 + 0.00098 = 0.01563 


On the TL-Nspire, you can also calculate using 
P(X > 3) = P(X = 4) + P(X =5) 
= binomCdf (5,0.25,4,5) 
= 0.01563 


0.087891 | 
0.015625 
0.015625 


binomPdf{5,0. 25,3} 
1-binomCdf(5,0.25,0,3) 
binomCadf(S,0.25,45) 


14. Geometric probability on the calculator 


There are two handy commands on the ‘TI-Nspire for finding 
geometric probabilities: 


geomPdf (p,k) computes P(Y = k) 


geomCdf (p,k) computes P(Y = k) 
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You will need to open the Calculator Scratchpad (press or 
(a). These two commands can be found in the Distributions 
menu within the Statistics menu. You can access them by pressing 


— Statistics — Distributions. A dialogue box will appear. 


Input p (probability of success) and k (number of trials to get the 
first success). 


For the Lucky Day Game, with probability of success p = 1/7 on 
each trial, 


P(Y = 10) = geomPd£ (1/7,10) = 0.0357 
To find P(Y < 10), use geomcdfé : 
PY < 10) = P(Y S 9) = geomCd£ (1/7,9) = 0.7503 


1 0.035676 = 
geom Pd. 1 10) 
\7 } 


1 
geomcad +9} 
\7 J 


0.750265 
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15. Confidence interval for a population proportion 


The T'l-Nspire can be used to construct a confidence interval for 
an unknown population proportion. We'll demonstrate using the 
example on page 500. Of n = 439 teens surveyed, x = 246 said 
they thought young people should wait to have sex until after 
mattiage. 
To construct a confidence interval: 
© Press (Ca) (a)) to insert a Calculator Scratchpad. 
© Press — Statistics —> Confidence Intervals —> 
1-Prop z Interval. 
¢ A dialogue box will appear: Enter the values as shown below. 


(tab_) to [ok] and press (enter _). 


Successes, x 


The lower and upper bounds of the confidence interval are 
reported, along with the sample proportion p, the margin of error 
(ME), and the sample size. 
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8 WW | Scratchpad > as 


zInterval_1Prop 246,439,0.95: stat results a 
‘Title’  "1—-Prop z Interval” 
"CLower" 0.513935 
"“CUpper" 0.606794 
"p" 0.560364 
"ME" 0.04643 
“n” 439 
v 
1799 


16. Inverse ton the calculator 


The TI-Nspire allows you to find critical values t* using the 
inverse t command. As with the calculator’s inverse Normal com- 
mand, you have to enter the area to the left of the desired critical 
value. Let’s use the inverse t command to find the critical values 
for parts (a) and (b) in the example on page 513. 

© Press (or (a) (A)) to insert a Calculator Scratchpad. 

e Press — Statistics — Distributions — Inverse t. 

e A dialogue box will appear. For part (a), enter .025 for the 

Area and 11 for the Deg of Freedom, df. (tab_) to [ox] and press 


Area: [0.025 


Deg of Freedom, df} 11 


[ox] |cancea} 


For part (b), enter .05 for the Area and 47 for the Deg of Freedom, 
df. to [0k] and press (enter_)- 


a + 
DegorFreedom ot [47 __—~i 


[ox}{cancel 


¢ The critical values t* should now be displayed. 


invt(0.025,11) 
invt(0.05,47) 
| 


-2.20099 Fj 


“1.67793 | | 


17. One-sample fintervals for ,. on the calculator 


Confidence intervals for a population mean using t procedures 
can be constructed on the 'T'l-Nspire, thus avoiding the use of 
‘Table B. Here is a brief summary of the techniques when you 
have only numerical summaries and when you have the actual 
data values. 
1. Using summary statistics: Auto pollution example, page 519 

* Insert a Lists @ Spreadsheet page: Press (etn) (1 ] and select 

Add Lists @ Spreadsheet. 
© Press — Statistics —> Confidence Intervals > t 


interval. 
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e The first dialogue box that appears asks for Data or Stats 
in the drop-down box. Select Stats, (tab ) to [ox], and press 


¢ In the next dialogue box, enter the values shown. 


xz 


© (tab ) to [ok] and press (enter ). 


e The results should now appear in the spreadsheet. 


=tinterval( 
Title 
CLower 1.16094 
CUpper 1.37406 
R 1.2675 
ME 0.106563 


Yel = 
jer]="tinterva” 


2. Using raw data: Video screen tension example, page 520 
Enter the 20 video screen tension readings data using the fol- 
lowing procedure. 
* Insert a Lists @ Spreadsheet page: Press (ett ) (1) and select 
Add Lists @ Spreadsheet. 
¢ Name the first column screen. 
e Arrow down to the first cell and enter the 20 values. 


To construct the ¢ interval: 

Press — Statistics —> Confidence Intervals > t 
interval. 

e The first dialogue box that appears asks for Data or Stats in the 
drop-down box. Select Data, (tab ) to [ox], and press (enter_). 

¢ Inthe next dialogue box, select the data list, screen, (tab ) to [0K |, 


and press (enter _). 


ust 
Frequencytist (1 


C Let 
1st Result Column: | ¢[] } 


| Cancel| 


¢ The results should now appear in the spreadsheet. (You may 
have to scroll up to see them.) 


A-13 


= screen | | 
=tinterval(’ 
289.374 
323.266 
306.32 
16.9465 


269.5 Title 
297. CLower 


269.6 CUpper 
283.3 x 
304.8 ME 
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18. One-proportion z test on the calculator 


The TI-Nspire can be used to test a claim about a population 
proportion. We’ll demonstrate using the example on page 559. 
In a random sample of size n = 500, the supervisor found x = 47 
potatoes with blemishes. ‘To perform a significance test: 
e Press (8) (or (or (ai) (a)) to insert a Calculator Scratchpad. 
© Press (menu) —> Statistics > Stat Tests —> 1-Prop < test. 
¢ A dialogue box will appear. Enter the values shown: po = 0.08, 
x = 47, and n= 500. Specify the alternative hypothesis as 
“H,: prop > po.” to [ox] and press (enter). 
Note: x is the number of successes and n is the number of trials. 
Both must be whole numbers! 


Po 
Successes, x: | 47 


Alternate Hyp: | Ha: prop > po 


Cancel| 


You can see that the test statistic is z = 1.15392 and the P-value is 
0.1243. 


zTest_1Prop 0.08,47,500, 1: stat. results 
"Tite" "1-Prop z Test" 
“Alternate Hyp" "prop > po" 
Lg 1.15392 
"PVal" 0.124267 
"p* 0.094 
a" $00 


‘To display the P-value as a shaded area under the Normal curve: 
e Press and select the Lists @ Spreadsheet icon | 
© Press (menu) — Statistics > Stat Tests > L-Prop z test. 

e A dialogue box will appear: Enter the values shown below. 


Check the box to Shade P Value. sto Shade P Value. (a Jo ) to [ox] and press (enter ). 
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PVal 40.1243 


19. Computing P-values from ¢ distributions on the 
calculator 


You can use the tedf command on the T'T-Nspire to calculate areas 
under a ¢ distribution curve. ‘The syntax is tedf(lower bound,upper 
bound,df). To use this command: 


¢ Press (or (at) @)) to insert a Calculator Scratchpad. 

© Press — Statistics — Distributions > t Cdf. 

e In the dialogue box that appears, enter your lower and upper 
bound and degrees of freedom. 


Lower Bound. 
Upper Bound 
Deg of Freedom. df. 


Cancel 


Use the t Cdf command to compute the P-values from the ex- 
amples on pages 577 and 578. 
© Better batteries: To find P(t = 1.54), use Lower Bound: 1.54, 
Upper Bound: 10000, and df14. 
¢ Two-sided test: To find the P-value for the two-sided test with 
df= 36 and t= — 3.17, execute the command 2 - tCdf 
(— 10000, —3.17,36). 


0.072927 4 
0.003108 


tCdf{ 1.54, 10000, 14) 
2: tcarl-10000,-3.17,36) 
| 


20. One-sample ftest for a mean on the calculator 


You can perform a one-sample t test using either raw data or 
summary statistics on the Tl-Nspire. Let’s use the calculator to 
carry out the test of Ho: . = 5 versus H,: 4p < 5 from the dissolved 
oxygen example on page 580. 

Start by entering the sample data into a column in a Lists G 
Spreadsheet page. Name the column oxygen. Then, to do the test: 

e Press — Statistics — Stats Tests — t Test. 

e The first dialogue box that appears asks for Data or Stats in 
the drop-down box. Make sure Data is selected. (tab ) to [0k 
and press (enter_). 

¢ In the next dialogue box, enter the values shown in the follow- 
ing box. To just “calculate,” leave the Shade PValue option 


unchecked. Then (tab_) to [0k] and press (enter ). 
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=tTest(5,'o 
4.53 Title 
5.04 Alternate... 1 < pO 
3,.29t 0.942556 
5.23 PVal 0.180945 
4.13 df 


The test statistic is t= — 0.94 and the P-value is 0.1809. 
If you check Shade P Value, you see a t-distribution curve 
(df = 14) with the lower tail shaded. 


If you are given summary statistics instead of the original data, you 
would select the “Stats” option in the drop-down box. 
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21. Confidence interval for a difference in proportions 


The TL-Nspire can be used to construct a confidence interval for 
pi — p2. We'll demonstrate using the example on page 617. Of 

n, = 799 teens surveyed, X = 639 said they used social networking 
sites. Of nz = 2253 adults surveyed, X = 1555 said they engaged 
in social networking. ‘To construct a confidence interval: 


e Press (2) @)) to insert a Calculator Scratchpad. 

© Press — Statistics —> Confidence Intervals > 
2-Prop z Interval. 

e A dialogue box will appear. Enter the values shown below. 


tab ) to [ox] and press (enter ). 


Successes, x1: 


nt 


Successes, x2 


n2: 
C Levet 


Chapter 10 TI-Nspire™ Technology Corners 


kInterval_2Prop 639,799, 1555,2253,0.95> @ 


“Title” "2-Prop z Interval” 
0.143242 
0.109559 
0.033683 


0.79975 
0.690191 


799 
2253 


22. Significance test for a difference in proportions 


The T'l-Nspire can be used to perform significance tests for com- 
paring two proportions. Here, we use the data from the Hungry 
Children example on page 622. 


To perform a test of Ho: p — p2 = 0: 


Press (or (a) @)) to insert a Calculator Scratchpad. 

© Press — Statistics > Stat Tests —> 2-Prop z test. 

e A dialogue box will appear. Enter the values shown: x, = 19, 
n, = 80, x2 = 26, nz = 150. Specify the alternative hypothesis 
Hi: pi ¥ pz as shown. 


® (tab } to [ox] and press (enter_). 


Successes, x1 
ni 
Successes, x2. 
n2 


Alternate Hyp: | Ha: pl # p2 } 
Cancel| 


You will see that the z statistic is z = 1.168 and the P-value is 
0.2427, as shown here. Do you see the combined proportion of 
students who didn’t eat breakfast? It’s the f value, 0.1957. 


zTest_2Prop 19,80, 26,150,0 star results 
“Title” “2-Prop z Test" 
"Alternate Hyp” “pl ep2" 
"3" 1.16835 
0.242667 


0.2375 
0 173333 
0.195652 

80 
150 


‘To display the P-value as a shaded area under the standard 
Normal curve: 


Press and select the Lists @ Spreadsheet icon (=) 
© Press — Statistics > Stat Tests —> 2-Prop z test. A 
dialogue box will appear. 


e Enter the values shown in the following box. Check the box 


to Shade P Value. to [0k] and press (enter_). 
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Successes, x1 fo sd 
a | 
Successes, x2 
m2: [150 ] 
sean 
Ist Result Column. a 


23. Two-sample fintervals on the calculator 


You can use the two-sample t interval command on the TI-Nspire 
to construct a confidence interval for the difference between two 
means. We'll show you the steps using the summary statistics from 
the pine trees example on page 641. 


e Press (or (a) (A)) to insert a Calculator Scratchpad. 

e Press — Statistics — Confidence Intervals > 
2-Sample t interval. 

e In the first dialogue box, select Stats in the drop-down menu. 


to [0k] and press (enter_). Another dialogue box will appear. 


e Enter the summary statistics shown: 


e Enter the confidence level: C level: .90. For pooled: choose 


“No.” (We'll discuss pooling later.) to[ox]and press (enter ). 


tInterval_2Samp 34.53,14 26,30,23.7,17.» 
“Tite” "2-Sample t Interval” 
"“CLower" 3.93617 
“CUpper” 17.7238 
10.83 
6.89383 
55.7277 


34.53 
23.7 
14.26 
17.5 
30 
30 
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24. Two-sample ftests on the calculator 


‘Technology gives smaller P-values for two-sample t tests than the 
conservative method. That’s because calculators and software 
use the more complicated formula on page 640 to obtain a larger 
number of degrees of freedom. 

Start by entering the sample data into a column in a Lists G 
Spreadsheet page. Name column A calcium and enter the Group 
1 data. Name column B placebo and enter the Group 2 data. 
Then do the test: 

© Press — Statistics — Stats Tests — 2-Sample t Test. 
e In the first dialogue box, select Data in the drop-down menu. 
(tab) to [0K] and press (enter_). 


e In the next dialogue box, enter the values shown, (tab ) to [0K 


and press (enter_). 


List 2 [ placebo 


Frequency 1 
Frequency 2 
Alternate Hyp 
Pooled [No 
1st Result Column. [eq 
Draw. 


Note: To just “calculate,” leave the Shade P value option un- 
checked. 
e The results should now appear in the spreadsheet. 


“1 Title 2-Samp 

12 Alternate... 411 > 4.2 
1t 1.60372 
3 PVal 0.06442 
3 df 15.5905 


ae a1] 


If you check the Shade P value box, the appropriate t distribu- 
tion will also be displayed, showing the same results and the 
shaded area corresponding to the P-value. 


15.5905 | | 
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25. Finding P-values for chi-square tests on the calculator 


To find the P-value in the M&M’S® example on page 685 with 
your calculator, use the y7cdf command. We ask for the area 
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between x? = 10.180 and a very large number (we'll use 10,000) 
under the chi-square density curve with 5 degrees of freedom. 


e Press and the Calculator Scratchpad should appear. 
© Press (menu) —> Statistics > Distributions > y°Cdf. 


e In the dialogue box that appears, enter the values shown in 


the following box. (‘tab ) to [ok] and press (enter ). 


Lower Boung: | 10 180 } 
oe a 
Deg of Freedom, af | 5 ] 


[ox] [cance 


x’Cdf{10 18, 10000,5) 0.070293 Fj 


As the calculator screen shot shows, this method gives a more 
precise P-value than Table C. 


26. Chi-square test for goodness of fit on the calculator 


You can use the TI-Nspire to perform the calculations for a chi- 

square test for goodness of fit. We'll use the data from the hockey 

and birthdays example on page 688 to illustrate the steps. 

1. Enter the observed counts and expected counts in two separate 
columns in a Lists G@ Spreadsheet page. Name the columns ob- 
served and expected. 


Birthday Observed Expected 
Jan-Mar 32 20 
Apr-June 20 20 
July-Aug 16 20 
Sept-Dec 12 20 


2. Perform a chi-square test for goodness of fit. 

© Press —> Statistics > Stat Tests > x? GOF. 

e In the dialogue box that appears, enter the values shown in 

the following box. to [ok] and press 

If you leave the Shade P value box unchecked, you'll get the test 
results within the spreadsheet containing the test statistic, P-value, 
and df. If you check the Shade P value box, you'll get a picture 
of the appropriate chi-square distribution with the test statistic 
marked and shaded area corresponding to the P-value. 


Observed List | observed ] 


Deg of Freedom. df fa 


1st Result Column | c{) 


Draw [] Shade P Value | 
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We'll discuss the Comp List results later. 


27. Chi-square tests for two-way tables on the calculator 


You can use the 'TI-Nspire to perform calculations for a chi-square 
test for homogeneity. We'll use the data from the restaurant study 
on page 704 to illustrate the process. 


1. Press (Ca) (a)) to insert a Calculator Scratchpad. 


2. Define a matrix by doing the following: 
¢ Name your matrix by typing musicinfluence (ctr! ) “=” @: 


o |¥o|¥o| e 
bo | [pa] | Ped 
at | || Sat | 


Matrix 
Number of rows 


Number of columns [> -e 


¢ Type in the corresponding row data, pressing between 
entries. Press when finished, 


‘iy 


3 
3 
musicinfluence | 


w 
) 

“3 

- oOo 
w 

) 


3. To perform the chi-square test, do the following steps: 
e Press — Statistics > Stat tests > y° 2-way Test. 
e Specify the observed matrix, to [0k], and press (enter ). 
e The results will be displayed and the expected matrix and 
component matrix will be calculated. 
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x’ 2way musicinfluence: stat results 


"Title" "x' 2-way Test" 
“gee 18.2792 

“PVal” 0 001088 
“af” 4 


“Bp Mari" "Ey" 
[Comp Matix i OS ‘od 


4. To see the expected counts and component matrix, press 
and select stat.expmatrix for the expected matrix or stat.comp- 
matrix for the component matrix. 


10.716 
39.0617 


stat ExpManix 
34.2222 30.5556 
10.716 9.5679 10.716 
39.0617 348765 39.0617 


stat. CompMatrix 
0.520924 2.33374 0520924 
6.4038 
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28. Confidence interval for slope on the 
calculator 


Let’s use the data from the Ford F-150 truck example on page A-9 
to construct a confidence interval for the slope of a population 
(true) regression line on the 'T'l-Nspire. 

1. Insert a Lists @ Spreadsheet page, and name column A miles 
and column B price. Type the corresponding values into each 
column. 

2. ‘To construct a confidence interval: 

e Press — Statistics — Confidence Intervals — Lin- 
ear Reg t Intervals. 
e In the first dialogue box, select Slope. (tab_) to [0K] and press 
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e In the next dialogue box, select miles for the X List and price 
for the Y List. Enter the rest of the values shown. to 
and press (enter_). 


ant [sere Do 


{cancel} 


x : 
vue 
sev regents [11 
Frequency List: a 
Clevet [os ] 
IstResultCoumn[c}) 


[cancer| 


=LinRegtir 


9500 RegEqn a+b*x 

29932 -20875CLower 0.229313 
20953 41995 CUpper 0.096524 
24495 41995 0.162919 
75678 28986 ME 0.066395 ff 

8359 31891 df 14, 

4447 37991s 5740.13 
34077 34995 SESlope 0.030956 
58023 20988.a 38257.1 
44447 22896 '' 0.664248 | 
68474 33961 0.815014 

16883 Resid —_(-4763.85 


29. Significance test for slope on the calculator 


Let’s use the data from the crying and IQ study on page 754 to 
perform a significance test for the slope of the population regres- 
sion line on the TI-Nspire. 

1. Insert a Lists G Spreadsheet page, and name column A crycount 
and column B iqscore. ‘Type the corresponding values into each 
column. 

2. To doa significance test: 

e Press — Statistics — Stat Tests — Linear Reg t Test. 
¢ Select crycount for the X List and iqscore for the Y List. Enter 


the rest of the values as shown. (tab ) to [ox] and press (enter ). 


Y List 
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crycount™ iqscore 


87 Title Linear Re 

97 Alternate... B & p #0... 
103 RegEqn = atb*x 
106t 3.06549 
109 PVal 0.004105 
114 df 36. 
1194 91.2683 
132b 1.4929 
136s 17.4987 
159 SESlope 0.487001 

90 0,207 
100r 0.454973 
103 Resid {-19.1972.. 


30. Transforming to achieve linearity on the calculator 


We'll use the planet data on page 779 to illustrate a general 
strategy for performing transformations with logarithms on the 
'TL-Nspire. A similar approach could be used for transforming data 
with powers and roots. 

1. Insert a Lists G Spreadsheet page, and name column A distance 
and column B period. ‘Type the corresponding values into each 
column. 

2. Make a scatterplot of y versus x and confirm that there is a 
curved pattern. 

* Insert a Data Statistics page. Press (ett ) CJ and select Add 
Data © Statistics. 

@ Press and select distance for the horizontal axis. Press 
again and select period for the vertical axis. 


0 S$ 10 15 20 25 30 35 40 
distance 


anc 
! 


3. To “straighten” the curve (that is, determine the relationship), 
we can use different models of the explanatory-response data to 
see which one provides a linear relationship. 
¢ Press (et) 4 to return your spreadsheet. Name column c 
Indistance and column d Inperiod. 

e In the formula cell for Indistance, press and enter 
In(distance) to take the natural log of the distance values. 

¢ Repeat this step for Inperiod using the period data. 

4. To see if an exponential model fits the data: 

e Insert another Data G Statistics page. 

¢ Put distance on the horizontal axis and Inperiod on the verti- 
cal axis. If the relationship looks linear, then an exponential 
model is appropriate. 


istance 


5. ‘To see ifa power model fits the data: 


e Using the same Data @ Statistics page, change the horizontal 


axis to Indistance. 


e If this relationship looks linear, then a power model is 


appropriate. 


-1.0 00 1D. 2 
Indistance 


6. Ifa linear pattern is present, calculate the equation of the least- 


squares regression line: 


e In the spreadsheet, press — Statistics — Calcula- 


tions — Linear Regression(a + bx). 


e In the dialogue box, select Indistance for X List, Inperiod for 


3.0 


0 5 10 19 20 25 30 35 40 


4¢ 
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Y List, and enter the rest of the values as shown. [tab ) to 


and press (enter_). 


X List | ‘crycount 


Save Reg qn to: 


Frequency List. 
Alternate Hyp: 


Y List: | igscore 


4 


a Indista Inperiod 
=In(distanc =In(period) 


0.949331 -1,42296 Title 


Linear Re. # 


0.324346 0.486133 RegEqn atb*x 
0. 0.a 0.000254 | | 
0.421338 0.631804 b 1.49986 


1.64924 2.473347 


7. Construct a residual plot to look for any departures from the 
linear pattern. 
e Insert another Data © Statistics page. 
¢ For the horizontal axis select Indistance. For Ylist, use the 
stat.resid list stored in the calculator. 


stat.resid 


-10 00 10 20 30 4 
Indistance 


8. ‘To make a prediction for a specific value of the explanatory vari- 
able, compute log(x) or In(x), if appropriate. Then do f1(k) to 
obtain the predicted value of log y or In y. To get the predicted 
value of y, do 10*Ans or e*Ans to undo the logarithm trans- 
formation. Here’s our prediction of the period of revolution for 
Eris, which is at a distance of 102.15 AU from the sun. 


In(102.15) 4.62644 5 
ft(102.15) 6.93927 | 
6-9392747840111 1032.02 | | 

| 
| 


This page intentionally left blank 


Students are provided with the following formulas on both the 
multiple choice and free-response sections of the AP® Statistics 
exam. 


|. Descriptive Statistics 
Sx; 


x= 


ll. Probability 
P(A UB) = P(A) + P(B) — PAMB) 
P(AMB) 


PIB) =m 


E(X) = py = >. xipi 


Var(X) = 0% = >) (i — px) Pi 
If X has a binomial distribution with parameters n and p, then: 


n 


P(X =k) = @a apr 


Ly = np 
ox = Vinp(1 = p) 
1p =p 


_ [pa=p 
oe ea 


If x is the mean of a random sample of size n from an infinite popu- 
lation with mean yp and standard deviation a, then: 


z= he 
_ (on 
a 


Ill. Inferential Statistics 


statistic — parameter 


Standardized test statistic: ae mae 
standard deviation of statistic 


Confidence interval: statistic + (critical value) « (std. deviation of statistic) 


Single-Sample 
Statistic Standard Deviation of Statistic 
oO 
Sample Mean = 
d Vn 
Sample Proportion p(t — p) 
n 
Two-Sample 
Statistic Standard Deviation of Statistic 
Difference of oot 
sample means 7, + hh 


Special case when 


01 = 02 
1 1 
oe ea ae 
ny Ny 
Difference of . pr(1 — px) po(1 — po) 
sample proportions n i N 


Special case when 
Dy = Po 


og ag fle oc 
Vell — ay} o to 


(observed — expected)* 
expected 


Chi-square test statistic = 5” 
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Probability | / 


‘Table entry for z is the area under the standard Normal curve to the left of z. 


0003 
0005 
0007 
0010 
0013 
0019 
0026 
0035 
0047 
0062 
0082 
0107 
0139 
0179 
0228 
0287 
0359 
0446 
0548 
0668 
0808 
0968 
1151 
1357 
1587 
1841 
2119 
2420 
2743 
3085 
3446 
3821 
4207 
4602 
5000 


0003 
0005 
0007 
0009 
0013 
0018 
0025 
0034 
0045 
0060 
0080 
0104 
0136 
0174 
0222 
0281 
0351 
0436 
0537 
0655 
0793 
0951 
A181 
1335 
1562 
1814 
2090 
2389 
2709 
3050 
3409 
3783 
4168 
4562 
4960 


0003 
0005 
0006 
0009 
0013 
0018 
0024 
0033 
0044 
0059 
0078 
0102 
0132 
0170 
0217 
0274 
0344 
0427 
0526 
0643 
0778 
0934 
1112 
1314 
1539 
1788 
2061 
2358 
2676 
3015 
3372 
3745 
4129 
4522 
4920 


0003 
0004 
0006 
0009 
0012 
0017 
0023 
0032 
0043 
0057 
0075 
0099 
0129 
0166 
0212 
0268 
0336 
0418 
0516 
0630 
0764 
0918 
1093 
1292 
1515 
1762 
2033 
2327 
2643 
2981 
3336 
3707 
4090 
4483 
4880 


0003 
0004 
0006 
0008 
0012 
0016 
0023 
0031 
0041 
0055 
0073 
0096 
0125 
0162 
0207 
0262 
0329 
0409 
0505 
0618 
0749 
0901 
1075 
1271 
1492 
1736 
2005 
2296 
2611 
2946 
3300 
3669 
4052 
4443 
4840 


0003 
0004 
0006 
0008 
0011 
0016 
0022 
0030 
0040 
0054 
0071 
0094 
0122 
0158 
0202 
0256 
0322 
0401 
0495 
0606 
0735 
0885 
1056 
1251 
1469 
47M 
1977 
2266 
2578 
2912 
3264 
3632 
4013 
4404 
4801 


.0003 
.0004 
.0006 
.0008 
.0011 
.0015 
.0021 
.0029 
.0039 
.0052 
.0069 
.0091 
0119 
.0154 
.0197 
.0250 
.0314 
.0392 
0485 
.0594 
0721 
.0869 
1038 
.1230 
1446 
1685 
.1949 
.2236 
.2546 
.2877 
3228 
3594 
3974 
4364 
4761 


0003 
0004 
0005 
0008 
0011 
0015 
0021 
0028 
0038 
0051 
0068 
0089 
0116 
0150 
0192 
0244 
0307 
0384 
0475 
0582 
0708 
0853 
1020 
1210 
1423 
1660 
1922 
2206 
2514 
2843 
3192 
3557 
3936 
4325 
A721 


0003 
0004 
0005 
0007 
0010 
0014 
0020 
0027 
0037 
0049 
0066 
0087 
0113 
0146 
0188 
0239 
0301 
0375 
0465 
0571 
0694 
0838 
1003 
1190 
1401 
1635 
1894 
2177 
2483 
2810 
3156 
3520 
3897 
4286 
4681 


0002 
0003 
0005 
0007 
0010 
0014 
0019 
0026 
0036 
0048 
0064 
0084 
0110 
0143 
0183 
0233 
0294 
0367 
0455 
0559 
0681 
0823 
0985 
1170 
1379 
1611 
1867 
2148 
2451 
2776 
3121 
3483 
3859 
A247 
A641 


(Continued) 


T-2 


Probability 


Tables 


‘Table entry for z is the area under the standard Normal curve to the left of z. 


03 04 05 06 07 08 
rol) .5160 5199 5239 E279 9319 
EODIIT 00r 0596 .5636 0675 0714 
9910 5948 5987 .6026 6064 6103 
6293 6331 6368 .6406 6443 .6480 
.6664 6700 6736 6772 6808 .6844 
1019 7054 .7088 7123 7157 .7190 
1357 7389 1422 7454 7486 1517 
1673 7704 1734 1764 1794 7823 
1967 7995 8023 8051 .8078 8106 
8238 .8264 8289 8315 .8340 8365 
8485 .8508 8531 8554 8577 8599 
8708 8729 8749 8770 .8790 8810 
8907 8925 8944 8962 .8980 8997 
9082 .9099 19115 9131 9147 9162 
9236 9251 .9265 102710 19292. .9306 
.9370 9382 .9394 .9406 9418 9429 
.9484 9495 .9505 9515 9525 .9535 
.9582 .9591 .9599 .9608 9616 9625 
.9664 9671 .9678 .9686 .9693 .9699 

9732 9738 9744 .9750 .9756 .9761 
9788 9793 .9798 .9803 .9808 9812 
9834 9838 9842 9846 .9850 9854 
.9871 9875 .9878 9881 .9884 .9887 
9901 .9904 .9906 .9909 Eo il E99iI3 
50925 20927, 09929 9931 .9932 9934 
.9943 9945 9946 .9948 .9949 9951 
.9957 .9959 .9960 .9961 9962 .9963 
.9968 .9969 .9970 .9971 9972 .9973 
.9977 9977 .9978 .9979 .9979 .9980 
.9983 9984 .9984 .9985 9985 .9986 
.9988 9988 .9989 .9989 .9989 .9990 
1099 p90 2 p09? p02 20992 19993 
.9994 .9994 .9994 .9994 RE) 19995 
.9996 .9996 .9996 .9996 .9996 29996 
29997 SERVE .9997 .9997 (O9o7 59997 


Tables T-3 


Probability 
\ P 


‘Table entry for p and C is the point t* with probability p lying to its right 
and probability C lying between -¢* and t*. 


Tail probability p 

25 .20 15 10 05 025 02 01 005 0025 .001 .0005 
1.000 1.376 1.963 3.078 6.314 12.71 15.89 31.82 63.66 WES) 318.3 636.6 
0.816 1.061 1.386 1.886 2.920 4.303 4.849 6.965 9.925 14.09 22.33 31.60 
0.765 0.978 1.250 1.638 2.353 3.182 3.482 4.541 5.841 7.453 10.21 12.92 
0.741 0.941 1.190 533 2132 2.776 2.999 Sf Ah 4.604 5.598 Tales 8.610 
0.727 0.920 1156 1.476 2.015 2.571 215i 3:365 4.032 4.773 5.893 6.869 
0.718 0.906 1.134 1.440 1.943 2.447 2.612 3.143 3.707 4.317 5.208 5.959 
0.711 0.896 1.119 1.415 1.895 2.365 2.517 2.998 3.499 4.029 4.785 5.408 
0.706 0.889 1.108 1.397 1.860 2.306 2.449 2.896 3.355 3.833 4.501 5.041 
0.703 0.883 1.100 1.383 1.833 2.262 2.398 2.821 3.250 3.690 4.297 4.781 
0.700 0.879 1.093 1.372 1.812 2.228 2.359 2.764 3.169 3.581 4.144 4.587 
0.697 0.876 1.088 1.363 1.796 2.201 2.328 2.718 3.106 3.497 4.025 4.437 
0.695 0.873 1.083 1.356 1.782 2.179 2.303 2.681 3.055 3.428 3.930 4.318 
0.694 0.870 1.079 1.350 eA) 2.160 2.282 2.650 3.012 3.372 3.852 4.221 
0.692 0.868 1.076 1.345 1.761 2.145 2.264 2.624 2.977 3.326 3.787 4.140 
0.691 0.866 1.074 1.341 1e753 2h 2.249 2.602 2.947 3.286 ohifehe) 4.073 
0.690 0.865 1.071 1.337 1.746 2.120 2.235 2.583 2.921 3.252 3.686 4.015 
0.689 0.863 1.069 1.333 1.740 2.110 2.224 2.567 2.898 3.222 3.646 3.965 
0.688 0.862 1.067 1.330 1.734 2.101 2.214 2.552 2.878 3.197 3.611 3.922 
0.688 0.861 1.066 1.328 1.729 2.093 2.205 2.539 2.861 3.174 3.579 3.883 


a 
= 


a 2 = = ee 
OOAN DOA PWNHHI TDA AN Da fFwWhND — 


20 0.687 0.860 1.064 1.325 1.725 2.086 2.197 2.528 2.845 3.153 3.552 3.850 
21 0.686 0.859 1.063 1.323 1.721 2.080 2.189 2.518 2.831 3.135 3.527 3.819 
22 0.686 0.858 1.061 1.321 Al 2.074 2.183 2.508 2.819 3.119 3.505 3.792 
23 0.685 0.858 1.060 1.319 1.714 2.069 2.177 2.500 2.807 3.104 3.485 3.768 
24 0.685 0.857 1.059 1.318 IZA 2.064 PAA 2.492 2197, 3.091 3.467 3.745 
25 0.684 0.856 1.058 1.316 1.708 2.060 2.167 2.485 2.787 3.078 3.450 3.725 
26 0.684 0.856 1.058 1.315 1.706 2.056 2.162 2.479 2.779 3.067 3.435 3.707 
27 0.684 0.855 1.057 1.314 1.703 2.052 2.158 2.473 2.771 3.057 3.421 3.690 
28 0.683 0.855 1.056 1.313 1.701 2.048 2.154 2.467 2.763 3.047 3.408 3.674 
29 0.683 0.854 1.055 1.311 1.699 2.045 2.150 2.462 2.756 3.038 3.396 3.659 
30 0.683 0.854 1.055 1.310 1.697 2.042 2.147 2.457 2.750 3.030 3.385 3.646 


40 0.681 0.851 1.050 1.303 1.684 2.021 2.123 2.423 2.704 2.971 3.307 3.551 
50 0.679 0.849 1.047 1.299 1.676 2.009 2.109 2.403 2.678 2.937 3.261 3.496 
60 0.679 0.848 1.045 1.296 1.671 2.000 2.099 2.390 2.660 2.915 3.232 3.460 
80 0.678 0.846 1.043 1.292 1.664 1.990 2.088 2.374 2.639 2.887 Bal 95) 3.416 
100 0.677 0.845 1.042 1.290 1.660 1.984 2.081 2.364 2.626 2.871 3.174 3.390 
1000 0.675 0.842 1.037 1.282 1.646 1.962 2.056 2.330 2.581 2.813 3.098 3.300 
(oe) 0.674 0.841 1.036 1.282 1.645 1.960 2.054 2.326 2.576 2.807 3.091 3.291 
50% 60% 70% 80% 90% 95% 96% 98% 99% 99.5% 99.8% 99.9% 
Confidence level C 


T-4 Tables 


Probability 
P 


x? Table entry for p is the point x? with probability p lying to its right. 


Tail probability p 


df 25 .20 15 10 05 025 02 01 .005 0025 001 .0005 
il 1.32 1.64 2.07 PALA 3.84 5.02 5.41 6.63 7.88 9.14 10.83 12.12 
2 Pall 3.22 3.79 4.61 5:09 7.38 7.82 Ori 10.60 11.98 13.82 15.20 
3 4.11 4.64 5.32 6.25 7.81 9:35 9.84 11.34 12.84 14.32 16.27 17.73 
4 5.39 5:99 6.74 7.78 9.49 11.14 11.67 13.28 14.86 16.42 18.47 20.00 
5 6.63 7.29 8.12 9.24 11.07 12.83 13.39 15.09 16.75 18.39 20.51 22.11 
6 7.84 8.56 9.45 10.64 12.59 14.45 15.03 16.81 18.55 20.25 22.46 24.10 
7 9.04 9.80 10.75 12.02 14.07 16.01 16.62 18.48 20.28 22.04 24.32 26.02 
8 10.22 11.03 12.03 13.36 15.51 17.53 18.17 20.09 21.95 23.77 26.12 27.87 


9 11.39 12.24 13.29 14.68 16.92 19.02 19.68 21.67 23.59 25.46 27.88 29.67 
10 12.55 13.44 14.53 15.99 18.31 20.48 21.16 23.21 25.19 27.11 29.59 31.42 
11 13.70 14.63 a 17.28 19.68 PAY 22.62 24.72 26.76 28.73 31.26 33.14 
12 14.85 15.81 16.99 18.55 21.03 23.34 24.05 26.22 28.30 30.32 32.91 34.82 
13 15.98 16.98 18.20 19.81 22.36 24.74 25.47 27.69 29.82 31.88 34.53 36.48 
14 17.12 18.15 19.41 21.06 23.68 26.12 26.87 29.14 31.32 33.43 36.12 38.11 
15 18.25 19.31 20.60 22.31 25.00 27.49 28.26 30.58 32.80 34.95 37.70 39.72 
16 19.37 20.47 21.79 23.54 26.30 28.85 29.63 32.00 34.27 36.46 39.25 41.31 
17 20.49 21.61 22.98 24.77 27.59 30.19 31.00 33.41 35.72 37.95 40.79 42.88 
18 21.60 22.76 24.16 25.99 28.87 31.53 32.35 34.81 37.16 39.42 42.31 44.43 
19 22.72 23.90 25.33 27.20 30.14 32.85 33.69 36.19 38.58 40.88 43.82 45.97 
20 23.83 25.04 26.50 28.41 31.41 34.17 35.02 37.57 40.00 42.34 45.31 47.50 
21 24.93 26.17 27.66 29.62 32.67 35.48 36.34 38.93 41.40 43.78 46.80 49.01 
22 26.04 27.30 28.82 30.81 33.92 36.78 37.66 40.29 42.80 45.20 48.27 50.51 
23 27.14 28.43 29.98 32.01 35.17 38.08 38.97 41.64 44.18 46.62 49.73 52.00 
24 28.24 29.55 31.13 33.20 36.42 39.36 40.27 42.98 45.56 48.03 51.18 53.48 
25 29.34 30.68 32.28 34.38 37.65 40.65 41.57 44.31 46.93 49.44 52.62 54.95 
26 30.43 31.79 33.43 35.56 38.89 41.92 42.86 45.64 48.29 50.83 54.05 56.41 
27 31.53 32.91 34.57 36.74 40.11 43.19 44.14 46.96 49.64 52.22 55.48 57.86 
28 32.62 34.03 35.71 37.92 41.34 44.46 45.42 48.28 50.99 53.59 56.89 59.30 
29 33.71 35.14 36.85 39.09 42.56 45.72 46.69 49.59 52.34 54.97 58.30 60.73 
30 34.80 36.25 37.99 40.26 43.77 46.98 47.96 50.89 53.67 56.33 59.70 62.16 
40 45.62 47.27 49.24 51.81 0276 59.34 60.44 63.69 66.77 69.70 73.40 76.09 
50 56.33 58.16 60.35 63.17 67.50 71.42 72.61 76.15 79.49 82.66 86.66 89.56 
60 66.98 68.97 71.34 74.40 79.08 83.30 84.58 88.38 Sih95) 95.34 99.61 102.7 
80 88.13 90.41 93.11 96.58 101.9 106.6 108.1 112.3 116.3 120.1 124.8 128.3 

100 = 109.1 a aLEy, 114.7 118.5 124.3 129.6 131.1 135.8 140.2 144.3 149.4 153:2 
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Glossary/Glosario 


English 


Espanol 


1.5 x IQR rule for outliers An observation is called an outlier if 
it falls more than 1.5 X IQR above the third quartile or below the 
first quartile. (p. 56) 


regla 1.5 X la gama entre cuartiles para valores atipicos Se 
le dice valor atfpico a una observacion si cae a més de 1.5 X la 
gama entre cuartiles por encima del tercer cuartil o por debajo 
del primer cuartil. (pag. 56) 


10% condition When taking an SRS of size n from a population 


1 
of size N, check thatn = 0: (pp. 401, 494) 


condicié6n del 10% Cuando se toma una muestra aleatoria sen- 
cilla de tamafio n de una poblacién de tamafio N, se verifica que 


l 
ns Tits (pags. 401, 494) 


68-95-99.7 rule (also known as the empirical rule) In the Nor- 
mal distribution with mean j and standard deviation a, (a) ap- 
proximately 68% of the observations fall within o of the mean p, 
(b) approximately 95% of the observations fall within 20 of ju, 
and (c) approximately 99.7% of the observations fall within 30 
of js. (p. 110) 


addition rule for mutually exclusive events IfA and B are mutu- 
ally exclusive events, P(A or B) = P(A) + P(B). (p. 308) 


regla 68-95-99.7 A la que también se le dice la “regla empirica”. 
En la distribucién normal con media ju y desviacién esténdar o, 
(a) aproximadamente el 68% de las observaciones caen dentro de 
o de la media ju, (b) aproximadamente el 95% de las observaciones 
caen dentro de 20 de 1, y (c) aproximadamente el 99.7% de las 
observaciones caen dentro de 30 de p. (pag. 110) 


regla de suma para eventos que se excluyen mutuamente Si A 
y B son eventos que se excluyen entre sf, P(A o B) = P(A) + P(B). 
(pag. 308) 


alternative hypothesis H, The claim that we are trying to find 
evidence for in a significance test. (p. 540) 


hipétesis H, alternativa La proposicién de que en una prueba 
de significancia estadistica estamos tratando de hallar evidencia 
que esté a favor. (pag. 540) 


anonymity The names of individuals participating in a study are 
not known even to the director of the study. (p. 271) 


association Knowing the value of one variable helps predict the 
value of the other. If knowing the value of one variable does not 
help predict the value of the other, there is no association be- 
tween the variables. (p. 18) 


back-to-back stemplot (also called back-to-back stem-and-leaf 
plot) Plot used to compare the distribution of a quantitative variable 
for two groups. Each observation in both groups is separated into a 
stem, consisting of all but the final digit, and a leaf, the final digit. 
‘The stems are arranged in a vertical column with the smallest at the 
top. The values from one group are plotted on the left side of the 
stem and the values from the other group are plotted on the right 
side of the stem. Each leaf is written in the row next to its stem, with 
the leaves arranged in increasing order out from the stem. (p. 32) 


anonimato Cuando se desconocen los nombres de las personas 
que participan en un estudio; inclusive el director del estudio los 
ignora. (pag. 271) 


asociacién Saber el valor de una variable facilita la prediccién 
del valor de la otra. Si saber el valor de una variable no facilita la 
prediccién del valor de la otra, entonces no existe ninguna aso- 
ciacion entre las variables. (pag. 18) 


diagrama de tallos contiguos (también se le dice diagrama de 
tallos y hojas contiguos) Se utiliza para comparar la distribucién 
de una variable cuantitativa en dos grupos. Cada observacién efec- 
tuada en ambos grupos se separa en un tallo, que consiste de todos 
los digitos salvo el ultimo, y una hoja, que consta del ultimo digito. 
Los tallos se organizan en una columna vertical con las cifras mas 
pequefias arriba. Los valores de un grupo se diagraman al lado iz- 
quierdo del tallo y los valores del otro grupo se diagraman al lado 
derecho del tallo. Cada hoja se coloca en el renglén que esta al 
lado de su tallo, y las hojas dispuestas en orden ascendiente extend- 
iéndose hacia fuera a partir del tallo. (pag. 32) 


G-1 


G-2 Glossary/Glosario 


bar graph Graph used to display the distribution of a categorical 
variable or to compare the sizes of different quantities. The hori- 
zontal axis of a bar graph identifies the categories or quantities 
being compared. The graph is drawn with blank spaces between 
the bars to separate the items being compared. (p. 8) 


grafico de barras (pag. 8) Se usa para ilustrar la distribucién de 
una variable categorizada o para comparar el tamafio de diferen- 
tes cantidades. El eje horizontal del grafico de barras identifica 
las categorias o las cantidades que se han de comparar. Se puede 
dibujar con espacios en blanco entre las barras a fin de separar la 
s diversas categorias que se desea comparar. (pag. 8) 


bias The design of a statistical study shows bias if it would con- 
sistently underestimate or consistently overestimate the value you 
want to know. (p. 212) 


sesgo Al disefiar un estudio estadistico se demuestra un sesgo si 
de manera constante se subestima o sobrestima el valor que se 
desea saber. (pag. 212) 


biased estimator A statistic used to estimate a parameter is biased 
if the mean of its sampling distribution is not equal to the true 
value of the parameter being estimated. (p. 431) 


calculador sesgado La estadistica que se usa para computar 
un parametro estd sesgada si la media de la distribucién de su 
muestreo no equivale al valor real del parametro que se esta com- 
putando. (pag. 431) 


bimodal A graph of quantitative data with two clear peaks. 
(p. 29) 


bimodal Grafico de datos cuantitativos con dos picos bien defini- 
dos. (pag. 29) 


binomial coefficient The number of ways of arrang- 


ing k successes among n observations is given by the bino- 
| 

mial coefficient (7) = a Py for k = 0, 1, 2, ... 

n! =n(n — 1)(n— 2)-...-3+2-1 and 0! 


n where 


1. (p. 392) 


coeficiente binomial La cantidad de maneras de organizar 
k aciertos entre n observaciones se representa con el coefici- 
n! 


ag is | _ 
ente binomial G lin — I para k = 0, 1, 2, ... 
n! = n(n — 1)(n — 2)-...+3+2+1y0! = 1. (pag. 392) 


nen el que 


binomial distribution In a binomial setting, suppose we let 
X = the number of successes. The probability distribution of X is 
a binomial distribution with parameters n and p, where n is the 
number of trials of the chance process and p is the probability of 
a success on any one trial. The possible values of X are the whole 
numbers from 0 to n. (p. 388) 


distribucién binomial En un entorno binomial, supongamos 
que se permite que X = la cantidad de aciertos. La distribucion 
de la probabilidad de X es una distribucién binomial con los para- 
metros n y p, en la que n es la cantidad de ensayos del proceso de 
probabilidad y p es la probabilidad de un acierto en cualquiera de 
los ensayos. Los posibles valores de X son los ntimeros enteros de 


Oan. (pag. 388) 


binomial probability formula If X has the binomial distribution 
with n trials and probability p of success on each trial, the pos- 
sible values of X are 0, 1, 2, ..., n. If k is any one of these values, 


P(X =k) = (Joa — py’, (p. 409) 


formula de probabilidad binomial Si X tiene la distribucién 
binomial con n ensayos y la probabilidad p de acierto en cada en- 
sayo, los posibles valores de X son 0, 1, 2, ..., n. Sik es cualquiera 


de estos valores, P(X = k) = (ia — p)"~* (pag. 409) 


binomial random variable The count X of successes in a bino- 
mial setting. (p. 388) 


binomial setting Arises when we perform several independent 
trials of the same chance process and record the number of times 
that a particular outcome occurs. ‘The four conditions for a bino- 
mial setting: 


¢ Binary? The possible outcomes of each trial can be classified 
as “success” or “failure.” 

¢ Independent? Trials must be independent; that is, knowing the 
result of one trial must not tell us anything about the result of 
any other trial. 

¢ Number? The number of trials n of the chance process must 
be fixed in advance. 


¢ Success? ‘There is the same probability p of success on each 


trial. (p. 388) 


variable aleatoria binomial La cuenta X de aciertos en un en- 
torno binomial. (pag. 388) 


entorno binomial Surge cuando se realizan varios ensayos in- 
dependientes del mismo proceso de probabilidad y se anota la 
cantidad de veces que se produce un resultado dado. Las cuatro 
condiciones que definen un entorno binomial son: 


e Binario? Los resultados posibles de cada ensayo se pueden 
clasificar como “acierto” 0 “fracaso”. 

e¢ Independiente? Los ensayos han de ser independientes; es 
decir, saber el resultado de un ensayo no debe indicar nada 
acerca del resultado de otro ensayo. 

¢ Ntimero? La cantidad de ensayos n del proceso de probabili- 
dad se tiene que fijar con anticipacién. 

¢ Acierto? Existe la misma probabilidad p de lograr un acierto en 
cada ensayo. (pag. 388) 


block Group of experimental units that are known before the ex- 
periment to be similar in some way that is expected to affect the 
response to the treatments. (p. 252) 


bloque Grupo de unidades experimentales que antes del experi- 
mento se sabe son similares de alguna manera previsible que af 
ecte la respuesta a los tratamientos. (pag. 252) 


boxplot Graph of the five-number summary. The box spans 
the quartiles and shows the spread of the central half of the dis- 
tribution. The median is marked within the box. Lines extend 
from the box to the smallest and largest observations that are 
not outliers. Outliers are marked with a special symbol such as 
an asterisk (*). (p. 57) 


categorical variable Variable that places an individual into one 
of several groups or categories. (p. 3) 


Glossary/Glosario G-3 


diagrama de caja y bigotes Un grafico del resumen de cinco ci- 
fras. La caja abarca los cuartiles y muestra el alcance de la mitad 
central de la distribucién. Dentro de la caja se marca la media. 
Las lineas se extienden a partir de la caja a las observaciones mas 
pequefia y mds grande que no son valores atfpicos. Los valores 
atfpicos se marcan con un simbolo especial tal como un asterisco 


(*). (pag. 57) 


variable categorizada Coloca a un individuo en uno o varios 
grupos o categorias. (pag. 3) 


census Study that attempts to collect data from every individual 
in the population. (p. 210) 


censo Un estudio en el que se trata de recoger datos acerca de 
cada individuo en la poblacién. (pag. 210) 


central limit theorem (CLT) In an SRS of size n from any 
population with mean y and finite standard deviation a, when 
n is large, the sampling distribution of the sample mean x is ap- 
proximately Normal. (p. 457) 


teorema del limite central ‘Traza una muestra aleatoria sencilla 
de tamafio na partir de una poblacion con la media p y una des- 
viacion estandar finita de o. El teorema del limite central mani- 
fiesta que cuando n es grande, la distribucién de muestreo de la 
media de la muestra x es aproximadamente normal. (pag. +57) 


Chebyshev’s inequality In any distribution, the proportion of 
observations falling within k standard deviations of the mean is at 


1 
least 1 — gz (p. 112) 


chi-square distribution Family of distributions that take only 
nonnegative values and are skewed to the right. A particular chi- 
square distribution is specified by giving its degrees of freedom. 


(p. 685) 


desigualdad de Chebychov En cualquier distribuci6n, la pro- 
porcion de observaciones que yacen dentro de k desviaciones 


] 
esténdar de la media es al menos | — gz (pag. 112) 


distribucién de ji cuadrado Familia de distribuciones que acep- 
ta solo valores no negativos y que esta sesgada hacia la derecha. Se 
especifica una distribucion de ji cuadrado dada citando sus grados 


de libertad. (pag. 685) 


chi-square statistic Measure of how far the observed counts are 
from the expected counts. ‘The formula is 


(Observed — Expected)’ 
Expected 


2 


where the sum is over all possible values of the categorical variable 
or all cells in the two-way table. (p. 682) 


estadistica de jicuadrado Una medicién de la distancia entre las 
cuentas observadas y las cuentas previstas. La formula es 


(Observadas — Previstas)* 


Previstas 


y=> 


a 


en la que la suma esta sobre todos los valores posibles de la vari- 
able categorizada o sobre todas las celdas en la tabla de doble via. 


(pag. 682) 


chi-square test for goodness of fit Suppose the Random, 10%, 
and Large Counts conditions are met. To determine whether a 
categorical variable has a specified distribution in the population 
of interest, expressed as the proportion of individuals falling into 
each possible category, perform a test of 


Ho: The specified distribution of the categorical variable in the 
population of interest is correct. 


H,: The specified distribution of the categorical variable in the 
population of interest is not correct. 


Start by finding the expected count for each category assuming 
that Hy is true. Then calculate the chi-square statistic 


(Observed — Expected)’ 
Expected 


vy=> 


prueba de ji cuadrado para confirmar el cuadre Supongamos 
que se cumplen las condiciones de aleatorio, del 10% y de cuen- 
tas grandes. Para determinar si una variable categorizada tiene una 
distribucién especifica en la poblacién de interés, expresada como 
la proporcién de individuos que se encuentran dentro de cada cat- 
egorfa posible, se realiza una prueba de 


Hp: La distribucién especificada de la variable categorizada en la 
poblacion de interés es correcta 


H,;: La distribucion especificada de la variable categorizada en la 
poblacion de interés no es correcta. 


Se comienza hallando la cuenta prevista para cada categoria, 
asumiendo que Hp es verdad. Luego, se calcula la estadistica de 
ji cuadrado 
(Observadas — Previstas)? 

Previstas 


v=> 
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where the sum is over the k different categories. ‘The P-value is the 
area to the right of x? under the density curve of the chi-square 
distribution with k — 1 degrees of freedom. (p. 680) 


en la que la suma esta sobre las k categorfas diferentes. E] valor- 
P es el drea a la derecha de y* bajo la curva de densidad de la 
distribucién ji cuadrado con k — 1 grados de libertad. (pag. 680) 


chi-square test for homogeneity Suppose the Random, 10%, 
and Large Counts conditions are met. You can use the chi-square 
test for homogeneity to test 


Ho: There is no difference in the distribution of a categorical 
variable for several populations or treatments. 


H,: There is a difference in the distribution of a categorical vari- 
able for several populations or treatments. 


Start by finding the expected counts. Then calculate the chi- 
square statistic 


(Observed — Expected)? 
Expected 


v=> 


where the sum is over all cells (not including totals) in the 
two-way table. If Hp is true, the x” statistic has approximately a 
chi-square distribution with degrees of freedom = (number of 
rows — 1)(number of columns — 1). The P-value is the area to the 
tight of x? under the corresponding chi-square density curve. 


(p. 708) 


prueba de ji cuadrado de homogeneidad Supongamos que se 
han cumplido las condiciones de aleatorio, del 10% y de cuentas 
grandes. Se puede usar la prueba de ji cuadrado de homogenei- 
dad para verificar que 

Ho: No hay diferencia en la distribucién de una variable catego- 

rizada entre varias poblaciones o tratamientos. 

H,: Si hay diferencia en la distribucién de una variable categori- 
zada entre varias poblaciones o tratamientos. 

Se comienza hallando las cuentas previstas. Luego se computa la 
estadistica de ji cuadrado 


(Observadas — Previstas)? 


Previstas 


¥v=> 


en la que la suma esta por sobre todas las celdas (sin incluir los 
totales) en la tabla de doble via. Si Ho es verdad, la estadistica 
x? tiene una distribucién de aproximadamente ji cuadrado con 
grados de libertad = (ntimero de renglones — 1)(ntimero de co- 
lumnas — 1). El valor P es el area a la derecha de x” bajo la curva 
de densidad de ji cuadrado correspondiente. (pag. 708) 


chi-square test for independence Suppose the Random, 10%, 
and Large Counts conditions are met. You can use the chi-square 
test for independence to test 


Hy: There is no association between two categorical variables in 
the population of interest. 


H,: There is an association between two categorical variables in 
the population of interest. 


Or, alternatively, 


Hy: Two categorical variables are independent in the population 
of interest 


H,: Two categorical variables are not independent in the popu- 
lation of interest. 


Start by finding the expected counts. Then calculate the chi- 
square statistic 


(Observed — Expected)? 
Expected 


v=> 


where the sum is over all cells in the two-way table. If Hp is true, 
the x’ statistic has approximately a chi-square distribution with de- 
grees of freedom = (number of rows — 1)(number of columns — 1). 


The P-value is the area to the right of y? under the corresponding 
chi-square density curve. (p. 697) 


prueba de ji cuadrado de independencia Supongamos que se 
han cumplido las condiciones de aleatorio, del 10% y de cuentas 
grandes. Se puede usar la prueba de ji cuadrado de independen- 
cia para verificar que 

Ho: No hay ninguna asociacién entre dos variables categorizadas 
en la poblacién de interés. 


H,: Si hay una asociacién entre dos variables categorizadas en la 
a Vv g 
poblacion de interés. 


O alternativamente, 


Hp: Dos variables categorizadas son independientes en la po- 
blacion de interés. 


H,: Dos variables categorizadas no son independientes en la 
d g Pp 
poblacion de interés. 


Se comienza encontrando las cuentas previstas. Luego se com- 
P $' 
puta la estadistica de ji cuadrado 
. 2 
(Observadas — Previstas)* 


Previstas 


x=> 


en la que la suma esta sobre todas las celdas en la tabla de doble 
via. Si Ho es verdad, la estadistica y” tiene una distribucién de 
aproximadamente ji cuadrado con grados de libertad = (ntimero 
de renglones — 1)(ntimero de columnas — 1). El valor P es el area 
a la derecha de x” bajo la curva de densidad de ji cuadrado cor- 
respondiente. (pag. 697) 


cluster sample Sample obtained by classifying the popula- 
tion into groups of individuals that are located near each other, 
called clusters, and then choosing an SRS of the clusters. All 
individuals in the chosen clusters are included in the sample. 


(p. 221) 


muestra de cluster Muestra que se obtiene clasificando la po- 
blacién en grupos de individuos que estan ubicados uno cerca del 
otro, Hamados clusters, y luego escogiendo una muestra aleatoria 
sencilla de los clusters. Todos los individuos en los clusters escogi- 
dos se incluyen en la muestra. (pag. 221) 


coefficient of determination 7” Fraction of the variation in the 
values of y that is accounted for by the least-squares regression 
line of y on x. We can calculate r using the formula 


¥ residuals? 
>, (yi a yy 


P=] 


(p. 179) 
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coeficiente de determinacién r’ La fraccién de la variacién en 
los valores y que se tiene en cuenta por la linea de regresién de 
minimos cuadrados de y sobre x. Se puede computar 7” utilizando 
la formula 


¥ residuales* 
oy (yi = yy 


(pag. 179) 


comparison Experimental design principle. Use a design that 
compares two or more treatments. (p. 240) 


comparacion Principio de disefio experimental. Se usa un dis- 
efio que compara dos o mas tratamientos. (pag. 240) 


complement of an event AC Event “not A”. (p. 307) 


complemento de un evento A© Se refiere al evento “que no es 
A”. (pag. 307) 


complement rule The probability that an event does not occur 
is 1 minus the probability that the event does occur. In symbols, 
P(A) = 1- P(A). (pp. 308, 314) 


regla del complemento La probabilidad de que no suceda un 
evento es 1 menos la probabilidad de que el evento sf suceda. 
En representacién simbdlica, P(A°) = 1 - P(A). (pags. 308, 314) 


completely randomized design Design in which the experimen- 
tal units are assigned to the treatments completely by chance. 


(p. 245) 


disefio completamente aleatorizado Cuando las unidades ex- 
perimentales se les asignan a los tratamientos de manera comple- 
tamente al azar. (pag. 245) 


(Observed — Expected)? 


Expected 
added together to produce the test statistic x7. (p. 690) 


components Individual terms that are 


(Observades — Previstas)* 


Previstas 


componentes Los términos individuales 


que se suman para producir la estadistica de prueba 7. (pag. 690) 


conditional distribution Term that describes the values of one 
variable among individuals who have a specific value of another 
variable. There is a separate conditional distribution for each 
value of the other variable. (p. 15) 


distribuci6n condicional Describe los valores de una variable 
entre individuos que tienen un valor especifico de otra variable. 
Hay una distribucién condicional separada para cada valor de la 
otra variable. (pag. 15) 


conditional probability Probability that one event happens giv- 
en that another event is already known to have happened. Sup- 
pose we know that event A has happened. Then the probability 
that event B happens given that event A has happened is denoted 
by P(B | A). To find the conditional probability P(B | A), use the 


formula 


P(AMB) 


(p. 320) 


probabilidad condicional La probabilidad de que un evento 
suceda a la luz de que se sabe que otro evento ya sucedi6. Supon- 
gamos que nos consta que el evento A ya sucedié. Entonces la 
probabilidad de que el evento B suceda en vista de que el evento 
A ya sucedié, se denota con P(B | A). Para hallar la probabilidad 
condicional P(B | A), se usa la formula 


P(AMB) 


(pag. 320) 


conditions for regression inference Suppose we have n obser- 
vations on an explanatory variable x and a response variable y. 
Our goal is to study or predict the behavior of y for given values 
of x. 


e Linear: The actual relationship between x and y is linear. For 
any fixed value of x, the mean response y falls on the popula- 
tion (true) regression line jz, = a + fx. 

¢ Independent: Individual observations are independent. 
When sampling is done without replacement, check the 10% 
condition. 


¢ Normal: For any fixed value of x, the response y varies accord- 
ing to a Normal distribution. 


condiciones para la inferencia de regresi6n Supongamos que 
tenemos n observaciones en una variable explicativa x y una vari- 
able de respuesta y. Nuestra meta consiste en estudiar o predecir 
el comportamiento de y ante los valores dados de x. 


¢ Lineal (linear): La relacién real entre x y y es lineal. Para todo 
valor fijo de x, la respuesta media y cae en la linea de regresion 
de la poblacién (verdadera) py, = a + Gx. 

e Independiente (independent): Las observaciones individuales 
son independientes. Cuando el muestreo se hace sin reem- 
plazo, se verifica la condicién del 10%. 

¢ Normal (normal): Para cualquier valor fijo de x, la respuesta y 
varia segtin una distribuci6én normal. 
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e Equal SD: The standard deviation of y (call it 7) is the same 
for all values of x. 


¢ Random: The data are produced from a well-designed random 
sample or randomized experiment. (p. 743) 


¢ Desviaci6n estandar equivalente: La desviacion esténdar de y 
es la misma para todos los valores de x. 


e Aleatorio (random): Los datos son producidos a partir de una 
muestra aleatoria 0 un experimento aleatorio, ambos bien 


disefiados. (pag. 743) 


confidence interval Gives an interval of plausible values for a pa- 
rameter. The interval is calculated from the data and has the form 


point estimate + margin of error 
or, alternatively, 
statistic + (critical value) - (standard deviation of statistic) 


(p. 480) 


intervalo de confianza Ofrece un intervalo de valores plausibles 
para un pardmetro. El intervalo se computa a partir de los datos 
y tiene la forma 


Estimado de punto + margen de error 
o alternativamente, 
estadistica + (valor critico) - (desviacion estandar de la estadistica) 


(pag. +80) 


confidence level C Success rate of the method for calculating 
the confidence interval. In C% of all possible samples, the meth- 
od would yield an interval that captures the true parameter value. 


(p. 480) 


nivel de confianza C La tasa de aciertos del método con el que 
se computa el intervalo de confianza. En el C% de todas las 
muestras posibles, el método produciria un intervalo que capta el 
valor verdadero del pardmetro. (pag. +80) 


confidential A basic principle of data ethics that requires that 
individual data be kept private. (p. 270) 


confidencial Principio bdsico de la ética de la gestion de datos. 
Requiere que los datos individuales se mantengan en reserva. 
(pag. 270) 


confounding When two variables are associated in such a way 
that their effects on a response variable cannot be distinguished 
from each other. (p. 236) 


confuso Cuando dos variables se asocian de tal manera que sus 
efectos en una variable de respuesta no se pueden distinguir el 
uno del otro. (pag. 236) 


continuous random variable Variable that takes all values in an 


interval of numbers. The probability distribution of a continuous 
random variable is described by a density curve. The probability 
of any event is the area under the density curve and above the 
values of the variable that make up the event. (p. 356) 


variable aleatoria continua Emplea todos los valores en un in- 
tervalo de cifras. La distribucién de la probabilidad de una vari- 
able aleatoria continua se describe con una curva de densidad. La 
probabilidad de cualquier evento en el drea debajo de la curva de 
densidad y encima de los valores de la variable que componen el 
evento. (pag. 356) 


control Experimental design principle that mandates keeping 
other variables that might affect the response the same for all 
groups. (p. 242) 


control Principio del disefio experimental. Se mantienen otras 
variables que podrian afectar la respuesta iguales para todos los 
grupos. (pag. 242) 


control group Experimental group whose primary purpose is to 
provide a baseline for comparing the effects of the other treat- 
ments. Depending on the purpose of the experiment, a control 
group may be given a placebo or an active treatment. (p. 246) 


grupo de control Grupo experimental cuyo fin primario es esta- 
blecer una linea base mediante la cual se comparan los efectos de 
otros tratamientos. Segtin el objeto del experimento, a un grupo 
de control se le puede dar un placebo o un tratamiento activo. 
(pag. 246) 


convenience sample Sample selected by taking from the popula- 
tion individuals that are easy to reach. (p. 212) 


muestra de conveniencia Muestra escogida de individuos de la 
poblacién con quienes es facil hacer contacto. (pag. 212) 


correlation Measures the direction and strength of the linear 
relationship between two quantitative variables. Correlation 
is usually written as r. We can calculate r using the formula 


a (3 — =) - *), (pp. 150, 154) 


n-1 Ss Sy 


correlacién Mide el sentido y la fuerza de la relacion lin- 
eal entre dos variables cuantitativas. La correlacién general- 
mente se denomina con una r. Calculamos la r con la formula 


| xj —x\(Vi-¥ 3 
r 5 " ( a ) (ngs. 150,154 


critical value Multiplier that makes the interval wide enough 
to have the stated capture rate. The critical value depends on 
both the confidence level C and the sampling distribution of the 
statistic. (pp. +86, +97) 


valor critico Multiplicador que amplia el intervalo lo suficiente 
para retener la tasa de captacién indicada. El valor critico de- 
pende de tanto el nivel de confianza C como de la distribucién 
de muestreo de la estadistica. (pags. +86, +97) 


cumulative relative frequency graph Graph used to examine lo- 
cation within a distribution. Cumulative relative frequency graphs 
begin by grouping the observations into equal-width classes. ‘The 
completed graph shows the accumulating percent of observations 
as you move through the classes in increasing order. (p. 87) 


data analysis Process of describing data using graphs and nu- 
merical summaries. (p. 2) 


density curve Curve that (a) is always on or above the horizon- 
tal axis and (b) has area exactly | underneath it. A density curve 
describes the overall pattern of a distribution. The area under the 
curve and above any interval of values on the horizontal axis is 
the proportion of all observations that fall in that interval. (p. 105) 
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grafico de la frecuencia relativa acumulada Se usa para exami- 
nar la ubicacién dentro de una distribucién. Los grdaficos de la 
frecuencia relativa acumulada se inician agrupando las observa- 
ciones en clases del mismo ancho. El grafico completado muestra 
el porcentaje de observaciones que se van acumulando a medida 
que se desplaza por las clases en orden ascendiente. (pag. 87) 


andlisis de los datos Proceso que describe los datos haciendo uso 
de graficos y restimenes numéricos. (pag. 2) 


curva de densidad Curva que (a) siempre estd sobre 0 por en- 
cima del eje horizontal y (b) tiene 1 area exactamente debajo. La 
curva de densidad describe el patrén general de una distribucién. 
El area debajo de la curva y encima de todo intervalo de valores 
en el eje horizontal es la proporcién de todas las observaciones 
que caen en dicho intervalo. (pag. 105) 


describing a distribution In any graph of data, look for the over- 
all pattern and for striking departures from that pattern. Shape, 
center, and spread describe the overall pattern of the distribution 
of a quantitative variable. (p. 26) 


descripcién de una distribucién En un grafico de datos, se ob- 
serva cual es el patrén general y se busca también valores atipicos 
que no se ajusten al patron. Forma, centro y amplitud describen 
el patron general de la distribucién de una variable cuantitativa. 


(pag. 26) 


describing a scatterplot In any graph of data, look for the overall 
pattern and for striking departures from that pattern. Direction, 
form, and strength describe the overall pattern of a scatterplot. 


(p. 147) 


descripcion de un grafico de dispersi6n En todo grafico de da- 
tos, se observa cual es el patron general y se busca también valores 
atipicos que no se ajusten al patrén. Direccién, forma y fuerza 
describen el patrén general de la distribucién de un grafico de 
dispersion. (pag. 147) 


discrete random variable ‘Takes a fixed set of possible values with 
gaps between. The probability distribution of a discrete random 
variable gives its possible values and their probabilities. ‘The prob- 
ability of any event is the sum of the probabilities for the values of 
the variable that make up the event. (p. 348) 


distribution Tells what values a variable takes and how often it 
takes these values. (p. 4) 


variable aleatoria discreta Emplea un conjunto fijo de valores 
posibles entre los cuales hay brechas. La distribucién de la proba- 
bilidad de una variable aleatoria discreta arroja valores posibles 
y sus probabilidades. La probabilidad de cualquier evento es la 
suma de las probabilidades de los valores de la variable que com- 
pone el evento. (pag. 348) 


distribuci6n Indica qué valores adopta una variable y con qué 
frecuencia adopta dichos valores. (pag. 4) 


distribution of sample data Gives the values of the variable for 
all the individuals in the sample. (p. 428) 


distribucién de los datos de la muestra Indica los valores de la 
variable que les corresponden a todos los individuos en la mues- 
tra. (pag. 428) 


dotplot Simple graph that shows each data value as a dot above 
its location on a number line. (p. 25) 


grafico de puntos Un grafico sencillo que muestra el valor de 
cada dato encima de su ubicacién a lo largo de una linea de ci- 
fras. (pag. 25) 


double-blind An experiment in which neither the subjects nor 
those who interact with them and measure the response variable 
know which treatment a subject received. (p. 248) 


event Any collection of outcomes from some chance process. An 
event is a subset of the sample space. Events are usually desig- 
nated by capital letters, like A, B, C, and so on. (p. 306) 


doble ciego Experimento en el que ninguno de los sujetos ni 
aquellos que interacttian con los sujetos y que miden la variable 
de repuesta saben qué tratamiento recibié el sujeto. (pag. 248) 


evento Cualquier coleccién de los resultados de un proceso de 
probabilidad. Es decir, un evento es un subconjunto del espacio 
de muestras. Los eventos generalmente se designan con maytiscu- 
las tales como A, B, C, y asf sucesivamente. (pdg. 306) 
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expected counts Expected numbers of individuals in the sample 
that would fall in each cell of the one-way or two-way table if Ho 
were true. (p. 681) 


cuentas previstas Las cantidades previstas de individuos en la 
muestra que caerfan en cada celda en la tabla, sea de una via o de 
dos vias, si Ho fuera verdad. (pag. 681) 


experiment A study in which researchers deliberately impose 
treatments on individuals to measure their responses. (p. 235) 


experimento Estudio en el que los investigadores deliberadamente 
les imponen tratamientos a individuos, con el fin de medir sus 
respuestas. (pag. 235) 


experimental units Smallest collection of individuals to which 
treatments are applied. (p. 237) 


unidades experimentales La colecci6n mas pequefia de indi- 
viduos a quienes se les aplican los tratamientos. (pag. 237) 


explanatory variable Variable that may help explain or predict 
changes in a response variable. (pp. 143, 236) 


variable explicativa Variable que puede ayudar a explicar o pre- 
decir cambios en una variable de respuesta. (pags. 143, 236) 


exponential model Relationship of the form y = ab”. If the rela- 
tionship between two variables follows an exponential model and 
we plot the logarithm (base 10 or base e) of y against x, we should 
observe a straight-line pattern in the transformed data. (p. 775) 


modelo exponencial Relacién de la forma y = ab”. Sila relacién 
entre dos variables se ajusta a un modelo exponencial, y trazamos 
el logaritmo (de base 10 0 de base e) de y con respecto a x, se 
debe observar un patrén en linea recta en los datos transformados. 
(pag. 775) 


extrapolation Use of a regression line for prediction far outside 
the interval of values of the explanatory variable x used to obtain 
the line. Such predictions are often not accurate. (p. 168) 


factorial For any positive whole number n, its factorial n! is 
n! =n-(n-1)-(n-2):...- 3-2-1 In addition, we define 
0! = 1. (p. 409) 


extrapolacién Uso de una linea de regresi6n para hacer predic- 
ciones muy por fuera del intervalo de valores de la variable expli- 
cativa x que se utiliza para obtener la Ifnea. Tales predicciones a 
menudo carecen precision. (pag. 168) 


factorial Para cualquier ntimero entero positivo n, su factorial 
niesn! =n-(n—-1)-(n—-2)-...- 3-2-1 Ademas, definimos 
0! = 1. (pag. 409) 


factors Explanatory variables in an experiment. (p. 238) 


factores Las variables explicativas en un experimento. (pag. 238) 


fail to reject Ho If the observed result is not very unlikely to oc- 
cur when the null hypothesis is true, we should fail to reject Ho 
and say that we do not have convincing evidence for H,. (p. 544) 


first quartile Q; If the observations in a data set are ordered from 
lowest to highest, the first quartile Q; is the median of the observa- 
tions whose position is to the left of the median. (p. 54) 


no rechazar Hy Si no es muy improbable que el resultado ob- 
servado suceda cuando es verdad la hipotesis nula, no se debe 
rechazar Hp y se ha de indicar que no contamos con evidencia 
convincente de H,. (pag. 544) 


primer cuartil Q; Si las observaciones del conjunto de datos 
se organizan en orden ascendiente (del mas bajo al més alto), el 
primer cuartil Q) es la media de las observaciones cuya posicién 
se encuentra a la izquierda de la media. (pag. 54) 


five-number summary Smallest observation, first quartile, me- 
dian, third quartile, and largest observation, written in order from 
smallest to largest. In symbols: 


Minimum Q; Median Q; Maximum 
(p. 57) 


resumen de cinco cifras Consta de la observacién mas pequefia, 
el primer cuartil, la media, el tercer cuartil y la observaci6n més 
grande, enumeradas en orden ascendiente, desde la mas pequefia 
hasta la mds grande. Representado en forma simbélica, el re- 
sumen de cinco cifras es 


Minimo Q; Media Q; Maximo 
(pag. 57) 


frequency table ‘Table that displays the count (frequency) of 
observations in each category or class. (p. 8) 


general addition rule If A and B are any two events resulting 
from some chance process, then the probability that event A or 
event B (or both) occur is P(A or B) = P(A UB) = P(A) + P(B) - 
P(AMB). (p. 310) 


tabla de frecuencias Muestra la cuenta (frecuencia) de observa- 
ciones en cada categoria o clase. (pag. 8) 


regla general de adicién Si A y B son dos eventos cualquiera 
que resulten de algtin proceso de probabilidad, la probabilidad de 
que el evento A o el evento B (o ambos) suceda es P(A 0 B) = 
P(A UB) = P(A) + P(B) — P(ANM B). (pag. 310) 


general multiplication rule ‘The probability that events A and 
B both occur can be found using the formula P(A N B) = P(A) - 
P(B | A) where P(B | A) is the conditional probability that event B 
occurs given that event A has already occurred. (p. 321) 


Glossary/Glosario G-9 


regla general de multiplicacién La probabilidad de que suce- 
dan los eventos A y B se puede determinar utilizando la formula 
P(A M B) = P(A)- P(B | A) en la que P(B | A) es la probabilidad 
condicional de que suceda el evento B a la luz de que el evento 
A ya sucedié. (pag. 321) 


geometric distribution In a geometric setting, suppose we let 
Y = the number of trials it takes to get a success. The probability 
distribution of Y is a geometric distribution with parameter p, the 
probability of a success on any trial. The possible values of Y are 
1, 2, 3, .... (p. 405) 


distribuci6n geométrica En un entorno geométrico, suponga- 
mos que se permite que Y = la cantidad de ensayos que se pre- 
cisan para lograr un acierto. La distribucién de la probabilidad de 
Y es una distribucién geométrica con el parémetro p, la probabi- 
lidad de lograr n acierto en cualquier ensayo. Los valores posibles 
de Y son 1, 2, 3, .... (pag. 405) 


geometric probability formula If Y has the geometric dis- 
tribution with probability p of success on each trial, the pos- 
sible values of Y are 1, 2, 3, .... If k is any one of these values, 


P(Y =k) = (1— pp. (p. 406) 


férmula de probabilidad geométrica Si Y tiene una distribucién 
geometria con la probabilidad p de acierto en cada ensayo, los po- 
sibles valores de Y son 1, 2, 3, .... Sik es uno cualquiera de estos 


valores, P(Y = k) = (1 — p)*~'p. (pag. 406) 


geometric random variable ‘The number of trials Y that it takes 
to get a success in a geometric setting. (p. +05) 


variable aleatoria geométrica La cantidad de ensayos Y que 
se precisan para lograr un acierto en un entorno geométrico. 


(pag. 405) 


geometric setting Arises when we perform independent trials of 
the same chance process and record the number of trials it takes 
to get one success. On each trial, the probability p of success must 
be the same. (p. 404) 


histogram Graph that displays the distribution of a quantita- 
tive variable. The horizontal axis is marked in the units of mea- 
surement for the variable. The vertical axis contains the scale of 
counts or percents. Each bar in the graph represents an equal- 
width class. The base of the bar covers the class, and the bar 
height is the class frequency or relative frequency. (p. 33) 


independent events ‘Iwo events are independent if the occur- 
rence of one event does not change the probability that the other 
event will happen. In other words, events A and B are indepen- 


dent if P(A | B) = P(A) and P(B | A) = P(B). (p. 327) 


entorno geométrico Surge un entorno geométrico cuando se 
realizan ensayos independientes del mismo proceso de probabili- 
dad y se graban la cantidad de ensayos que se precisan para lograr 
un acierto. En cada ensayo, la probabilidad p de lograr un acierto 
tiene que ser la misma. (pag. +04) 


histograma Muestra la distribucién de una variable cuantitativa. 
En el eje horizontal se denotan las unidades de medicién de la 
variable. El eje vertical contiene la escala de cuentas 0 porcentajes. 
Cada barra del grafico representa una clase de ancho equivalente. 
La base de la barra abarca la clase, y la altura de la barra es la fre- 
cuencia o la frecuencia relativa de la clase. (pag. 33) 


eventos independientes Dos eventos son independientes si 
el hecho de que uno suceda no cambia la probabilidad de que 
el otro suceda. E's decir, los eventos A y B son independientes si 


P(A | B) = P(A) y P(B| A) = P(B). (pag. 327) 


independent random variables If knowing whether any event 
involving X alone has occurred tells us nothing about the occur- 
rence of any event involving Y alone and vice versa, then X and Y 
are independent random variables. (p. 371) 


variables aleatorias independientes Saber que ha sucedido un 
evento que implique el valor X solo, no nos indica nada respecto 
al hecho de que suceda un evento que implique el valor Y solo, 
y viceversa. Ein tal caso, tanto X como Y son variables aleatorias 
independientes. (pag. 371) 


individuals Objects described by a set of data. Individuals may 
be people, animals, or things. (p. 2) 


individuos Objetos descritos por un conjunto de datos. Los indi- 
viduos pueden ser personas, animales o cosas. (pag. 2) 


inference Drawing conclusions that go beyond the data at hand. 


(pp. 5, 223) 


inferencia Llegar a conclusiones que van mas alla de los datos 
que estén a la mano. (pags. 5, 223) 


inference about cause and effect Conclusion from the results 
of an experiment that the treatments caused the difference in re- 
sponses. Requires a well-designed experiment in which the treat- 
ments are randomly assigned to the experimental units. (p. 266) 


inferencia sobre causa y efecto Uso de los resultados de un ex- 
perimento para llegar a la conclusion de que son los tratamientos 
los que marcan la diferencia en las respuestas. Exige un experi- 
mento bien disefiado en el que los tratamientos se asignan de 
manera aleatoria a las unidades experimentales. (pag. 266) 
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inference about a population Conclusion about the larger 
population based on sample data. Requires that the individuals 
taking part in a study be randomly selected from the population 
of interest. (p. 266) 


inferencia sobre una poblacién Conclusién sobre una _po- 
blacion en general con base en datos muestrales. Se precisa que 
los participantes del estudio sean escogidos de manera aleatoria a 
partir de la poblacién de interés. (pag. 266) 


influential observation An observation is influential for a statisti- 
cal calculation if removing it would markedly change the result 
of the calculation. Points that are outliers in the x direction of 
a scatterplot are often influential for the least-squares regression 


line. (p. 189) 


observacion influyente La observacién es influyente en un cém- 
puto estadistico si al retirarla se notarfa un cambio sustancial en el 
resultado del cémputo. Los puntos que son valores atipicos en el 
sentido x en un grafico de dispersién, a menudo son influyentes en 
al menos una linea de regresi6n de minimos cuadrados. (pag. 189) 


informed consent Basic principle of data ethics that states that 
individuals must be informed in advance about the nature of a 
study and any risk of harm it may bring. Participating individuals 
must then consent in writing. (p. 270) 


autorizaci6n informada Principio basico de ética en la gestion 
de los datos. A los individuos se les ha de informar con antelacién 
acerca de la naturaleza de un estudio y de los riesgos 0 perjuicios 
que podria conllevar. Los individuos que participen luego ten- 
dran que dar su autorizacion por escrito. (pag. 270) 


institutional review board Board charged with protecting the 
safety and well-being of the participants in advance of a planned 
study and with monitoring the study itself. (p. 270) 


junta de revisién institucional Principio bdsico de la ética de 
la gestién de datos. ‘Todos los estudios planificados tienen que 
contar con aprobacion anticipada y tienen que contar con un 
monitoreo por una junta de revision institucional cuya funcién 
consiste en salvaguardar la seguridad y el bienestar de los partici- 
pantes. (pag. 270) 


interquartile range [OR = Q; — Q). (p. 54) 


gama entre cuartiles IQR = Q; — Q). (pag. 54) 


intersection The intersection of events A and B, denoted by 
ACB, refers to the occurrence of both of two events at the same 
time. (p. 311) 


lack of realism When the treatments, the subjects, or the envi- 
ronment of an experiment are not realistic. Lack of realism can 
limit researchers’ ability to apply the conclusions of an experi- 
ment to the settings of greatest interest. (p. 268) 


intersecci6n E] punto de cruce de los eventos A y B, designado 
con A1) B, se refiere a la situacién en la que ambos eventos su- 
ceden simulténeamente. (pag. 311) 


falta de realismo Cuando los tratamientos, los sujetos o el en- 
torno de un experimento no son realistas. La carencia de realismo 
puede limitar la capacidad de los investigadores de aplicar las 
conclusiones de un experimento a los entornos de gran interés. 


(pag. 268) 


Large Counts condition It is safe to use Normal approximation 
for performing inference about a proportion p if np = 10 and 


n(1 - p) = 10. (p. 403) 


condicién de cuentas grandes Se puede utilizar sin problemas 
la aproximacion normal para realizar la inferencia de una propor- 


cién p sinp = 10 yn(1 — p) = 10. (pag. 403) 


Large Counts condition for a chi-square test It is safe to use 
a chi-square distribution to perform calculations if all expected 
counts are at least 5. (p. 687) 


condicién de cuentas grandes en la prueba de ji cuadrado 
Se puede utilizar sin problemas la distribucién de ji cuadrado 
para realizar cOmputos si todas las cuentas previstas son de al 
menos 5. (pag. 687) 


law of large numbers If we observe more and more repetitions 
of any chance process, the proportion of times that a specific out- 
come occurs approaches a single value, which we call the prob- 
ability of that outcome. (p. 291) 


ley de las cifras grandes Si se observan mas y mds repeticiones en 
cualquier proceso de probabilidad, la proporcidn de veces que se 
da un resultado especifico se aproxima a un valor sencillo, al cual 
se le denomina la probabilidad de dicho resultado. (pag. 291) 


least-squares regression line ‘The line that makes the sum of the 
squared vertical distances of the data points from the line as small 
as possible. (p. 169) 


linea de regresi6n de minimos cuadrados La linea que reduce al 
minimo posible la suma de las distancias verticales cuadraticas de 
los puntos de datos a partir de la linea. (pag. 169) 


level Specific value of an explanatory variable (factor) in an 
experiment. 


nivel Valor especffico de una variable explicativa (factor) en un 
experimento. (pag. 238) 


linear transformation A transformation of a random variable 
that involves adding a constant a, multiplying by a constant b, or 
both. We can write a linear transformation of the random variable 
X in the form Y = a + bX. The shape, center, and spread of the 
probability distribution of Y are as follows: 

Shape: Same as the probability distribution of X unless b is 
negative. 

Center: pay = a + bux 

Spread: oy = |blox 

(p. 368) 


margin of error The difference between the point estimate and 
the true parameter value will be less than the margin of error in 
C% of all samples, where C is the confidence level. (p. +80) 
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transformaci6n lineal La transformacién de una variable aleato- 
ria que implica agregar una a constante, multiplicada por una b 
constante, o ambas. La transformaci6n lineal de la variable alea- 
toria X se puede escribir en la forma Y = a + bX. La forma, el 
centro y la amplitud de la distribucién de la probabilidad de Y 
son como sigue: 

Forma: Igual que la distribucion de la probabilidad de X a 
menos que b sea negativo. 

Centro: py = a + byx 

Amplitud: oy = |blox 

(pag. 368) 


margen de error La diferencia entre el estimado del punto y el 
valor real del pardmetro sera menor que el margen de error en 
C% de todas las muestras, en el que C es el nivel de confianza. 


(pag. 480) 


marginal distribution The distribution of one of the categori- 
cal variables in a two-way table of counts among all individuals 
described by the table. (p. 12) 


distribucién marginal La distribucién de una de las variables 
categorizadas en una tabla de doble via de cuentas entre todos los 
individuos descritos por la tabla. (pag. 12) 


matched pairs design Common form of blocking for comparing 
just two treatments. In some matched pairs designs, each subject 
receives both treatments in a random order. In others, the sub- 
jects are matched in pairs as closely as possible, and each subject 
in a pair is randomly assigned to receive one of the treatments. 


(p. 255) 


disefio de pares coincidentes Forma comun de crear bloques 
para efectos de comparacién de tan solo dos tratamientos. En 
algunos disefios de pares coincidentes, cada tema se somete a 
ambos tratamientos en un orden aleatorio. En otros, los temas se 
ponen en pares que coincidan lo més posible y cada tema en un 
par se asigna de manera aleatoria a fin de que reciba uno de los 
tratamientos. (pag. 255) 


mean x Arithmetic average. To find the mean of a set of observa- 
tions, add their values and divide by the number of observations. 
Dx; 


In symbols, ¥ = (p. 49) 


1 


media x E] promedio aritmético. Para hallar la media de un con- 
junto de observaciones, se suman todos los valores y se divide en- 


x 
(pag. +9) 


tre el ntimero de observaciones. En sfmbolos, ¥ = —— 
n 


mean of a density curve Point at which a density curve would 
balance if made of solid material. (p. 107) 


media de una curva de densidad FE] punto en el cual la cur- 
va se equilibraria si estuviera elaborada de un material macizo. 


(pag. 107) 


mean (expected value) of a discrete random variable To find 
the mean (expected value) of X, multiply each possible value by 
its probability, then add all the products: 


(p. 351) 


media (valor previsto) de una variable aleatoria discreta Para 
hallar la media (un valor previsto) de X, se multiplica cada valor 
posible por su probabilidad y luego se suman todos los productos: 


by = E(X) = xp) + x2p2 + x33 + «.. 
(pag. 351) 


= xpi 


mean (expected value) of a geometric random variable If Y is a 
geometric random variable with probability of success p on each 


l 
trial, then its mean (expected value) is fy = E(Y) = rs That is, 


the expected number of trials required to get the first success is 


Lip. (p. 408) 


media (valor previsto) de una variable aleatoria geométrica Si 
Y es una variable aleatoria geométrica con probabilidad de aci- 
erto p en cada ensayo, entonces su media (un valor previsto) es 


1 
py = E(Y) = rs Es decir, la cifra de ensayos prevista para lograr el 


primer acierto es l/p. (pag. +08) 


mean and standard deviation of a binomial random variable If 
a count X of successes has the binomial distribution with number 
of trials n and probability of success p, the mean and standard 


deviation of X are fx = np and ox = Vnp(l — p). (p. 398) 


media y desviacién esténdar de una variable binomial alea- 
toria Si una cuenta X de aciertos tiene una distribucién bino- 
mial con la cantidad de ensayos n y la probabilidad de acier- 
tos p, la media y la desviacién esténdar de X son jux = np and 


ox = Vnp(1 — p) (pag. 398) 
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mean and standard deviation of the sampling distribution of a 
sample mean x Suppose that x is the mean of an SRS of size n 
from a large population with mean jz and standard deviation o. 
Then 


e The mean of the sampling distribution of x is jug = pu. 


e The standard deviation of the sampling distribution of x is 


l 
Ox = a long as the 10% condition is satisfied: n = To. 
n 


(p. 452) 


media y desviaci6n estandar de la distribucién de muestreo 
de la media de una muestra Supongamos que x es la me- 
dia de una muestra aleatoria sencilla de tamafio n a partir de 


una poblacién grande con media y una desviacién estandar o. 
Asi 


e La media de la distribucién de muestreo de x es py = [U. 


e La desviacion estandar de la distribuci6n de muestreo de x es 


og, ee 
Oz = —= siempre y cuando se cumpla con la condicién del 10%: 


Vn 


ns aN. (pag. 452) 


median ‘The midpoint of a distribution; the number such that 
about half the observations are smaller and about half are larger. 
‘To find the median of a distribution: (1) Arrange all observations 
in order of size, from smallest to largest. (2) If the number of ob- 
servations n is odd, the median is the center observation in the 
ordered list. (3) If the number of observations n is even, the me- 
dian is the average of the two center observations in the ordered 


list. (p. 51) 


median of a density curve ‘The point with half the area under 


the curve to its left and the remaining half of the area to its right. 
(p. 106) 


media E] punto intermedio de una distribuci6n, con una cifra 
tal que aproximadamente la mitad de las observaciones son mas 
pequefias y la mitad son mds grandes. Para hallar la media de una 
distribucién: (1) Se organizan todas las observaciones en orden de 
su tamafio, de las més pequefias a las mds grandes. (2) Si la can- 
tidad de observaciones n es impar, la media es el la observacion 
central en la lista organizada. (3) Si la cantidad de observaciones 
n es par, la media es el promedio de las dos observaciones cen- 
trales en la lista organizada. (pag. 51) 


media de una curva de densidad _E] punto en el que la mitad del 
drea que esta debajo de la curva esta a la izquierda y la otra mitad 
del rea esta a la derecha. (pag. 106) 


mode Value or class in a statistical distribution having the great- 
est frequency. (p. 26) 


modo En una distribucién estadistica, el valor 0 clase que tiene 
la mayor frecuencia. (pag. 26) 


multimodal A graph of quantitative data with more than two 
clear peaks. (p. 29) 


multimodal Grdfico de datos cuantitativos que tiene mas de dos 
picos claros. (pag. 29) 


multiple comparisons Problem of how to do many comparisons 
at once with an overall measure of confidence in all our conclu- 


sions. (p. 700) 


multiplication rule for independent events If A and B are inde- 
pendent events, then the probability that A and B both occur is 
P(A B) = P(A) - P(B). (p. 328) 


comparaciones mitiltiples EF] problema de c6mo hacer muchas 
comparaciones a la vez con una medida de confianza general en 
todas las conclusiones a las que se llega. (pag. 700) 


regla de multiplicacién de eventos independientes Si A y B son 
eventos independientes, la probabilidad de que sucedan ambos, 
tanto A como B es P(A MN B) = P(A) - P(B). (pag. 328) 


mutually exclusive (disjoint) Two events that have no outcomes 
in common and so can never occur together. (p. 307) 


negative association When above-average values of one variable 
tend to accompany below-average values of the other. (p. 148) 


exclusivos mutuamente (desencajamiento) Dos eventos que no 
tienen resultados en comin y por lo tanto nunca pueden suceder 
a la vez. (pag. 307) 


asociacién negativa Cuando los valores por encima del prome- 
dio de una variable tienden a acompaiiar a los valores por debajo 
del promedio de la otra. (pag. 148) 


nonresponse Occurs when an individual chosen for the sample 
can’t be contacted or refuses to participate. (p. 225) 


no respondié Sucede cuando a un individuo escogido para la 
muestra no se le puede contactar 0 el sujeto se niega a participar. 
(pag. 225) 


Normal approximation to a binomial distribution Suppose 
that a count X of successes has the binomial distribution with n 
trials and success probability p. When n is large, the distribution 
of X is approximately Normal with mean np and standard devia- 


tion Vnp(l — p). We use this approximation when np = 10 and 
n(1—p) = 10. (p. 403) 


aproximaci6n normal hacia una distribucién binomial Supon- 
gamos que una cuenta X de aciertos tiene la distribucién bino- 
mial con n ensayos y una probabilidad de acierto p. Cuando n 
es grande, la distribucién de X es aproximadamente normal con 


media np y desviacion estandar Vnp(l — p). Se hace uso de esta 
aproximacion cuando np = 10 and n(1 — p) = 10. (pag. 403) 


Normal curves Important class of density curves that are sym- 
metric, single-peaked, and bell-shaped. (p. 109) 
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curvas normales Clase importante de curvas de densidad que 
son simétricas, de un solo pico y con la forma de curva de cam- 
pana. (pag. 109) 


Normal distribution Distribution described by a Normal density 
curve. Any particular Normal distribution is completely specified 
by two numbers, its mean ju and standard deviation 7. The mean 
of a Normal distribution is at the center of the symmetric Normal 
curve. The standard deviation is the distance from the center to 
the change-of-curvature points on either side. We abbreviate the 


Normal distribution with mean jy and standard deviation o as 
N(w, 0). (p. 109) 


distribucién normal Segtin la describe una curva de densidad 
normal. Cualquier distribucién normal dada se especifica com- 
pletamente con dos cifras, su media ju y la desviacién estandar 
a. La media de una distribucién normal yace en el centro de la 
curva normal simétrica. La desviacion estandar es la distancia del 
centro a los puntos a ambos lados en los que cambia la curva. La 
distribucién normal con la media pz y la desviaci6n estandar o se 
abrevia N(j1, o). (pag. 109) 


Normal/Large Sample condition for inference about a mean 
The population has a Normal distribution or the sample size is 
large (n = 30). If the population distribution has unknown shape 
and n < 30, use a graph of the sample data to assess the Normal- 
ity of the population. Do not use t procedures if the graph shows 
strong skewness or outliers. (p. 515) 


condicién de muestra normal/grande para inferir sobre una 
media La poblaci6n tiene una distribucién normal o la muestra 
es de tamafio grande (n = 30). Si la distribucién de la poblacién 
tiene una forma desconocida y n < 30, se usa un grafico de los 
datos de la muestra para evaluar la normalidad de la poblaci6n. 
No se han de usar los procedimientos t si en el grafico se aprecian 
valores atfpicos o un sesgo marcado. (pag. 515) 


Normal probability plot Plot used to assess whether a data 
set follows a Normal distribution. ‘To make a Normal probabil- 
ity plot, (1) arrange the data values from smallest to largest and 
record the percentile of each observation, (2) use the standard 
Normal distribution to find the z-scores at these same percentiles, 
and (3) plot each observation x against the corresponding z. If the 
points on a Normal probability plot lie close to a straight line, the 
plot indicates that the data are approximately Normal. (p. 122) 


grafico de probabilidad normal Se usa para evaluar si un con- 
junto de datos se cifie a una distribuci6n normal. Para trazar un 
grafico de probabilidad normal, (1) se dispone de los valores de 
los datos del mas pequefio al mds grande y se anota el percentil 
de cada observaci6n, (2) se usa la distribucién normal estandar 
para hallar los puntos z en esos mismos percentiles, y (3) se traza 
cada observacion x con la z correspondiente. Si los puntos en un 
grafico de probabilidad normal yacen cerca de una linea recta, 
el grafico indica que los datos son aproximadamente normales. 


(pag. 122) 


null hypothesis Hp Claim we weight evidence against in a 
significance test. Often the null hypothesis is a statement of “no 
difference.” (p. 540) 


observational study Study that observes individuals and mea- 
sures variables of interest but does not attempt to influence the 
responses. (p. 235) 


observed counts Actual numbers of individuals in the sample 
that fall in each cell of the one-way or two-way table. (p. 681) 


hip6tesis nula Hp Contrapeso de la evidencia en una prueba de 
significancia. A menudo la hipotesis nula es una declaracién de 
“no hay diferencia.” (pag. 540) 


estudio de observacién Se observan los individuos y se miden 
las variables de interés pero no se trata de influir en las respuestas. 


(pag. 235) 


cuentas observadas Las cifras reales que corresponden a individ- 
uos en la muestra que caen en cada celda de la tabla de una via o 
en la de dos vias. (pag. 681) 


one-sample ¢ interval for a mean When the Random, 10%, and 
Normal/Large Sample conditions are met, a C% confidence in- 
terval for ju is 


_ * Sy 
Cf 
n 


V/ 


where ¢* is the critical value for the t distribution with df = n — 1, 
with C% of the area between —t* and t*. (p. 518) 


intervalo t de una sola muestra para una media Cuando se 
cumplen las condiciones de aleatorio, del 10% y de cuentas nor- 
males/grandes, un intervalo de confianza C% para es 


Sy 
Vn 
en el que t* es el valor critico para la distribucién t con df = n — 1, 
con C% del area entre —t* y t*. (pag. 518) 


xt 
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one-sample t test for a mean Suppose that the Random, 10%, 
and Normal/Large Sample conditions are met. To test the hy- 
pothesis Ho: 44 = j49, compute the one-sample t statistic 


bs X= [lo 
Sy 
Vn 
Find the P-value by calculating the probability of getting a t statis- 


tic this large or larger in the direction specified by the alternative 
hypothesis H, in a t distribution with df =n — 1. (p. 580) 


intervalo t de una sola muestra para una media Supongamos que 
se han cumplido las condiciones de aleatorio, del 10% y de muestra 
normal/grande. Para probar la hipotesis Ho: 44 = fo, se computa la 
estadistica t de una sola muestra 


Sy 
Se halla el valor P computando la probabilidad de obtener una es- 
tadistica t de este tamafio o mas grande en el sentido especificado 


por la hipotesis alternativa en una distribuci6n t con df = n — 1. 


(pag. 580) 


one-sample z interval for a proportion When the Random, 
10%, and Large Counts conditions are met, a C% confidence in- 
terval for the unknown proportion p is 


. [PAD 
a aon 


where z”* is the critical value for the standard Normal curve with 
C% of its area between —z* and z*. (p. 498) 


intervalo z de una sola muestra para una proporcién Cuando 
se cumplen las condiciones de aleatorio, del 10% y de cuen- 
tas grandes, un intervalo de confianza C% para la proporcién 


desconocida p es 
s. ¢ (pal-Pp 
a 


en el que z* es el valor critico para la curva normal estandar con 
C% de su area entre — z* y z*. (pag. 498) 


one-sample z test for a proportion Suppose that the Random, 
10%, and Large Counts conditions are met. To test the hypothesis 
Ho: p = po, compute the < statistic 


Pp — Po 


| po(l — po) 


Find the P-value by calculating the probability of getting a z statis- 
tic this large or larger in the direction specified by the alternative 
hypothesis H,. (p. 559) 


one-sided alternative hypothesis An alternative hypothesis that 
states that a parameter is larger than the null hypothesis value 
or that states that the parameter is smaller than the null value. 


(p. 541) 


prueba z de una sola muestra para una proporcién Suponga- 
mos que se han cumplido las condiciones de aleatorio, del 10% 
y de muestras grandes. Para probar la hipétesis Ho: p = po, se 
computa la estadistica 


p — po 
pol — po) 


1 


Se halla el valor P computando la probabilidad de obtener una es- 
tadistica z de este tamafio o mas grande en el sentido especificado 
por la hipotesis alternativa H,. (pag. 559) 


hipétesis alternativa unilateral Hipotesis alternativa que indica 
que un pardmetro es més grande que el valor de la hipotesis nula 
o que indica que el pardametro es mas pequefio que el valor nulo. 
(pag. 541) 


one-way table ‘Table used to display the distribution of a single 
categorical variable. (p. 680) 


tabla de una via Se usa para mostrar la distribucién de una sola 
variable categorizada. (pag. 680) 


outlier Individual value that falls outside the overall pattern of a 
distribution. (p. 26) 


valor atipico Un valor individual que cae por fuera del patrén 
general de la distribucién. (pag. 26) 


outlier in regression Observation that lies outside the overall 
pattern of the other observations. Points that are outliers in the y 
direction but not the x direction of a scatterplot have large residu- 
als. Other outliers may not have large residuals. (p. 189) 


P-value The probability, computed assuming Hp is true, that the 
statistic would take a value as extreme as or more extreme than 
the one actually observed, in the direction specified by H,. The 
smaller the P-value, the stronger the evidence against Ho and in 
favor of H, provided by the data. (p. 543) 


valor atipico en regresi6n Observacién que yace por fuera del 
patron general de las otras observaciones. Los puntos que son va- 
lores atipicos en el sentido y pero no en el sentido x de un grafico de 
dispersion tienen residuales grandes. Es posible que otros valores 
atipicos no tengan residuales grandes. (pag. 189) 


valor P La probabilidad, computada suponiendo que Hp es ver- 
dad, de que la estadistica tomarfa un valor tan extremo o mas ex- 
tremo que el que de hecho se observa, en el sentido especificado 
por H,. Cuanto menor sea el valor P, mas fuerte seré la evidencia 
contra Ho y en favor de H, que proporcionan los datos. (pag. 543) 


paired data Study designs that involve making two observations 
on the same individual or one observation on each of two similar 


individuals result in paired data. (p. 586) 
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datos apareados Se estudian disefios que implican hacer dos 
observaciones del mismo individuo, 0 una observacion de cada 
uno de dos individuos parecidos, resultando en datos apareados. 


(pag. 586) 


paired t procedures When paired data result from measuring 
the same quantitative variable twice, we can make comparisons 
by analyzing the differences in each pair. If the conditions for 
inference are met, we can use one-sample ¢ procedures to per- 
form inference about the mean difference jug. These methods are 
sometimes called paired t procedures. (p. 586) 


procedimientos t apareados Cuando la misma variable cuanti- 
tativa se mide dos veces pueden resultar datos apareados, con los 
cuales se pueden hacer comparaciones que analizan las diferen- 
cias en cada par. Si se cumplen las condiciones para la inferencia, 
se pueden usar procedimientos t de una sola muestra para realizar 
una inferencia acerca de la diferencia media jug. A estos métodos 
a veces se les denomina procedimientos t apareados. (pag. 586) 


parameter A number that describes some characteristic of the 
population. (p. 424) 


pardmetro Ntimero que describe algunas de las caracteristicas de 
la poblacién. (pag. 424) 


percentile The pth percentile of a distribution is the value with 
p percent of the observations less than it. (p. 85) 


percentil El percentil pavo de una distribucién es el valor cuyo 
porcentaje de las observaciones es menor que la cifra. (pag. 85) 


pie chart Chart that shows the distribution of a categorical vari- 
able as a “pie” whose slices are sized by the counts or percents 
for the categories. A pie chart must include all the categories that 
make up a whole. (p. 8) 


grafico circular Muestra la distribucion de una variable categori- 
zada n la forma de un cfrculo subdividido segtin las cuentas o los 
porcentajes de las categorias. El grafico circular tiene que incluir 
todas las categorias que componen la totalidad. (pag. 8) 


placebo Inactive (fake) treatment. (p. 244) 


placebo ‘Tratamiento inactivo (falso). (pag. 244) 


placebo effect Describes the fact that some subjects respond fa- 
vorably to any treatment, even an inactive one (placebo). (p. 247) 


efecto placebo Describe el hecho de que algunos sujetos respon- 
den de manera favorable a cualquier tratamiento, incluso uno 
inactivo (con placebo). (pag. 247) 


point estimate Specific value of a point estimator from a sample. 


(p. 477) 


estimado de punto E] valor especifico de un estimador de punto 
tomado de una muestra. (pag. +77) 


point estimator Statistic that provides an estimate of a popula- 
tion parameter. (p. 477) 


estimador de punto Estadistica que nos da un estimado de un 
parametro de la poblacién. (pag. +77) 


pooled or combined sample proportion The overall proportion 
of successes in the two samples is 


count of successes in both samples combined —_X, + Xp 


pc 


count of individuals in both samples combined =n; + 12 


(p. 621) 


proporcién combinada de la muestra La proporcién total de aci- 
ertos en las dos muestras es 


cuenta de aciertos en 
~ _ ambas muestras combinadas _ X14 
Cc cuenta de individuos en ny 4 
ambas muestras combinadas 


(pag. 621) 


population In a statistical study, the entire group of individuals 
we want information about. (p. 210) 


poblacién En un estudio estadistico, la poblacién es el grupo 
completo de individuos sobre el cual deseamos contar con infor- 
macion. (pag. 210) 


population distribution Gives the values of the variable for all 
the individuals in the population. (p. 428) 


distribucién de la poblacién Presenta los valores de la variable 
para todos los individuos en la poblacion. (pag. 428) 


population regression line Regression line j1, = a + Gx based 
on the entire population of data. (p. 739) 


linea de regresion de la poblacién La linea de regresion pu, = a + Gx 


basada en la totalidad de la poblacion de datos. (pag. 739) 


positive association When above-average values of one variable 
to accompany above-average values of the other and also of be- 
low-average values to occur together. (p. 148) 


asociaci6n positiva Cuando los valores por encima del promedio 
de una variable tienden a acompaiiar a los valores por encima del 
promedio de la otra, y los valores por debajo del promedio tam- 
bién tienden a suceder juntos. (pég. 148) 


power The probability that a test will reject Hp at a chosen sig- 
nificance level a when a specified alternative value of the pa- 
rameter is true. The power of a test against any alternative is | 
minus the probability of a Type II error for that alternative; that is, 
power = | — £. (p. 565) 


poder La probabilidad de que una prueba rechace Hp en un niv- 
el de significancia dado cuando un valor alternativo especificado 
del pardmetro es verdad. El poder de una prueba con respecto a 
cualquier alternativa es 1 menos la probabilidad de un error ‘Tipo 
II para dicha alternativa; es decir, poder = 1 — (3. (pag. 565) 
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power model Relationship of the form y = ax’. When experi- 
ence or theory suggests that the relationship between two vari- 
ables is described by a power model, you can transform the data 
to achieve linearity in two ways: (1) raise the values of the explana- 
tory variable x to the p power and plot the points (x?, y), or (2) 
take the pth root of the values of the response variable y and plot 
the points (x, Vy). If you don’t know what power to use, taking 
the logarithms of both variables should produce a linear pattern. 


(p. 767) 


modelo de poder Relacidn de la forma y = ax’. Cuando la ex- 
periencia o una teorfa sugiere que la relacién entre dos variables 
la describe un modelo de poder, se pueden transformar los datos 
para que logren la linealidad de dos maneras: (1) elevar los va- 
lores de la variable explicativa x a la potencia p y trazar los puntos 
(x?, y), o (2) tomar la rafz p de los valores de la variable de respu- 
esta y y trazar los puntos (x, Wy). Si no se sabe qué potencia se ha 
de usar, se toman los logaritmos de ambas variables para producir 
un patron lineal. (pag. 767) 


predicted value y (read “y hat”) is the predicted value of the re- 
sponse variable y for a given value of the explanatory variable x. 


(p. 166) 


valor proyectado jy es el valor proyectado de la variable de respu- 
esta y para un valor dado de la viariable explicativa x. (pag. 166) 


probability A number between 0 and | that describes the propor- 
tion of times an outcome of a chance process would occur in a 
very long series of repetitions. (p. 291) 


probabilidad Cifra entre 0 y 1 que describe la proporcién de 
veces que un resultado de un proceso aleatorio sucederia en 
una serie muy prolongada de repeticiones. (pag. 291) 


probability distribution Gives the possible values of a random 
variable and their probabilities. (p. 348) 


distribucién de la probabilidad Presenta los valores posibles de 
una variable aleatoria y sus posibles probabilidades. (pag. 348) 


probability model Description of some chance process that 
consists of two parts: a sample space S and a probability for each 
outcome. (p. 305) 


quantitative variable Variable that takes numerical values for 
which it makes sense to find an average. (p. 3) 


random assignment Experimental design principle. Use chance 
to assign experimental units to treatments. Doing so helps create 
roughly equivalent groups of experimental units by balancing the 
effects of other variables among the treatment groups. (p. 241) 


modelo de probabilidad Descripcién de un proceso de proba- 
bilidad que consta de dos partes: un espacio de muestra s y una 
probabilidad para cada resultado. (pag. 305) 


variable cuantitativa ‘Toma valores numéricos para los cuales 
tiene sentido hallar un promedio. (pag. 3) 


asignacion aleatoria Principio de disefio experimental. Se usa 
el azar para asignar unidades experimentales a los tratamientos 
a fin de ayudar a formar grupos de unidades experimentales 
mas o menos equivalentes al equilibrar los efectos de otras vari- 
ables entre los grupos de tratamiento. (pag. 241) 


random condition ‘The data come from a well-designed random 
sample or randomized experiment. (p. +93) 


condici6n aleatoria Los datos provienen de una muestra alea- 
toria bien disefiada o de un experimento aleatorizado. (pag. 493) 


random sampling Using a chance process to determine which 
members of a population are included in the sample. (p. 214) 


muestreo aleatorio Uso de un proceso de probabilidad para de- 
terminar cuales miembros de la poblacién se han de incluir en la 
muestra. (pag. 214) 


random variable Variable that takes numerical values that de- 
scribe the outcomes of some chance process. (p. 348) 


randomization distribution Distribution of a. statistic (like 


pi a po or xX, — X2) in repeated random assignments of experimen- 
tal units to treatment groups assuming that the specific treatment 
received doesn’t affect individual responses. When the conditions 
are met, usual inference procedures based on the sampling distri- 
bution of the statistic will be approximately correct. (p. 627) 


variable aleatoria ‘Toma valores numéricos que describen los re- 
sultados de algtin proceso de azar y probabilidad. (pag. 348) 


distribucién de la aleatoriedad La distribucién de una estadistica 
(como fp; — pz 0 X; — X2) en designaciones aleatorizadas reiteradas 
de unidades experimentales a grupos de tratamiento, asumiendo 
que el tratamiento especiffico no afecte las respuestas individuales. 
Cuando se cumplan las condiciones, los procedimientos de infer- 
encia corrientes que se basan en la distribucién del muestreo de la 
estadistica serén aproximadamente correctos. (pag. 627) 


randomized block design Experimental design begun by form- 
ing blocks consisting of individuals that are similar in some way 
that is important to the response. Random assignment of treat- 
ments is then carried out separately within each block. (p. 252) 


disefio de bloques aleatorios Se comienza con la formacién de 
bloques compuestos por individuos que son similares de alguna 
manera que sea importante para la respuesta. La asignacion alea- 
toria de tratamientos luego se realiza separadamente dentro de 
cada bloque. (pag. 252) 


range ‘The maximum value minus the minimum value for a set 
of quantitative data. (p. 54) 
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gama El] valor méximo menos el valor minimo de un conjunto 
de datos cuantitativos. (pag. 54) 


regression line Line that describes how a response variable y 
changes as an explanatory variable x changes. We often use a 
regression line to predict the value of y for a given value of x. 


(p. 164) 


linea de regresi6n Una linea que describe cémo una variable de 
respuesta y cambia a medida que cambia una variable explicativa 
x. A menudo se usa una linea de regresi6n para predecir el valor 
de y para un valor dado de x. (pag. 164) 


reject Ho If the observed result is too unlikely to occur just by 
chance when the null hypothesis is true, we can reject Ho and say 
that there is convincing evidence for Hy. (p. 544) 


rechazar Hy Si es demasiado improbable que el resultado obser- 
vado suceda por simple azar cuando la hipétesis nula es verdad, 
se puede rechazar Hg y decir que existe evidencia convincente a 
favor de Hy. (pag. 544) 


relative frequency table ‘Table that shows the percents (relative 
frequencies) of observations in each category or class. (p. 8) 


tabla de frecuencia relativa Permite apreciar los porcentajes 
(frecuencias relativas) de las observaciones en cada categoria o 
clase. (pag. 8) 


replication Experimental design principle. Use enough experi- 
mental units in each group so that any differences in the effects 
of the treatments can be distinguished from chance differences 
between the groups. (p. 242) 


replicaci6n Principio de disefio experimental. Se usan suficientes 
unidades experimentales en cada grupo a fin de que todas las dife- 
rencias en los efectos de los tratamientos se puedan distinguir de las 
diferencias de azar entre los grupos. (pag. 242) 


residual Difference between an observed value of the response 
variable and the value predicted by the regression line: 


residual = observed y — predicted y = y — y 


(p. 169) 


residual La diferencia entre un valor observado de la variable de 
respuesta y el valor proyectado por la linea de regresién. Es decir, 


residual = observada y — proyectada y = y — y. 


(pag. 169) 


residual plot Scatterplot of the residuals against the explanatory 
variable. Residual plots help us assess whether a linear model is 
appropriate. (p. 173) 


grafico residual Grdfico de dispersién de los residuales en com- 
paracién con la variable explicativa. Los trazados residuales nos 
ayudan a evaluar si el modelo lineal es apropiado. (paég. 173) 


resistant measure Statistic that is not affected very much by ex- 
treme observations. (p. 50) 


medida resistente Estadfstica que no se ve muy afectada por ob- 
servaciones extremas. (pag. 50) 


response bias Systemic pattern of inaccurate answers. (p. 227) 


sesgo de la respuesta Patron sistémico de respuestas imprecisas. 
(pag. 227) 


response variable Variable that measures an outcome of a study. 


(pp. 143, 236) 


variable de respuesta Variable que mide un resultado de un es- 
tudio. (pags. 143, 236) 


roundoff error Difference between the calculated approxima- 
tion of a number and its exact mathematical value. (p. 8) 


sample Subset of individuals in the population from which we 
actually collect data. (p. 210) 


error de redondeo La diferencia entre la aproximacién com- 
putada y su valor matematico exacto. (pag. 8) 


muestra Subconjunto de individuos en la poblacién a partir de la 
cual de hecho se recogen datos. (pag. 210) 


sample regression line (estimated regression line) Least-squares 
regression line y=a+ bx computed from the sample data. 


(p. 739) 


sample space S Set of all possible outcomes of a chance process. 


(p. 305) 


linea de regresién de la muestra (linea de regresi6n estimada) 
La linea de regresidn de mfnimos cuadrados y = a + bx com- 
putada a partir de los datos de la muestra. (pag. 739) 


espacios S de la muestra E] conjunto de todos los resultados po- 
sibles de un proceso de probabilidad. (pag. 305) 


sample survey Study that uses an organized plan to choose a 
sample that represents some specific population. We base con- 
clusions about the population on data from the sample. (p. 210) 


valoraci6n de la muestra Estudio que usa un plan organizado 
para escoger una muestra que represente una poblacién especffi- 
ca. Basamos las conclusiones sobre la poblacién en datos tomados 
de la muestra. (pag. 210) 
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sampling distribution The distribution of values taken by a sta- 


tistic in all possible samples of the same size from the same popu- 
lation. (p. 427) 


distribuci6n del muestreo La distribucién de valores tomados 
por la estadistica en todas las muestras posibles del mismo tamafio 
tomadas de la misma poblacion. (pag. 427) 


sampling distribution of a sample mean x Suppose that x is 
the mean of an SRS of size n drawn from a large population with 
mean yj and standard deviation o. Then 


e The mean of the sampling distribution of x is uz = pu. 


e ‘The standard deviation of the sampling distribution of x is 


1 
as long as the 10% condition is satisfied: n = 0 


e If the population has a Normal distribution, then the sampling 
distribution of x also has a Normal distribution. Otherwise, the 
central limit theorem tells us that the sampling distribution of 
x will be approximately Normal in most cases when n = 30. 


(p. 452) 


distribucién de muestreo de la media de una muestra x Su- 
pongamos que x es la media de una muestra aleatoria sencilla de 
tamafio n tomada de una poblacién grande con media js y una 
desviacion estandar a. Entonces 


¢ La media de la distribucién del muestreo de x es uz = pu. 


e La desviacion estandar de la distribucién del muestreo de x es 


1 
siempre y cuando se cumpla con la condicién del 10%: n = rT : 


e Sila poblacién tiene una distribucién normal, entonces la distri- 
bucion del muestreo x también tiene una distribuci6n normal. 
De no ser asf, el teorema del limite central nos indica que en la 
mayoria de los casos, la distribuci6n del muestreo x sera aproxi- 
madamente normal en la mayoria de casos cuando n = 30. 


sampling distribution of a sample proportion fp Choose an 
SRS of size n from a population of size N with proportion p of 
successes. Let p be the sample proportion of successes. Then 


¢ The mean of the sampling distribution of p is bg = p. 


¢ The standard deviation of the sampling distribution of f is 


_ (pap) 
FON on 


1 
as long as the 10% condition is satisfied: n = 0. 


¢ As n increases, the sampling distribution of / becomes ap- 
proximately Normal. Before you perform Normal calculations, 
check that the Large Counts condition is satisfied: np = 10 and 
n(1 — p) = 10. (p. 444) 


distribucion del muestreo de una proporcién de la muestra p 
Se escoge una muestra aleatoria sencilla de tamaiio n a partir de 
una poblacién de tamafio N con proporcién p de aciertos. Per- 
mita que fp sea la proporcién de aciertos de la muestra. Entonces 


¢ La media de la distribucion del muestreo de p es Lig = p. 


¢ La desviacién estandar de la distribucién del muestreo de f es 


_ [pd =P) 
Nn 


siempre y cuando que se cumpla con la condicién del 10%: 
1 
ns 7 oN 
¢ A medida que aumenta n, la distribucién del muestreo de p se 
torna aproximadamente normal. Antes de realizar los cémputos 
normales, se verifica que se haya cumplido con la condicién de 
cuentas grandes: np = 10 y n(1 — p) = 10. (pag. 444) 


sampling distribution of a slope Choose an SRS of n observa- 
tions (x, y) from a population of size N with least-squares regres- 
sion line py = a + Gx. Let b be the slope of the sample regression 
line. Then 


e The mean of the sampling distribution of b is up = (. 
e The standard deviation of the sampling distribution of b is 


O,= —? as long as the 10% condition is satisfied: n = Jy 
o.Vn 10 
e The sampling distribution of b will be approximately Normal if 
the values of the response variable y follow a Normal distribu- 
tion for each value of the explanatory variable x (the Normal 
condition). (p. 741) 


distribucién del muestreo de una pendiente Se escoge una 
muestra aleatoria sencilla de n observaciones (x, y) a partir de una 
poblacién tamaiio N con linea de regresién de minimos cuadra- 
dos j1, = a + Gx. Se permite que b sea la pendiente de una linea 
de regresion de la muestra. Entonces 


e La media de la distribucién del muestreo de b es pu, = 8. 


e La desviacion estandar de la distribucién del muestreo de b es 


o, a 
0» = ——= siempre y cuando se cumpla con la condicién del 
o,Vn 


1 
10%:n= To: 


¢ La distribucién del muestreo de b serd aproximadamente 
normal si los valores de la variable de respuesta y siguen una 
distribucién normal para cada valor de la variable explicativa x 
(la condicién normal). (pag. 741) 


sampling distribution of /; — fp. Choose an SRS of size nj 

from population | with proportion of successes p; and an in- 

dependent SRS of size nz from population 2 with proportion 
of successes p2. 

e Shape: When 1 fj, 7)(1—p)), n2p2, and 12(1—pz2) are all at 
least 10, the sampling distribution of p; — pz is approximately 
Normal. 

¢ Center: The mean of the sampling distribution is p; — pz. 

¢ Spread: The standard deviation of the sampling distribution of 


Pi — pr 1s 


= — pi) p21 — p2) 
ny 


n2 


as long as each sample is no more than 10% of its population. 


(p. 612) 
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distribucion del muestreo p; — pz Se escoge una muestra 
aleatoria sencilla de tamafio n) a partir de una poblacion | con 
una proporcion de aciertos p; y una muestra aleatoria sencilla 
independiente de tamafio nz a partir de la poblacién 2 con una 
proporcién de aciertos pp. 


¢ Forma: Cuando 7) fj, 1)(1- pj), n2p2, and n2(1-p2) son por lo 


menos 10, la distribucién del muestreo de py — pz es aproxi- 
madamente normal. 


¢ Centro: La media de la distribucién del muestreo es p — p2. 
¢ Amplitud: La desviacién estandar de la distribucién del 
muestreo de fp; — p2 es 


re — p)) p21 — po) 


ny) n2 


siempre y cuando cada muestra es de no mas del 10% de su 
poblacién. (pag. 612) 


sampling distribution of x; — x, Choose an SRS of size n, from 

population | with mean jz; and standard deviation o; and an in- 

dependent SRS of size nz from population 2 with mean jz and 

standard deviation 02. 

e Shape: When the population distributions are Normal, the 
sampling distribution of x; — x2 is Normal. In other cases, the 
sampling distribution of x; — x2 will be approximately Normal 
if the sample sizes are large enough (n; = 30 and n; = 30). 


¢ Center: The mean of the sampling distribution is ju) — fu. 


e Spread: The standard deviation of the sampling distribution of 
X] — X2 is 


2 
oT 


ny 


as long as each sample is no more than 10% of its population. 


(p. 638) 


distribucién de muestreo de x; — x, Se escoge una muestra 
aleatoria sencilla de tamafio a partir de una poblacién | con una 
media js) y una desviacién estandar o; y una muestra aleatoria 
sencilla independiente de tamajfio nz a partir de una poblacién 2 
con una media /12 y una desviacion estandar 02. 


¢ Forma: Cuando las distribuciones de la poblacién son 
normales, la distribucién del muestreo de x; — x2 es normal. 
En otros casos, la distribucién del muestreo de x, — x serd 
aproximadamente normal si los tamaiios de las muestras son 
suficientemente grandes (nm, = 30 y nz = 30). 


¢ Centro: La media de la distribucién del muestreo es 4) — [12. 


¢ Amplitud: La desviacién estandar de la distribucién del 
muestreo de x} — x2 es 


siempre y cuando cada muestra es de no mas del 10% de su 
poblacién. (pag. 638) 


sampling variability The value of a statistic varies in repeated 
random sampling. (p. 425) 


variabilidad del muestreo El valor de una estadistica varfa en 
muestras aleatorias reiteradas. (pag. 425) 


scatterplot Plot that shows the relationship between two quanti- 
tative variables measured on the same individuals. ‘The values of 
one variable appear on the horizontal axis, and the values of the 
other variable appear on the vertical axis. Each individual in the 
data appears as a point in the graph. (p. 145) 


grafico de dispersi6n Permite apreciar la relacion entre dos vari- 
ables cuantitativas midiendo los mismos individuos. Los valores 
de una variable figuran en el eje horizontal, y los valores de la 
otra variable figuran en el eje vertical. Cada individuo en los datos 
figura como un punto en el grafico. (pag. 145) 


segmented bar graph Graph used to compare the distribution of 
a categorical variable in each of several groups. For each group, 
there is a single bar with “segments” that correspond to the differ- 
ent values of the categorical variable. The height of each segment 
is determined by the percent of individuals in the group with that 
value. Each bar has a total height of 100%. (p. 17) 


grafico de barras segmentado Se usa para comparar la distribu- 
ci6n de una variable categorizada en cada uno de varios grupos. 
Para cada grupo, hay una sola barra que tiene “segmentos” que 
corresponden a los diferentes valores de la variable categorizada. 
La altura de cada segmento la determina el porcentaje de indi- 
viduos en el grupo que tengan ese valor. Cada barra tiene una 
altura total del 100%. (pag. 17) 
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side-by-side bar graph Graph used to compare the distribution 
of a categorical variable in each of several groups. For each value 
of the categorical variable, there is a bar corresponding to each 
group. The height of each bar is determined by the count or per- 
cent of individuals in the group with that value. (p. 17) 


grafico de barras contiguas Se usa para comparar la distribucion 
de una variable categorizada en cada uno de varios grupos. Para 
cada valor de la variable categorizada, hay una barra que correspon- 
de a cada grupo. La altura de la barra la determina el cuenteo 0 el 
porcentaje de individuos en el grupo que tengan ese valor. (pag. 17) 


significance level Fixed value a that we use as a cutoff for de- 
ciding whether an observed result is too unlikely to happen by 
chance alone when the null hypothesis is true. The significance 
level gives the probability of a Type I error. (p. 545) 


nivel de significancia Valor fijo a@ que se usa como punto de 
corte para decidir si un resultado observado es demasiado improb- 
able para suceder solo al azar cuando la hipotesis nula es verdad. 
E] nivel de significancia genera la probabilidad de un error Tipo 
1. (pag. 545) 


significance test Procedure for using observed data to decide be- 
tween two competing claims (also called hypotheses). The claims 
are often statements about a parameter. (p. 539) 


prueba de significancia Procedimiento en el que se usan datos 
observados para decidir entre dos opciones que compiten entre sf 
(también se les dice hipétesis). Las opciones a menudo son enun- 
ciados acerca de un pardmetro. (pag. 539) 


simple random sample (SRS) Sample chosen in such a way 
that every group of n individuals in the population has an equal 
chance to be selected as the sample. (p. 214) 


muestra aleatoria sencilla Muestra tomada de tal manera que 
cada grupo de n individuos en la poblacién tenga la misma opor- 
tunidad de ser escogido como la muestra. (pag. 214) 


simulation Imitation of chance behavior, based on a model that 
accurately reflects the situation. (p. 295) 


simulaci6én Imitacién de conducta de azar, basada en un modelo 
que refleja la situacion con precision. (pag. 295) 


single-blind An experiment in which either the subjects or those 
who interact with them and measure the response variable, but 
not both, know which treatment a subject received. (p. 248) 


ciego sencillo Experimento en el que ya sea los sujetos o bien 
aquellos que interacttian con los sujetos y miden la variable de re- 
spuesta, pero no ambos, saben cuél fue el tratamiento que recibié 
un sujeto. (pag. 248) 


skewness A distribution is skewed to the right if the right side of 
the graph (containing the half of the observations with larger val- 
ues) is much longer than the left side. It is skewed to the left if the 
left side of the graph is much longer than the right side. (p. 27) 


asimetria Distribucién que estd sesgada hacia la derecha si \a 
derecha del grafico (que contiene la mitad de las observaciones 
con valores mas grandes) es mucho mas larga que el lado izquier- 
do. Esta sesgada hacia la izquierda si el lado izquierdo del grafico 
es mucho més largo que el lado derecho. (pag. 27) 


slope Suppose that y is a response variable (plotted on the verti- 
cal axis) and x is an explanatory variable (plotted on the horizontal 
axis). A regression line relating y to x has an equation of the form 
y =a + bx. In this equation, b is the slope, the amount by which 
y is predicted to change when x increases by one unit. (p. 166) 


pendiente Supongamos que y es una variable de respuesta (traza- 
da en el eje vertical) y x es una variable explicativa (trazada en 
el eje horizontal). Una linea de regresi6n que relacione y con x 
tiene una ecuacion en la forma de y = a + bx. En esta ecuacion, 
b es la pendiente, la cantidad mediante la cual se predice que y 
cambiaré cuando x tenga un aumento de una unidad. (pag. 166) 


splitting stems Method for spreading out a stemplot that has too 
few stems. (p. 32) 


tallos separados Método para separar un grafico de tallos que 
tiene una carencia de tallos. (pag. 32) 


standard deviation s, Statistic that measures the typical distance 
of the values in a distribution from the mean. It is calculated by 
finding an “average” of the squared distances and then taking the 
square root. In symbols, 


x= Sai 
(p. 61) 


desviacion estandar s, E:stadistica que mide la distancia tipica de 
F q p 
los valores en una distribuci6n a partir de la media. Se computa 
P P 
hallando un “promedio” de las distancias al cuadrado a las que 
luego se les computa la rafz cuadrada. En sfmbolos se representa 
Pp P 


5 = > (x; ~~ ra 
Vn—-1 
(pag. 61) 


standard deviation of a random variable Square root of the 
variance of a random variable og. The standard deviation mea- 
sures the typical distance of the values in a distribution from their 
mean. In symbols, 


Ox = V>G; _ Lx)’ 


(p. 353) 


desviaci6n estandar de una variable aleatoria La rafz cuadrada 
de la variacién de una variable aleatoria of. La desviacion estén- 
dar mide la distancia tipica de los valores en una distribucién a 
partir de su media. En sfmbolos se represent 


Ox = V3 — px)’ pi 
(pag. 353) 


standard deviation of the residuals (s) If we use a least-squares 
line to predict the values of a response variable y from an explana- 
tory variable x, the standard deviation of the residuals (s) is given 


by 
= 
n—-2 


This value gives the approximate size of a “typical” prediction 
error (residual). (p. 177) 
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desviacién estandar de las residuales (s) Si se hace uso de la linea 
de cuadrados minimos para predecir los valores de una variable de 
respuesta y a partir de una variable explicativa x, la desviaci6n estén- 
dar de las residuales (s) la da 


= >o- 7 


n—-2 


Este valor ofrece un tamafio aproximado de un error de predic- 
cion “tfpico” (residual). (pag. 177) 


standard error When the standard deviation of a statistic is esti- 
mated from data, the result is the standard error of the statistic. 


(p. 497) 


error estandar Cuando se computa la desviacion esténdar de una 
estadistica a partir de datos, el resultado es el error estandar de la 
estadistica. (pag. 497) 


standard error of fp; — pz Estimated standard deviation of the 
statistic fp; — pr, given by 


on ® 


ny) nz 


(p. 616) 


error estandar de fp; — fz La desviacion estandar coputada de la 
estadistica p; — 2, dada por 


oe ee 


nN) 12 


(pag. 616) 


ae 
Vn 
standard deviation. It describes how far x will typically be from ju 
in repeated SRSs of size n. (p. 518) 


standard error of the sample mean where s, is the sample 


Pp ‘ Sx 
error estandar de la media de la muestra —= en el que s, es la 
n 


desviaci6n estandar de la muestra. Describe la distancia tfpica en 
la que x se separa de jz en muestras aleatorias sencillas reiteradas 
de tamanio n. (pag. 518) 


p(l — p) ‘ 
standard error of the sample proportion oe where p 


is the sample proportion. It describes how far f will typically be 
from p in repeated SRSs of size n. (p. +96) 


_. pl = p) 
error estandar de la proporcién de la muestra \ /— 

n 
en la que f es la proporcion de la muestra. Se describe la distancia 
tipica en la que f se separa de p en muestras aleatorias sencillas 


reiteradas de tamafio n. (pag. +96) 


standard error of the slope Formula used to estimate the spread 
of the sampling distribution of b: 


s 
SE, = ———== 
aVn —] 

(p. 747) 


error estandar de la pendiente Se usa para computar la amplitud 
de la distribucion del muestreo de b: 


s 
SE, = ——== 
Wn = 


(pag. 747) 


standard error of x; — x2 Estimated standard deviation of the 
statistic x) — Xz, given by 


(p. 640) 


error estandar de x; — x2 La desviaci6n esténdar computada de 
la estadistica x} — x2, dada por 


2 2 
ST 8? 
—+ ps 


ny n2 


(pag. 640) 


standard Normal distribution Normal distribution with mean 0 
and standard deviation 1. (p. 113) 


distribuci6n normal estandar La distribucién normal con me- 


dia de 0 y desviacion estandar 1. (pag. 113) 


standard Normal table (Table A) Table of areas under the stan- 
dard Normal curve. The table entry for each value z is the area 
under the curve to the left of z. (p. 113) 


tabla normal est4ndar (Tabla A) Tabla de areas debajo de la 
curva normal estandar. La entrada a la tabla de cada valor z es el 
area debajo de la curva a la izquierda de z. (pag. 113) 


standardized score (z-score) If x is an observation from a distri- 
bution that has known mean and standard deviation, the stan- 


x — mean : 
. A standardized 


dardized value of x is z= —_ 
standard deviation 


value is often called a z-score. (p. 90) 


puntuaci6n estandarizada Si x es una observacion a partir de 
una distribucion que tiene una media y desviacién esténdar cono- 
x — media 


cidas, el valor estandarizado de x es z = otis P . 
desviacion estandar 


valor estandarizado a menudo se le dice puntuacion z. (pag. 90) 
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statistic Number that describes some characteristic of a sample. 


(p. 424) 


estadistica Ntimero que describe alguna caracteristica de una 
muestra. (pag. 424) 


statistically significant (1) Observed effect so large that it would 
rarely occur by chance. (p. 249) 

(2) If the P-value is smaller than alpha, we say that the results of 
a statistical study are significant at level a. In that case, we reject 
the null hypothesis Hp and conclude that there is convincing evi- 
dence in favor of the alternative hypothesis H,. (p. 545) 


estadisticamente significativo (1) Efecto observado que es tan 
grande que raramente sucederia producto del azar. (pag. 249) 
(2) Si el valor P es menor que alfa, se dice que los resultados de 
un estudio estadistico son significativos al nivel a. En tal caso, se 
rechaza la hipétesis nula Hy y se concluye que hay evidencia con- 
vincente a favor de la hipétesis H, alternativa. (pag. 545) 


stemplot (also called stem-and-leaf plot) Simple graphical dis- 
play for fairly small data sets that gives a quick picture of the shape 
of a distribution while including the actual numerical values in 
the graph. Each observation is separated into a stem, consisting 
of all but the final digit, and a leaf, the final digit. The stems are 
arranged in a vertical column with the smallest at the top. Each 
leaf is written in the row to the right of its stem, with the leaves 
arranged in increasing order out from the stem. (p. 31) 


grafico de tallos al que también se le dice grafico de tallos y 
hojas Representacién grafica sencilla de conjuntos de datos rela- 
tivamente pequefios que dan una imagen raépida de la forma de 
una distribuci6n, al tiempo que incluyen los valores numéricos 
mismos en el grafico. Cada observacion se separa en un tallo com- 
puesto de todos menos el ultimo digito, y una hoja, que es ese 
Ultimo digito. Los tallos se disponen en una columna vertical en 
la cual el valor mas pequefio estd arriba. Cada hoja se escribe en 
el rengl6n a la derecha del tallo, con las hojas dispuestas en orden 
ascendiente comenzando a partir del tallo. (pag. 31) 


stratified random sample Sample obtained by classifying the 
population into groups of similar individuals, called strata, then 
choosing a separate SRS in each stratum and combining these 
SRSs to form the sample. (p. 219) 


muestra aleatoria estratificada Muestra que se obtiene clasifi- 
cando la poblacién en grupos de individuos parecidos, Ilamados 
estratos. Luego, se escoge una muestra aleatoria sencilla separada 
en cada estrato y se combinan estas muestras aleatorias sencillas 
para conformar la muestra. (pag. 219) 


subjects Experimental units that are human beings. (p. 237) 


sujetos Unidades experimentales que son seres humanos. 
(pag. 237) 


symmetric A graph in which the right and left sides are approxi- 
mately mirror images of each other. (p. 27) 


t distribution Draw an SRS of size n from a large population that 
has a Normal distribution with mean ju and standard deviation 
a. The statistic 


Vin 
has the t distribution with degrees of freedom df = n — 1. This 


statistic will have approximately a t,,_; distribution if the sample 
size is large enough. (p. 512) 


simétrico Si los lados derecho e izquierdo de un grafico son 
reflejos aproximados uno del otro, se dice que son simétricos. 


(pag. 27) 


distribucién ¢ Se grafica una muestra aleatoria sencilla de 
tamafio n a partir de una poblacién grande que tiene una 
distribucién normal con media de js y desviacién esténdar 
o. La estadistica 


tiene la distribucién t con grados de libertad df = n — 1. Esta 
estadistica tendra una distribucién aproximada t,,—; si el tamaiio 
de la muestra es suficientemente grande. (pag. 512) 


t interval for the slope @ When the conditions for regression 
inference are met, a C% confidence interval for the slope @ of the 
population (true) regression line is b + t*SE,. In this formula, 
the standard error of the slope is 


SE, = 2 


sVn =1 


and t* is the critical value for the ¢ distribution with df = n — 2 
having C% of its area between —¢* and t*. (p. 548) 


intervalo t para la pendiente G (pdg. 748) Cuando se cumple 
con las condiciones para lograr una inferencia de regresién, el 
intervalo de confianza C% para la pendiente ( de la linea de 
regresion de la poblacion (verdadera) es b + t*SE,. En esta f6r- 
mula, el error estandar de la pendiente es 


SE, = —— 


s 
Vn =] 


y t* es el valor critico para la distribuci6n t y df = n — 2 tiene el 
C&% de su area entre —t* y t*. (pag. 548) 


t test for the slope Suppose the conditions for inference are met. 
‘To test the hypothesis Ho: 8 = {, compute the test statistic 


_ by 
SE, 
Find the P-value by calculating the probability of getting a t statis- 


tic this large or larger in the direction specified by the alternative 
hypothesis H,. Use the ¢ distribution with df = n — 2. (p. 753) 
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prueba ¢ para la pendiente Supongamos que se cumple con 
todas las condiciones para la inferencia. Para poner a prueba la 
hipotesis Hp: G = Go, se computa la estadfstica de prueba 


_ b= By 
SE, 


Se halla el valor P computando la probabilidad de obtener una 
estadistica t de este tamafio o mas grande en el sentido especifica- 
do por la hipétesis alternativa H,. Se usa la distribucién t con 


df = n — 2. (pag. 753) 


test statistic Calculation that measures how far a sample statis- 
tic diverges from what we would expect if the null hypothesis Ho 
were true, in standardized units. That is, 


statistic — parameter 


test statistic = aa ee 
standard deviation of statistic 


(p. 556) 


estadistica de prueba Mide la divergencia entre la estadistica de 
muestra y lo que esperarfamos si la hipétesis nula Hp fuera verdad, 
expresado en unidades estandarizadas. Es decir, 


estadistica — parametro 


estadistica de prueba = — - 
P desviacién estandar de la estadistica 


(pag. 556) 


third quartile Q3 In a data set in which the observations are 
ordered from lowest to highest, the median of the observations 
whose position is to the right of the median. (p. 54) 


tercer cuartil Q3 Si las observaciones en un conjunto de datos 
se organizan de la mas baja a la mas alta, el tercer cuartil Q3 es la 
media de las observaciones cuya posicion esta a la derecha de la 
media. (pag. 54) 


transforming Applying a function such as the logarithm or 
square root to a quantitative variable. (p. 767) 


transformaci6n La aplicacién de una funcién tal como el loga- 
ritmo o la rafz cuadrada a una variable cuantitativa se denomina 
transformacién del dato. (pdg. 767) 


treatment Specific condition applied to the individuals in an 
experiment. If an experiment has several explanatory variables, 
a treatment is a combination of specific values of these variables. 


(p. 237) 


tratamiento Una condicién especifica que se les aplica a los 
individuos en un experimento. Si un experimento tiene varias 
variables explicativas, el tratamiento es una combinacioén de los 
valores especifficos de estas variables. (pag. 237) 


tree diagram Diagram used to display the sample space for a 
chance process that involves a sequence of outcomes. (p. 322) 


diagrama de drbol Se usa para mostrar el espacio de muestra 
para un proceso de probabilidad que produce una secuencia de 
resultados. (pag. 322) 


two-sample t interval for a difference between two means When 
the Random, 10%, and Normal/Large Sample conditions are 
met, an approximate C% confidence interval for f4, — juz is 


9. 2 
oa sj 83 
a= 22) eft i 
14] 2 


Here ¢* is the critical value with area C% between — t* and t* for 
the ¢ distribution with degrees of freedom from either Option 1 (tech- 
nology) or Option 2 (the smaller of n; — 1 and n; — 1). (p. 641) 


intervalo t de dos muestras para obtener la diferencia entre dos 
medias Cuando se cumple con las condiciones aleatoria, del 
10% y de muestra normal/grande, un intervalo de confianza C% 
aproximado pata ju) — /u2 es 


2 2 
= = ue ST 2 
(x) — X%2) # rf +— 
Ny nz 


en el que ¢* es el valor critico con drea C% entre — t* y t* para la 
distribuci6n t con grados de libertad ya sea la Opcién | (tecnologia) 
o la Opcién 2 (el valor menor de 1; — 1 y nz — 1). (pag. 641) 


two-sample t statistic When we standardize the estimate x, — x2, 
the result is the two-sample t statistic 


_ G1 = x2) — (a = ba) 


st. 83 
2 
i ahs 20 
ny nz 

The statistic t has the same interpretation as any z or t statistic: it 


says how far x — x2 is from its mean in standard deviation units. 


(p. 640) 


estadistica t de dos muestras Cuando se estandariza el estimado 
X | — X2, el resultado es una estadistica t de dos muestras 


_ G1 = x2) — Gn = fa) 


z 2 
ST 85 
—+— 


ny) nz 
La estadistica t tiene la misma interpretacién de cualquier es- 
tadistica z o t: indica la distancia de x; — x2 a partir de su media 
en unidades de desviacién estandar. (pag. 640) 
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two-sample t¢ test for the difference between two means Sup- 
pose the Random, 10%, and Normal/Large Sample conditions 
are met. To test the hypothesis Ho: ft; — juz = hypothesized value, 
compute the two-sample ¢ statistic 


(x) — X2) — (fu) — pr) 


Find the P-value by calculating the probability of getting a t sta- 
tistic this large or larger in the direction specified by the alterna- 
tive hypothesis H,. Use the t distribution with degrees of freedom 
approximated by technology or the smaller of n) — | and nz — 1. 


(p. 645) 


prueba t de dos muestras para obtener la diferencia entre dos 
medias Supongamos que se ha cumplido con las condiciones 
aleatoria, del 10% y de muestra normal/grande. Para someter a 
prueba la hipétesis Ho: 4) — (42 = valor hipotético, se computa la 
estadistica t de dos muestras 


(x1 = X2) — (ta — Ha) 
2 <2 
j Bt BE 
Ny 2 
Halla el valor P computando la probabilidad de obtener una es- 
tadistica t de este tamafio o mas grande en el sentido especificado 
por la hipotesis H, alternativa. Se usa la distribucién t con grados 


de libertad aproximados por la tecnologia o el valor menor de 
ny — ly nz — 1. (pag. 645) 


two-sample z interval for a difference between two proportions 
When the Random, 10%, and Large Counts conditions are met, 
an approximate C% confidence interval for p; — pz is 


en pi(1— pi) px(1 — pr) 
b-men/t Pi p p 


ny) 2 


where z”* is the critical value for the standard Normal curve with 
C% of its area between —z* and z*. (p. 617) 


intervalo z de dos muestras para obtener la diferencia entre dos 
proporciones Cuando se cumplen las condiciones de aleatorio, 
del 10% y de muestras grandes, el intervalo de confianza C% 
aproximado para p; — p2 es 


— naa — pi) , bo - py 


(p1 — p2) + z* 


ny n? 


en el que z* es el valor critico para la curva esténdar normal con 
C&% de su area entre —z* y z*. (pag. 617) 


two-sample z test for the difference between two propor- 
tions Suppose the Random, 10%, and Large Counts conditions 
are met. To test the hypothesis Ho: p; — p2 = 0, first find the 
pooled proportion fe of successes in both samples combined. 
‘Then compute the z statistic 


' (pi — p2) — 0 
7 — bo) , Pcl = po) 


ny 12 


Find the P-value by calculating the probability of getting a z statis- 
tic this large or larger in the direction specified by the alternative 
hypothesis H,. (p. 622) 


prueba z de dos muestras para obtener la diferencia entre dos 
proporciones Supongamos que se cumplen las condiciones 
aleatoria, del 10% y de muestras grandes. Para someter a prueba 
la hipétesis Ho: p; — p2 = 0, primero se halla la proporcién com- 
binada fg de aciertos en ambas muestras combinadas. Luego se 
computa la estadistica z 


(p1 — p2) — 0 
= cll = fo) 


ny n2 


r= 
& 


Se halla el valor P computando la probabilidad de obtener una es- 
tadistica z de este tamafio o mas grande en el sentido especificado 
por la hipotesis H, alternativa. (pag. 622) 


two-sided alternative hypothesis ‘The alternative hypothesis is 
two-sided if it states that the parameter is different from the null 
value (it could be either smaller or larger). (p. 541) 


hipotesis alternativa bilateral La hipotesis alternativa es bilateral 
si indica que el pardmetro es diferente del valor nulo (podria ser 
mas pequefio o mas grande). (pag. 541) 


two-way table ‘Table of counts that organizes data about two cat- 
egorical variables. (p. 12) 


tabla de doble via Una tabla de doble via de cuentas organiza los 
datos acerca de dos variables categorizadas. (pag. 12) 


Type I error Occurs if we reject Hp when Hp is true. (p. 547) 


error Tipo I Sucede si rechazamos Hy cuando Hp es verdad. 
(pag. 547) 


Type II error Occurs if we fail to reject Hy when H, is true. 
(p. 547) 


unbiased estimator A statistic used for estimating a parameter is 
unbiased if the mean of its sampling distribution is equal to the 
true value of the parameter being estimated. (p. 429) 


error Tipo II Sucede si no rechazamos Hy cuando H, es verdad. 
(pag. 547) 


estimador sin sesgo La estadistica que se usa para computar un 
parametro es un estimador sin sesgo si la media de distribucién de 
su muestreo equivale al valor verdadero del parémetro que se esté 
computando. (pag. 429) 


undercoverage Occurs when some members of the population 
cannot be chosen in a sample. (p. 225) 


Glossary/Glosario  G-25 


subcobertura Sucede cuando algunos miembros de la poblacién 
no pueden ser escogidos para la muestra. (pag. 225) 


unimodal A graph of quantitative data with a single peak. (p. 28) 


unimodal Grdfico de datos cuantitativos con un solo pico. (pag. 28) 


union The union of events A and B, denoted by A U B, consists 
of all outcomes in A or B or both. (p. 311) 


variability of a statistic Spread of a statistic’s sampling distribu- 
tion. Statistics from larger samples have less variability. (p. 433) 


variable Any characteristic of an individual. A variable can take 
different values for different individuals. (p. 2) 


variance of a random variable ox Weighted average of the 
squared deviations of the values of the variable from their mean. 
In symbols, 


ox = > (qj - Lx)" pi 
(p. 352) 


unién La unidén de los eventos A y B, denotados por A U B, con- 
siste en todos los resultados en A o B, 0 ambos. (pag. 311) 


variabilidad de una estadistica La variabilidad de una estadisti- 
ca se describe por la amplitud de la distribucién de su muestreo. 
Las estadisticas de muestras més grandes tienen menos variabili- 


dad. (pag. 433) 


variable ‘Toda caracterfstica de un individuo. Una variable puede 
tener diferentes valores para diferentes individuos. (pag. 2) 


ae 3 ‘ 2 3 
variacion de una variable aleatoria ox Promedio sopesado de 
las desviaciones cuadraticas de los valores de la variable a partir 
de su media. En simbolos se expresa 


ox = > (x; — px)’ pi 
(pag. 352) 


variance s? “Average” squared deviation of the observations in a 
data set from their mean. In symbols, 


y\2 4 ! 2 
(x2 x) Vise T an x) = SG x) 


n—-1 


(p. 61) 


. se ‘2 : 4 “4: “ . 
variacion s; La desviacién cuadratica “promedio” de las observa- 
ciones en el conjunto de datos segtin su separaci6n de la media. 
En sfmbolos se expresa 


+ (xy — x2 +. + (Xn — XY 


~= FSS (=) 


n= 


(pag. 61) 


Venn diagrams Used to display the sample space for a chance 
process. Venn diagrams can also be used to find probabilities in- 
volving events A and B. (p. 314) 


diagramas Venn Se usan para mostrar el espacio de muestras 
para un proceso de probabilidad. Los diagramas Venn también 
se pueden usar para hallar las probabilidades que existen para los 
eventos A y B. (pag. 314) 


voluntary response sample People decide whether to join a sam- 
ple by responding to a general invitation. (p. 212) 


wording of questions Most important influence on the answers 
given in a survey. Confusing or leading questions can introduce 
strong bias, and changes in wording can greatly change a survey’s 
outcome. Even the order in which questions are asked matters. 


(p. 227) 


y intercept Suppose that y is a response variable (plotted on the 
vertical axis of a graph) and x is an explanatory variable (plotted 
on the horizontal axis). A regression line relating y to x has an 
equation of the form y = a + bx. In this equation, the number 
ais the y intercept, the predicted value of y when x = 0. (p. 166) 


muestra de respuesta voluntaria La gente que decide vincularse 
a una muestra mediante una respuesta a una invitacién general- 
izada. (pag. 212) 


terminologia de las preguntas La influencia més importante 
sobre las respuestas que se dan en un sondeo. Toda pregunta con- 
fusa, capciosa 0 que sugiera una respuesta puede introducirle al 
proceso un sesgo marcado, y los cambios en la terminologia pu- 
eden modificar de manera marcada los resultados de tal sondeo. 
Incluso los resultados se pueden ver afectados por el orden en que 
se hacen las preguntas. (pag. 227) 


interceptacién y Supongamos que y es una variable de respu- 
esta (graficada en el eje vertical) y x es una variable explicativa 
(graficada en el eje horizontal). La linea de regresion que rela- 
ciona y con x tiene una ecuacién y = a + bx. Segtin esta ecu- 
aci6n, el ntimero a es la interceptacion y, el valor proyectado de y 
cuando x = 0. (pag. 166) 
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A 


Addition 
of constant, 93-94, 95-97, 
364, 366-368 
of Normal random variables, 
378-380 
of random variables, 
370-376, 378-380 
Addition rule 
general, 310-311, 312 
for mutually exclusive events, 
308, 312, 330 
Alternative hypothesis, 539-540 
for chi-square test, 681, 710 
American Community Survey 
(ACS), 226 
Anonyinity, vs. confidentiality, 
271 
Applets 
Correlation and Regression, 
152, 170 
Mean and Median, 53 
Normal Approximation to 
Binomial, 402 
Normal Density Curve, 110, 
116 
One-Variable Statistical 
Calculator, 36 
Probability, 290 
Reese’s Pieces”, 441 
Rice University Sampling 
Distributions, 453-454, 
456 
Statistical Power, 590-591 
Test of Significance, 538 
Area under standard Normal 
curve, 115 
Association, 18-19 
confounding and, 236 
independent events and, 328 
negative, 146, 147-148, 
15] 


positive, 148, 151 
statistically significant, 249 
vs. causation, 156 
Average, vs. single 
measurement, 451, 456 


B 
Back-to-back stemplots, 32 
Bar graphs, 8-11, 16 
segmented, 17 
vs. histograms, 38 
Bayes’s theorem, 323 
Bell curve. See Normal 
distributions 
Bias, 212, 223, 225-227, 
429-432 
in experiments, 240-241 
variability and, 434-435 
Biased estimator, +3 1-432 
Bimodal distributions, 29 


Binomial coefficients, 398-400 
Binomial counts, 443 
Binomial distributions, 
388404 
definition of, 388 
mean of, 397-400 
Normal approximation for, 
402-404 
sampling and, 401-404 
standard deviation of, 
397-400 
Binomial probability, 390-396 
calculation of, 394-396 
formula for, 393 
Binomial random variables, 
382-404 
definition of, 388 
mean of, 398-400 
sampling and, 401-404 
standard deviation of, 
398-400 
variance of, 398-400 
Binomial settings, 387-389, 401 
BINS mnemonic, 388 
Blocking, 251-257 
matched pairs and, 255 
Boxplots, 57-59 


Cc 
Calculator 

for binomial coefficients, 
392-393 

for binomial probabilities, 
394-395 

for boxplots, 59 

for chi-square test for 
goodness of fit, 
689-690, 691 

for chi-square test for 
homogeneity, 706-707 

for confidence intervals, 501, 
521-522, 618-619 

for discrete random variables, 
354 

for geometric probabilities, 
406-407 

for histograms, 36-37 

for inverse t, 513-514 

for least-squares regression 
lines, 171-172 

for Normal distributions, 
116-117, 125 

for Normal probability plots, 
125 

for numerical summaries, 
63-64 

for one-proportion z test, 561 

for one-sample t intervals, 
521-522 

for one-sample t tests, 
582-583 

for P-values, 579, 686 


for residual plots, 175-176 
for scatterplots, 150, 155 
for significance test 
for difference in 
proportions, 624 
for significance test for 
slope of regression line, 
756-757 
for simple random samples, 
215-216 
for standard deviations, 62 
for transformation, 781-782 
for two-sample t interval, 
643-644 
for two-sample t test, 
647-648 
for two-way tables, 706-707 
Call-in polls, 212 
Cases, 4 
Categorical variables, 3-4 
association between, 17-19, 
711-713 
definition of, 3 
distributions of, 8, 11-18, 
697-701 
graphs for, 8-11, 16-18 
in two-way tables, 11-12, 
697-704, 711-713 
values of, 7 
vs. quantitative variables, 
508, 520, 720 
Causation 
evidence for, 268-270 
experiments and, 238, 249, 
268-270 
explanatory variables and, 
144 
inference about, 266-268. 
See also Inference 
response variables and, 144 
vs. association, 156 
vs. correlation, 190-191 
Center of distribution, 26, 
49-53, 64-65, 429-432, 
613. See also Mean; 
Median 
for difference between 
means, 637 
for difference between 
proportions, 613 
measures of, 64-65 
for sample mean, 451-453, 
459, 478 
for slope of regression line, 
740, 741 
Central limit theorem (CLT), 
456-459 
Chance behavior. See 
Randomness 
Chebyshev’s inequality, 112 
Chi-square distributions, 
684-687 


P-values from, 704-705 


Chi-square statistic, 682-684 


components of, 690-691 
for multiple comparisons, 


700-701 


Chi-square test, 680-692 


alternative hypothesis in, 681 

calculator for, 689-690, 
691 

carrying out, 687-691 

conditions for, 687-688 

expected counts and, 
680-684, 701-704 

follow-up analysis for, 690, 
710 

hypothesis statement for, 
680-681 

Large Counts condition for, 
687 

for multiple comparisons, 
697-704 

null hypothesis in, 681 

observed counts and, 
681-684 

Random condition for, 687 

for two-way tables, 697-704, 
711-713 


Chi-square test for 


homogeneity, 697, 703, 
707-710 
calculator for, 706-707 
for comparing proportions, 
710, 719-720 
conditions for, 703 
for difference between 
proportions, 710 
guidelines for, 717-721 
vs. test for independence, 
717 


Chi-square test for 


independence, 697, 
713-717 
guidelines for, 717-721 
vs. test for homogeneity, 


717 


Cluster sample, 220-222 
Coefficient 


binomial, 392-393 
of determination, 178-180 


Column variables, 12 
Combined sample proportion, 


621 


Comparative experiments, 240 
Complement rule, 307-308, 


311, 314 


Completely randomized design, 


244-247 


Components, of chi-square 


statistic, 690-691 


Conditional distributions, 


14-18, 320 
comparing, 697-701 


I-1 


|-2 Index 


Conditional probability, 
318-328 

calculation of, 319-320 

conditional distributions and, 

320 

definition of, 319 

formula for, 319-320 

independence and, 326-328 

notation for, 319 

tree diagrams and, 


322-326 


Confidence intervals, 477-489, 


493-494 

calculation of, 487 

conditions for, 493-496 

construction of, 485-487, 
496-499 

critical values for, 486 

for difference between 
means, 641-643, 648 

for difference between 
proportions, 616-619, 
623 

formula for, 486-487 

four-step process for, 
499-500 

guidelines for, 487-488 

interpretation of, 481-485 

margin of error in, +87, 500, 
501-503, 523-524 

Normal/Large Sample 
condition for, 640 

one-sided tests and, 585-586 

for population mean, 
518-521 

for population proportion, 
496-499, 563-564 

Random condition for, 640 

sample size and, 501-503, 
522-524 

significance tests and, 
563-564, 583-586 

for slope of regression line, 
747-753 

standard errors and, 
496-497 

two-sided tests and, 564, 
583-586 


Confidence levels, 480-485 


critical values for, 497 
interpretation of, 481-485 
Confidentiality, 271 
Confounding, 236, 251 


Consent, informed, 271 
Continuous random variables 


definition of, 356 

from measuring, 356 

probability distributions of, 
355-358 

vs. discrete random variables, 


356 


definition of, 150 

explanatory variables and, 
156, 187 

formula for, 154-155 

interpreting, 155-157 

limitations of, 186-190 

linear association and, 
150-157, 187-188 


nonsense, 190 


positive/negative association 


and, 151 
properties of, 156 
response variables and, 156, 
187 
scatterplots and, 145-155 
strength of, 146, 151 
units of measure and, 156 
vs. causation, 190-191 
Correlation and Regression 
(applet), 152, 170 
Correlation coefficient, 150 
Counts 
binomial, 443 


in chi-square tests, 681-684, 


701-704 

expected, 680-684, 
701-704, 720-721 

in frequency tables, 7, 
38-39 

in histograms, 38-39 

observed, 681-684 

in two-way tables, 701-704, 
720-721 


Critical values 


for confidence intervals, 486, 


640 
for t-distributions, 512-513, 
640 


Cube root transformation, 769 
Cumulative relative frequency 


graphs, 86-89 
Curves 

bell. See Normal 
distributions 

density, 104-107, 355-358, 
685. See also Density 
curves 

Normal, 108, 109, 115, 
116-117, 356-358 


standard Normal, area under, 


115, 116-117 


D 

Data analysis, 2-6 
categorical, 7-20. See also 

Categorical variables 

definition of, 2 

Data ethics, 270-271 

Data tables. See Tables 

Data transformation. See 


for chi-square distributions, 
685 

definition of, 105 

mean of, 107 

median of, 106-107 


probability distributions and, 


355-358 
skewed, 107 
standard deviation of, 107 
symmetric, 106 
for t-distributions, 512 


Dependent variables, 144 
Difference between means, 


634-651 
confidence intervals for, 
641-643, 648 
sampling distribution of, 
636-639 
significance tests for, 
644-649, 651 
standard error of, 640 
two-sample t procedures for, 
644-651 
two-sample t statistic and, 


639-641, 650-651 


Difference between 


proportions, 612-629 
calculator for, 624 
chi-square test for 
homogeneity for, 710 

confidence intervals for, 
616-619, 623 

inference for experiments 
and, 625-627 

randomization distribution 
of, 627 

sampling distribution of, 
612-615 

significance tests for, 
619-624 

standard error of, 616 

two-sample z interval for, 
616-617 

two-sample < test for, 


621-623, 710 


Difference of random variables 


mean of, 376-377 
variance of, 376-377 


Discrete random variables 


from counting, 356 

definition of, 348 

mean (expected value) of, 
350-352 

probability distributions of, 
348-350, 354 

standard deviation of, 
352-353 

variance of, 352-353, 373 

vs. continuous random 


variables, 356 


binomial, 382-404 

of categorical variables, 7-8, 
11-18, 697-701 

center of. See Center of 
distribution 

chi-square, 684-687, 
704-705, 717-721 

comparing, 29-31 

conditional, 14-18, 320, 
697-701 

cumulative relative 
frequency graphs and, 


density curves and, 104-107 

five-number summary for, 
57-59, 63-64 

geometric, 404-408 

graphing of. See Graph(s) 

marginal, 12-14 

mean of, 49-51. See also 
Mean 

median of, 51-53. See also 
Median 

mode of, 26, 28-29, 35 

multimodal, 29 

Normal, 108-109. See also 
Normal distributions 

outliers in, 26, 56-57. See 
also Outliers 

overall pattern of, 26, 27-29 

percentiles of, 84, 85-86 

pie charts for, 8-9 

population, +28 

probability. See Probability 
distributions 

of quantitative variables, 
25-40 

quartiles of, 54-55, 
57-58 

randomization, 627 

range of, 53-55 

of sample data, 428 

sampling. See Sampling 
distribution 

shape of, 26, 27-29, 35, 108, 
435, 440-445, 451-456, 
478, 613 

skewed, 27-29, 52-53, 107, 
125, 516-517 

spread of, 26, 53-55, 
432-433, 440-445, 
478-479, 613, 637, 740, 
741 

symmetric, 27-29 

t, 510-514 

transforming data and, 
92-98. See also 
Transformation 

uniform, 355-356 

unimodal, 28-29 


Disjoint events, 307-308, 311 
Transformation addition rule for, 308, 312, 
Convenience sample, 211-212. Degrees of freedom, 511-512 330 
See also Sample/sampling for chi-square distributions, vs. independent events, 331 
Correlation, 150-157 685 Distributions, 4, 83-128 
calculation of, 154-155 Density curves, 84, 104-107 bimodal, 29 


z-scores and, 89-91, 97 
Division, by constant, 94, 
95-97, 364-366, 367-368 
Dotplots, 25-27 
shape of, 29 
Double-blind experiments, 248 


Control groups, 246-247 


E 
Effect size, 568 
Empirical (68-95-99.7) rule, 
109-112 
Equal standard deviation, for 
regression inference, 
743-746 
Error 
root mean squared, 746 
roundoff, 8 
standard. See Standard error 
Type I, 547-550 
‘Type IL, 547-548, 565-569 
Estimators 
biased, 431-432 
point, 477 
unbiased, 429-431, 432 
Ethical issues, 270-271 
Events 
complements of, 307-308, 
311, 314 
definition of, 306 
independent, 326-331 
mutually exclusive, 
307-308, 311, 330-331 
in Venn diagrams, 311] 
Expected counts 
in chi-square test, 680-684, 
701-704 
computation of, 701-704 
for multiple comparisons, 
701-704 
in two-way tables, 701-704, 
720-721 
Expected value. See also Mean 
of difference of random 
variables, 376-377 
of discrete random variables, 
350-352 
of sum of random variables, 
370-376 
Experiment(s), 208, 234-257 
bias in, 240-241 
blocking in, 251-257 
causation and, 238, 249, 
268-270 
comparative, 240 
confidentiality in, 271] 
confounding in, 236 
control groups in, 242, 
246-247 
ethical aspects of, 270-271 


types of. See Experimental 
design 
vs. observational studies, 208, 


235-236, 268-269 


Experimental design, 240-244 


analytic methods and, 650 

block, 251-257 

completely randomized, 
244-247 

control and, 242, 246-247 

double-blind, 248 

lack of realism and, 268 

matched pairs, 255 

placebo-controlled, 243-244 

principles of, 242 

problems with, 239-240, 
247-248 

random assignment and, 
241-242, 249-251 

randomized block, 251-257 

randomized comparative, 
240-244 

replication and, 242 

scope of inference and, 
266-268. See also 
Inference 


single-blind, 248 


Experimental units, 237 
Explanatory variables, 143-144, 


236 
causation and, 144 
correlation and, 156, 187 
in experiments, 238 
as factors, 238 
multiple, 244-247 
regression and, 164, 187 


Exponential models, 774-778 
Extrapolation, 168 


Factorial notation, 392 
Factors, in experiments, 238 
False negative, 287 

False positive, 287 

Finite population correction 


(FPC), 445 


First quartile, 54-55 


in five-number summary, 


57-58 


Fisher, Ronald A., 546-547 
Five-number summary, 57-58, 


63-64 


Geometric probability, 404-408 
Geometric random variables, 
387, 404-408 
definition of, 405 
distributions of, 405 
mean of, 408 
standard deviation of, 408 
Geometric settings, 404-406 
German tank problem, 
422-423, 435 
Gosset, William S., 512 
Graphing calculator. See 
Calculator 
Graph(s), 25-40 
bar, 8-11, 16, 17-18, 38 
boxplots, 57-59 
for categorical variables, 
8-11, 16, 17-18 
cumulative relative 
frequency, 86-89 
density curves and, 104-107 
dotplots, 25-27 
histograms, 33-40 
interpreting, 25-27 
Normal probability plots, 
122-125 
outliers in, 26, 56-57, 147, 
188-190. See also 
Outliers 
pie charts, 8-9 
for quantitative variables, 
25-40 
scatterplots, 145-155 
stemplots, 31-33 


H 
Histograms, 33-40 
cautions for, 38-39 
choosing classes for, 
35-36 
constructing, 33-38 
counts and percents in, 
38-39 
definition of, 33 
density curves and, 104-107 
frequency, 34, 35 
frequency tables and, 8 
interpreting, 35 
relative frequency, 34, 35 
relative frequency tables 
and, 8 


shape of distributions in, 35 


Index 1-3 


I 
Independent condition, for 
regression inference, 
743-746 
Independent events, 
326-328 
association and, 328 
multiplication rule for, 
328-331 
vs. mutually exclusive events, 
330-331 
Independent variables, 144, 
371-376 
adding, 370-376 
definition of, 371 
subtracting, 376-378 
Individuals, 2, 4 
Inequality, Chebyshev’s, 112 
Inference, 5-6 
cause and effect and, 
266-268 
for difference between 
means, 640, 648-649, 
650 
for difference between 
proportions, 625-627 
for experiments, 249-251, 
266-268, 625-627, 
648-649 
for linear regression, 
738-758 
for mean, 520, 586-589 
populations and, 266-268 
probability and, 289 
for proportion, 520 
sample size and, 565-569 
for sampling, 223-225, 
266-268 
scope of, 266-268 
for slope of regression line, 
746-747 
Inference problems, four-step 
process for, 65 
Inflection points, 108 
Influential observations, 189 
Informed consent, 271 
Institutional review boards, 
270-271 
Internet polls, 212-213 
Interquartile range (IQR), 
54-55, 64-65 


Intersections, in Venn 


Four-step process, for statistical 
problems, 65 
Frequency histograms, 34, 35 


experimental units in, 237 
inference for, 249-251, 
266-268, 625-627 


vs. bar graphs, 38 
Homogeneity, chi-square test 


for, 703, 707-710 


diagrams, 311] 
Interval estimate, 480. See also 
Confidence intervals 


informed consent for, 271 Frequency tables, 8, 86-87 Hypothesis 

multifactor, 245-247 alternative, 539-540, 618, 

placebos in, 244 G 710 J : 

problems in, 239-240, Galton, Francis, 184 null, 539-541, 543. See also JMP (software), regression 
247-248 Gauss, Carl Friedrich, 109 Null hypothesis output for, 181-182 

random assignment in, General addition rule, one-sided, 540 
244-247 310-311, 312 populations vs. samples and, — L 

statistically significant results General Social Survey (GSS), 541 Large Counts condition, 403 
in, 249 226 two-sided, 540 for chi-square tests, 687 


for confidence interval for 
proportion, 494-496 


subjects in, 237 
terminology for, 237-239 


Hypothesis tests. See 
Significance tests 


Geometric distributions, 


404-408 


1-4 Index 


Large Counts condition (cont.) 
for significance tests, 555, 
621-622 
Large-sample test for 
proportion, 558 
Law of averages, 294-295 
Law of large numbers, 291, 294 
Laws of probability, 224 
Least-squares regression, 
164-192, 736-758. See 
also Regression 
explanatory variables and, 
156, 187 
inference for, 738-758 
influential observations in, 
189 
limitations of, 186-190 
response variables and, 164, 
187 
software output for, 181-182 
transformation for, 765-785 
Least-squares regression line, 
168-172, 739-758. See 
also Regression line 
calculation of, 183-185 
coefficient of determination 
and, 178-181 
influential points and, 189 
population (true), 739 
residuals and, 169-176 
sample (estimated), 
739 
slope of. See Slope of 
regression line 
y intercept for, 166, 746, 749 
Least-squares residuals, 
169-176 
Left-skewed distributions, 
27-28 
Levels, in experiments, 
238 
Linear condition, for regression 
inference, 743-746 
Linear regression. See 
Regression 
Linear relationships, 150-157. 
See also Correlation 
direction of, 146 
form of, 146 
regression lines and, 
164-172, 187-188. See 
also Regression line 
strength of, 147-148, 
150-151 
Linear transformation, 
364-369. See also 
Transformation 
Logarithm transformation, 
771-782 
exponential models and, 
774-778 
power models and, 771-774, 
778-780 
Logarithmic models, 771 
Long-run regularity, 
293 


M 
Margin of error 
in confidence intervals, 487, 
500, 501-503 
definition of, 480 
sample size and, 501-503, 
523-524 
Marginal distributions, 12-14 
Matched pairs design, 255 
Mean, 49-51 
as arithmetic average, +9 
as balance point, 51 
of binomial distribution, 
397-400 
of binomial random variable, 
398-400 
comparison of, 634-651. See 
also Difference between 
means 
definition of, 49 
of density curve, 107 
of difference of random 
variables, 376-377 
of discrete random variable, 
350-352 
of geometric random 
variable, 408 
inference for, 520, 586-589 
linear transformations and, 
368 
as nonresistant measure, 50 
of Normal distribution, 108 
notation for, 49, 107 
outliers and, 50 
population. See Population 
mean 
sample, 49, 450-461. See 
also Sample mean 
of sampling distribution, 
429-432, 443-446, 
451-453, 459, 478, 613, 
637, 740, 741 
of sum of random variables, 
371-376 
vs. median, 52-53, 65 
Mean and Median (applet), 53 
Measurement units, conversion 
of, 92-98 
Median, 51-53 
calculation of, 51—52 
definition of, 51 
of density curve, 106-107 
in five-number summary, 57 
as midpoint, 49 
as resistant measure, 52—53 
vs. mean, 52-53, 65 
Minitab (software) 
quartiles and, 64 
regression output for, 
181-182 
Mnemonics 
BINS, 388 
SOCS, 26 
Mode, 26, 28-29 
Multimodal distributions, 29 
Multiple comparisons, 700-701 


Multiplication, by constant, 
94-97, 364-368 

Multiplication rule, for 
independent events, 
328-331 

Multistage sample, 222 

Mutually exclusive events, 
307-308, 311, 331 

addition rule for, 308, 312, 
330 


vs. independent events, 331] 


N 
Negative association, 146, 
147-148, 151. See also 
Association 
95% confidence interval, 480 
Nonlinear relationships. See 
also Curves 
transformation of, 765-786 
Nonresistant measures 
correlation as, 188-189 
standard deviation as, 62 
Nonresponse, in sample 
surveys, 225 
Nonsense correlation, 190 
Normal/Large Sample 
condition 
for confidence intervals, 640 
for population mean, 
514-515 
for sampling distributions, 
444, 458 
for significance tests, 
575-576 
for two-sample t procedures, 
649 
Normal approximation 
of binomial distribution, 
402-404 
of sample proportion, 443, 
445-446 
Normal Approximation to 
Binomial (applet), 402 
Normal condition 
for regression inference, 
743-746 
Normal curve, 109, 356-358 
Normal Curve (applet), 110, 
116-117 
Normal distributions, 84, 
108-109 
assessing normality, 121-125 
calculating probabilities 
involving, 118-121 
central limit theorem and, 
456-459 
Chebyshev’s inequality and, 
112 
distribution of sample 
proportion and, 444 
mean of, 108 
as probability distributions, 
356-358 
of sample proportion, 
444-446 


shape of, 108 
significance tests and, 556 
68-95-99.7 rule for, 
109-112, 122 
standard, 112-117 
standard deviation of, 
108-109 
usefulness of, 109 
vs. non-Normal distributions, 
109, 121-125. See also 
Skewed distributions 
Normal population, sampling 
from, 453-456 
Normal probability plots, 
122-125 
Normal random variables 
adding, 378-380 
subtracting, 379-381 
Notation 
for binomial coefficient, 392 
for conditional probability, 
319 
factorial, 392 
for mean, 49, 107 
for standard deviation, 61 
for variance, 61 
for Venn diagrams, 311-312 
Null hypothesis, 539-541, 543 
for chi-square test, 681, 710 
definition of, 539 
failure to reject, 547-550 
P-values and, 543-544 
rejection of, 547-550 
significance tests and, 
619-620 
Numerical summaries, 57-58, 


63-64 


oO 
Observational studies, 207-229. 
See also Sample/sampling 
definition of, 235-237 
ethical aspects of, 270-271 
vs. experiments, 208, 
235-236, 268-269 
Observed counts, in chi-square 
tests, 681-684 
One-proportion Z test, 
calculator for, 561 
One-sample t interval, for 
population mean, 518-520 
One-sample t test, 579-583 
calculator for, 582-583 
paired data and, 586-589 
One-sample z interval for 
proportion, +98 
One-sample z test for 
proportion, 557-56] 
One-sided alternative 
hypothesis, 540 
One-sided significance tests, 
562, 564 
confidence intervals and, 
564, 585-586 
One-Variable Statistical 
Calculator (applet), 36 


One-way table, 680 
1.5 X IOR rule, 56, 58 
Opinion polls. See Polls; 
Sample/sampling 
Outliers, 26, 56-57 
definition of, 26 
identifying, 56-57 
1.5 X IOR rule for, 56, 58 
in regression, 188-190 
in scatterplots, 147 
t procedures and, 516 
vs. influential observations, 


189 


P 
P-values, 543-544 
calculation of, 576-579 
from chi-square distributions, 
684-687, 704-705 
definition of, 543 
interpreting, 543-544 
for negative t-values, 
577-578 
statistical significance and, 
545-547 
for t test for slope, 753-754 
test statistic and, 556-557, 
576-578 
for two-sample t test, 
644-645 
for two-sided tests, 562-563 
Paired data, 586-589 
Parameters 
population, 424 
regression, 746-747 
vs. statistics, 424 
Percentiles, 84, 85-86 
in cumulative relative 


frequency graphs, 
—89 


quartiles and, 88 
Percents, in relative frequency 
tables, 8 
Physicians’ Health Study, 
243-244 
Pictographs, 10-11 
Pie charts, 8-9 
Placebos, 244 
Plus four estimate, 500 
Point estimate, 477, 480 
Polls, 210-213. See also 
Sample/sampling 
call-in, 212 
Internet, 212-213 
write-in, 212 
Pooled sample proportion, 
621 
Pooled two-sample t 
procedures, 650-651 
Population(s), 5, 208 
definition of, 210 
mean of. See Population 
mean 
in sample survey, 210 
vs. sample, 210 
Population distribution, +28 


Population mean, 49 


confidence intervals and, 
518-521 

estimation of, 507-526 

Normal/Large Sample 
condition for, 514-515 

one-sample ¢ interval for, 
518-520 

Random condition for, 
514-515 

sampling distribution and, 
510-511 

t-distributions and, 510-514 


Population parameters, 424, 


541 


Population proportion, 424 


confidence interval for, 
496-499, 563-564 
estimation of, 492-504 
one-sample z interval for, 498 
significance tests about, 


554-570 


Population regression line, 739. 


See also Regression line 


Population size, sampling 


variability and, 432-433 


Population variance, 431 
Positive association, 148, 151. 


See also Association 


Power, 565-569, 590-591 


definition of, 565 

sample size and, 647 

‘Type I error and, 568 

‘Type II error and, 565-569 
vs. significance level, 566 


Power models, logarithm 


transformation and, 


771-774, 778-780 


Probability, 286-333 


addition rule for mutually 
exclusive events and, 
308, 312 

basic rules of, 307-311 

binomial, 390-396 

conditional, 318-328 

definition of, 291 

general addition rule for, 
310-311, 312 

geometric, 404-408 

inference and, 289 

law of averages and, 
294-295 

law of large numbers and, 
291, 294 

long-run regularity and, 293 

mutually exclusive events 
and, 307-308, 311 

overview of, 289-292 

randomness and, 292-295 

simulation and, 295-299 

tree diagrams and, 322-326 

two-way tables and, 309-311 


Probability (applet), 290 
Probability distributions, 


348-349. See also Random 
variables 


adding/subtracting constant 
and, 364, 366-368 
of continuous random 
variables, 355-358 
definition of, 348 
density curves and, 355-358 
of discrete random variables, 
348-350 
multiplying/dividing by 
constant and, 364-368 
Normal distributions as, 
356-358 
uniform, 355-356 
Probability laws, 224, 291, 
294-295 
Probability models, 305-307 
Problem solving, four-step 
process for, 65 
Proportions 
difference between, 
616-629. See also 
Difference between 
proportions 
one-sample < test for, 
557-561 
two-sample < test for, 
616-617 
PROVE-IT Study, 610 
pth percentile, 85-86 


Q 
Quantitative variables, 3-4. See 
also Variables 


correlation between, 156. See 


also Correlation 

definition of, 3 

distribution of, 25-40. See 
also Distributions 

relationships between, 
140-203 

vs. categorical variables, 508, 
520, 720 

Quarttiles 

first, 54-55 

in five-number summary, 
57-58 

in interquartile range, 54-55 

percentiles and, 88 


third, 54-55 


R 
? (R-sq; coefficient of 
determination), 178-181 
Random assignment, 241-242, 
249-251 
inference and, 266-268. See 
also Inference 
Random condition 
for chi-square tests, 687, 703 
for confidence intervals, 493, 
494-496, 640 
for population mean, 
514-515 
for regression inference, 


743-746 


Index 1-5 


for significance tests, 555, 
575-576 


Random digits, 216 
Random digits table 


for simple random samples, 
216-217 
for simulations, 296-297 


Random samples, 213-225. See 


also Sample/sampling 

choosing with calculator, 
215-216 

choosing with Table D, 216 

choosing with technology, 
215 

confidence intervals and, 493 

inference and, 266-268. See 
also Inference 

population mean and, 
514-515 

significance tests and, 
555-556 

simple, 213-218 

stratified, 219-220 


Random variables, 344-410 


adding, 364, 366-368, 
370-376, 378-380 
adding/subtracting constant 
and, 364, 366-368 
binomial, 387-404. See also 

under Binomial 
combining, 370-376 
continuous, 355-358 
definition of, 348 
discrete, 348-354, 356, 373 
geometric, 387, 404-408 
independent, 371-376 
linear transformations of, 
364-369 
mean of, 350-352, 371, 
376-377 
multiplying/dividing by 
constant and, 364-368 
Normal, 378-381 
subtracting, 376-381 


Randomization distribution, 


for difference between 
proportions, 627 


Randomized block design, 


251-257 
matched pairs, 255 


Randomized comparative 


experiments, 240-244 


Randomness, 292-295 


law of averages and, 
294-295 

long-run regularity and, 293 

model for, 305-307 

myths about, 293-294 

predictability of, 293-294 


Range, 54 


definition of, 54 
interquartile, 54-55 


Reese’s Pieces® (applet), 441 
Regression 


coefficient of determination 


and, 178-181 


1-6 Index 


Regression (cont.) 
explanatory variables and, 
156, 187 
least-squares, 164-192. 
See also Least-squares 
regression 
limitations of, 186-190 
outliers in, 188-190 
response variables and, 164, 
187 
Regression inference, 738-758 
conditions for, 742-746 
estimating parameters and, 
746-747 
time-series data for, 744 
Regression line, 164-172 
definition of, 164 
equation of, 166 
extrapolation and, 168 
interpreting, 165-167 
least-squares, 168-172. 
See also Least-squares 
regression line 
linear relationships and, 
164-172, 187-188 
predicted value and, 166, 
167-168 
slope of. See Slope of 
regression line 
y intercept for, 166, 746, 749 
Regression output, interpreting, 
181-182 
Regression standard error, 746 
Regression to the mean, 
182-185 
Relative frequency graphs, 
cumulative, 86-89 
Relative frequency tables, 8, 34, 
35, 86-87 
Replication, in experimental 
design, 242 
Residual(s), 169-176 
calculating, 169-172 
definition of, 169 
interpreting, 172-176 
squaring, 170. See also Least- 
squares regression line 
standard deviation of, 
177-178, 180-181 
Residual plots, 172-176 
on calculator, 175-176 
Resistant measures, 50, 52-53, 
55 
Response variables, 143-144, 
236 
correlation and, 156, 187 
regression and, 164, 187 
Rice University Sampling 
Distributions (applet), 
453-454, 456 
Rightskewed distributions, 
27-28 
Root mean squared error, 
746 
Roundoff error, 8 
Row variables, 12 


S 
Sample data, distribution of, 
428 
Sample mean, 49, 450-461. See 
also Mean 
Normal/Large Sample 
condition for, 458 
sampling distribution of. See 
Sampling distribution of 
sample mean 
standard error of, 518 
Sample proportion, 424, 
440-446 
Large Counts condition for, 
444 
Normal approximation for, 
445-446 
Normal distribution of, 
444-446 
pooled (combined), 621 
sampling distribution of. See 
Sampling distribution of 
sample proportion 
10% condition for, 444 
as unbiased estimator, 443 
Sample regression line, 739. 
See also Regression line 
Sample/sampling, 5, 207-229. 
See also Observational 
studies 
bias in, 212, 223, 429-431. 
See also Bias 
binomial distributions and, 
401-404 
cluster, 220-222 
convenience, 2] 1-212 
definition of, 210 
ethical aspects of, 270-271 
inference for, 223-225, 
266-268 
laws of probability and, 224 
margin of error in, +80. See 
also Margin of error 
mean of. See Sample mean 
multistage, 222 
nonresponse in, 225 
populations and, 208, 210. 
See also Population(s) 
problems in, 225-227 
question order in, 227 
question wording in, 227 
random, 213-222, 266-268. 
See also Random 
samples 
sample size and, 223-225, 
432-433 
sampling errors in, 225-227 
selfselected, 212 
unbiased estimates and, 
432-433 
voluntary response, 212, 225 
vs. population, 210 
without replacement, 402 
Sample size, 224 
confidence intervals and, 


501-503, 522-524 


margin of error and, 
501-503, 523-524 
power and, 565-569, 647 
for significance tests, 
565-569, 589-590, 647 
variability and, 432-433 
Sample spaces, 305 
Venn diagrams and, 
311-314 
Sample surveys, 210-211. See 
also Sample/sampling 
multistage, 222 
as observational studies, 208, 
235-236 
sampling errors in, 225-227 
Sample variance, estimating, 
431-432 
Sampling distribution, 20-461 
bias and, 429-432. See also 
Bias 
binomial count and, 443 
center (mean) of, 429-432, 
443-444, 451-453, 459, 
478, 613, 637-638, 740, 
741 
characteristics of, 429-435 
definition of, 427 
of difference between means, 
636-639 
of difference between 
proportions, 612-615 
distribution of sample data 
and, 428 
Normal/Large Sample 
condition for, 458 
overview of, 422-424 
population distribution and, 
428 
shape of, 435, 440-445, 
453-456, 478, 613, 636, 
740, 741 
of slope of regression line, 
740-742 
spread of, 432-433, 
440-445, 478-479, 613, 
637, 740, 741 
standard deviation of, 444, 
451-453, 459, 478-479, 
613, 614-615 
vs. sample data distribution, 
428 
Sampling distribution of sample 
mean, 451-456 
center (mean) of, 451-453, 
459, 478 
central limit theorem and, 
456-459 
for Normal population, 
453-456 
shape of, 453-456, 459, 478 
standard deviation of, 
451-453, 459, 478-479 
Sampling distribution of sample 
proportion, 440-445 
center (mean) of, 440-445 


Normal approximation and, 
444-446 
shape of, 440-445 
spread of, 440-445 
standard deviation of, 444 
10% condition and, 444 
Sampling variability, 425-428 
bias and, 434-435 
definition of, 425 
Satterthwaite, F. E., 634 
Scatterplots, 145-155 
calculators for, 150, 155 
constructing, 145-146 
correlation and, 150-157. 
See also Correlation 
definition of, 145 
describing, 146-149 
direction of relationship and, 
146-149 
form in, 146 
linearizing curved patterns 
in. See Transformation 
outliers and, 147 
patterns in, 146-147 
positive/negative associations 
and, 146, 148, 151 
regression lines and, 
164-172, 738. See also 
Regression line 
strength of relationship and, 
146, 151 
Segmented bar graphs, 17 
Self-selected samples, 212 
Shape of distributions, 26, 
27-29, 35, 108-109, 435, 
441-443, 453-456, 478, 
613 
Significance level, 545-546, 
589 
definition of, 566 
‘Type I and II errors and, 
565-569 
vs. power, 566 
Significance tests, 536-607 
about population mean, 
575-579 
about population proportion, 
554-570 
alternative hypothesis and, 
539-540 
carrying out, 554-557, 
575-579 
chi-square. See Chi-square 
test 
confidence intervals and, 
563-564, 583-586 
definition of, 539 
for difference between 
means, 644-649 
for difference between 
proportions, 619-624 
four-step process for, 558 
guidelines for, 589-593 
Large Counts condition for, 


555, 621-622 


Normal/Large Sample 
condition for, 575-576 
null hypothesis and, 539-541 
one-sample, 579-583 
one-sided, 562, 585-586 
P-value for, 543-547, 
562-563, 576-578 
pitfalls in, 589-593 
power of, 565-569, 647 
purpose of, 560 
Random condition for, 555, 
575-576 
reasoning of, 538-539, 
541-543 
sample size for, 565-569, 
589-590, 647 
for slope of regression line, 
753-757 
statistical significance and, 
544-547. See also 
Significance level 
test statistic in, 556-557, 576 
two-sided, 562-564, 
583-586 
Type | and II errors and, 
547-550 
Simple random sample (SRS), 
213-218. See also Sample/ 
sampling 
Simulations, 295-299 
Single-blind experiments, 248 
68-95-99.7 rule, 109-112, 122 
Size. See Sample size 
Skewed distributions, 27-29, 
516-517 
density curves and, 107 
left- vs. right-skewed, 125 
mean vs. median for, 52-53 
Normal probability plot for, 
125 
Slope of regression line, 166, 
739-758 
calculator for, 756-757 
center of distribution for, 
740, 741 
confidence interval for, 
747-753 
inference about, 746-747 
P-value for, 753-754 
sampling distribution of, 
740-742 
significance test for, 
753-757 
standard error of, 746 
t interval for, 747-749 
t test for, 753-754 
SOCS mnemonic, 26 
Solving problems, four-step 
process for, 65 
Splitting stems, 32 
Spread, 26, 53-55 
in five-number summary, 57 
interquartile range and, 
53-55, 64-65 


measures of, 53-55 


of sampling distribution, 
432-433, 441-443, 
478-479, 613, 637, 740, 
741 
standard deviation and, 
60-62, 64-65 
Standard deviation, 60-62, 
372-373 
of binomial distributions, 
397-400 
of binomial random 
variables, 398-400 
calculation of, 62 
definition of, 61 
of density curve, 107 
of difference of random 
variables, 377-378 
of discrete random variable, 
352-353 
estimated. See Standard error 
of geometric random 
variable, 408 
linear transformations and, 
368 
as nonresistant measure, 62 
of Normal distribution, 
108-109 
notation for, 61 
regression line and, 746 
of residuals, 177-178, 
180-181 
of sampling distribution, 444, 
451-453, 459, 478-479, 
613, 614-615, 637-638 
spread and, 60-62, 64-65 
of sum of random variables, 
371-376 
usefulness of, 62, 64 
variance and, 60-61. See also 
Variance 
z-score and, 90 
Standard error 
confidence interval and, 
496-497 
definition of, 496-497 
of difference between means, 
640 
of difference between 
proportions, 616 
regression, 746 
of sample mean, 518 
of slope, 746 
Standard Normal curve, area 
under, 115, 116-117 
Standard Normal distributions, 
112-117 
Standard Normal tables, 
113-115 
Standardized values, 89-91 
Standardizing, 89-91 
State-plan-do-conclude 
process, 65 
Statistic 
definition of, 424 
variability of, 432-433 


vs. parameter, 424 


Statistical Power (applet), 
590-591 
Statistical problems, four-step 
process for, 65 
Statistical significance, 
544-547. See also 
Significance level 
multiple analyses and, 593 
practical importance of, 592 
sample size and, 589-591 
Statistically significant 
association, 249 
Statistically significant at level 
a, 545 
Statistics, definition of, 2 
Stemplots, 31-33 
Strata, 219-220 
vs. clusters, 221] 
Stratified random sample, 
219-220 
Student’s ¢-distribution, 512 
Study design, 207-273. See 
also Experimental design; 
Sample/sampling 
Subjects, experimental, 237 
confidentiality for, 271 
informed consent of, 271 
Subtraction 
of constant, 93-97, 364, 
366-368 
of random variables, 376-381 
Sum of random variables, mean 
of, 371-376 
Survey, sample, 210-211. See 
also Sample/sampling 
Symmetry, 27-29 


T 
t distribution, 510-514 
critical values for, 512-513, 
640 
degrees of freedom and, 
511-512 
Student’s, 512 
t interval 
one-sample, 518-522 
for population mean, 
518-522 
for slope of regression line, 
747-749 
two-sample, 641-643 
t procedures 
one-sample, 510-514, 
579-583, 586-589 
outliers and, 516 
paired, 586-589 
sample size and, 522-524 
t statistic, two-sample, 639-641, 
644, 650-651 
t test 
one-sample, 579-583, 
586-589 
paired data and, 586-589 
for slope, 753-754 
t values, negative, 577-578 
Tables, 3-4 


Index 1-7 


frequency, 8, 86-87 
one-way, 680 
random digit, 216-217, 
296-297 
relative frequency, 8, 34, 35, 
86-87 
standard Normal (‘Table A), 
113-115 
Table B, 513, 577-578, 641 
Table C, 685-686 
Table D, 216 
two-way. See ‘Two-way tables 
Technology. See Applets; 
Calculator 
10% condition, 401-402, 444 
for regression inference, 743 
‘Test of Significance (applet), 
538 
Test statistic, 556-557, 576 
calculation of, 576-578, 588 
calculator for, 561 
P-value and, 556-557 
z, distribution of, 627 
Theorem, Bayes’s, 323 
Third quartile, 54-55 
in five-number summary, 
57-58 
‘Time-series data, for regression 
inference, 744 
Transformation, 92-98, 
765-785 
to achieve linearity, 765-785 
adding/subtracting constant 
and, 93-97 
calculator for, 781-782 
definition of, 767 
exponential models and, 
774-778 
linear, 364-369 
with logarithms, 771-782 
model selection for, 
778-780 
multiplying/dividing by 
constant and, 94-97 
power models and, 767-774, 
778-780 
z-scores and, 97 
Treatment, 237 
‘Tree diagrams, 322-326 
‘True regression line, 739 
Truncated data, 32 
‘Two-sample t interval, 641-643 
‘Two-sample t procedures 
calculator for, 647-648 
for difference between 
means, 644-649 
guidelines for, 650-651 
Normal condition for, 649 
pooled, 650-651 
sample size for, 647 
‘Two-sample t statistic, 639-641, 
644 
pooled, 650-651 
‘Two-sample z interval, for 
difference between 
proportions, 616-617 


1-8 Index 


‘Two-sample z statistic, 
639-641 
chi-square statistic and, 710 
‘Two-sample z test 
for comparing proportions, 
710, 719-720 
for difference between 
proportions, 621-623 
Two-sided alternative 
hypothesis, 539-540 
‘Two-sided significance tests, 
562-564 
confidence intervals and, 
564, 583-586 
‘Two-way tables, 11-12, 15-16, 
309-311 
calculator for, 706-707 
chi-square tests for, 697-704, 
711-713 
collapsing, 721 
expected counts for, 
701-704, 720-721 
‘Type I errors, 547-550 
‘Type II errors, 547-548, 
565-569 


U 

Unbiased estimator, 429-431, 
432. See also Bias 

Uniform distributions, 355-356 

Unimodal distributions, 28-29 


Unions, in Venn diagrams, 311] 
Units of measure, conversion 


of, 92-98 


Vv 
Variability 
sampling, 425-428, 434-435 
of statistic, 432-433 
Variables, 2—4 
association between, 17-19. 


See also Association 
categorical, 3-4, 508, 520, 


697-701, 711-713, 720. 


See also Categorical 
variables 

column, 12 

confounding, 236, 251 

definition of, 2 

dependent, 144 

distribution of. See 
Distributions 

explanatory, 143-144, 156, 
187, 236, 238 

independent, 144 

negatively associated, 146, 
148, 151 

positively associated, 148, 
151 

quantitative, 3-4, 508. 
See also Quantitative 
variables 


random, 344-410. See also 
Random variables 
response, 143-144, 156, 164, 
187, 236 
row, 12 
standardizing, 184-185 
Variance. See also Standard 
deviation 
adding, 373 
of binomial random 
variables, 398-400 
definition of, 61 
of difference of independent 
random variables, 
376-377 
of discrete random variables, 
352-353 
equal vs. unequal, 650 
of Normal random variables, 
378-380 
notation for, 61 
population, 431 
sample, 431-432 
of sum of independent 
random variables, 373 
Venn diagrams, 311-314 
intersections in, 311 
notation for, 311-312 
unions in, 311 
Voluntary response sample, 
212, 225 


Ww 

Welch, B.L., 634 

Wilson, Edwin Bidwell, 500 
Write-in polls, 212 


xX 


x-bar. See Mean; Sample mean 


Y 
y intercept for regression line, 
166, 746 
confidence interval for, 749 


Zz 
z interval for difference 
between proportions, 
616-617 
z scores, 89-98 
for standard Normal curve, 
116-117 
transforming data and, 97. 
See also Transformation 
z statistic 
distribution of, 627 
two-sample, 639-641 
z test 
one-sample, 557-561 
two-sample, 621-623 


TECHNOLOGY CORNERS REFERENCE 


TI-Nspire instructions in Appendix B; HP Prime instructions at www.whfreeman.com/tps5e | 


1. Analyzing two-way tables page 16 

2. Histograms on the calculator page 36 

3. Making calculator boxplots page 59 

4. Computing numerical summaries with technology page 63 

5. From z-scores to areas, and vice versa page 116 

6. Normal probability plots page 125 

7. Scatterplots on the calculator page 150 

8. Least-squares regression lines on the calculator page 171 

9. Residual plots on the calculator page 175 
10. Choosing an SRS page 215 
11. Analyzing random variables on the calculator page 354 
12. Binomial coefficients on the calculator page 392 
13. Binomial probability on the calculator page 394 
14. Geometric probability on the calculator page 406 
15. Confidence interval for a population proportion page 501 
16. Inverse tf on the calculator page 513 
17. One-sample f intervals for jz on the calculator page 521 
18. One-proportion z test on the calculator page 561 
19. Computing P-values from f distributions on the calculator page 578 
20. One-sample ¢ test for a mean on the calculator page 582 
21. Confidence interval for a difference in proportions page 618 
22. Significance test for a difference in proportions page 624 
23. Two-sample tf intervals on the calculator page 643 
24. Two-sample t tests on the calculator page 647 
25. Finding P-values for chi-square tests on the calculator page 686 
26. Chi-square tests for goodness of fit on the calculator page 689 
27. Chi-square tests for two-way tables on the calculator page 706 
28. Confidence interval for slope on the calculator page 751 
29. Significance test for slope on the calculator page 756 
30. Transforming to achieve linearity on the calculator page 781 


Inference Summ 


ary 


Confidence intervals (Cls) 


STATE: What parameter do you want to estimate, and at what 
confidence level? 

PLAN: Choose the appropriate inference method. Check conditions. 

DO: If the conditions are met, perform calculations. 


CONCLUDE: | /nterpret your interval in the context of the problem. 


Significance tests 
What hypotheses do you want to test, and at what significance 
level? Define any parameters you use. 
Choose the appropriate inference method. Check conditions. 
If the conditions are met, perform calculations. 


* Compute the test statistic. 
¢ Find the P-value. 


Make a decision about the hypotheses in the context of the problem. 


Cl: statistic + (critical value)-(standard deviation of statistic) 


statistic — parameter 
standard deviation of statistic 


Standardized test statistic 


One-sample Z interval for 
(1-PropZInt) 


~ 2) 
+ 7* 
paz ; 


Interval 


Random Data from a random sample or randomized 
p experiment 
© 10%: n = 0.10N if sampling without 
replacement 


Large Counts At least 10 successes and failures; that 
is, np = 10 and n(1 — p) = 10 


One-sample z test for p 
(1-PropZTest) 


Test _ P= Po 


7 | Po(1 — Po) 
n 


Random Data from a random sample or randomized 
experiment 
© 10%: n =< 0.10N if sampling without 
replacement 
Large Counts np, = 10 and n(1 — po) = 10 


Random Data from independent random samples or 
randomized experiment 


Proportions Two-sample z interval for p; — p> ° 10%: n, = 0.10M and n, = 0.104, if 
(2-PropZInt) sampling without replacement 
Intatvel = = - 7 Large Counts At least 10 successes and failures in 
(6, — B) + nfo — Pi), Poll ~ Pe)) both samples/groups; that is, 
a 2 mp, = 10, m(1 — pr) = 10, 
Nop> = 10, m(1 — pr) = 10 
: Two-sample Z test for p; — pp Random Data from independent random samples or 
(2-PropZTest) randomized experiment 
; —f)—0 fe) ed ns cane and n = 0.10N, if 
x x x x sampling without replacement 
Pcl — Dc) Pl — po) 
Test “ ‘ ; “4 - “ Large Counts At least 10 successes and failures in 
1 2 . ; 
where both samplesiavOUns that is, - 
.  totalsuccesses  X, + Xp mp; = 10, m(1 — p;) = 10, 


c 


~ totalsample size 7, + nm Mop» = 10, (1 — Pr) = 10 


One-sample t interval for ;. 


Random Data from a random sample or randomized experiment 


© 10%: n= 0.10N if sampling without replacement 


interval (TInterval) 
nterval Normal/Large Sample Population distribution Normal or large 
xt Sx with df =n—1 sample (n = 30); no strong skewness or outliers if m < 30 and 
1 Vn population distribution has unknown shape 
(or paired ' 2 
data) One-sample t test for ju Random Data from a random sample or randomized experiment 
(T-Test) ° 10%: n = 0.10N if sampling without replacement 
Test t X — [lo a a Normal/Large Sample Population distribution Normal or large 
Sy sample (n = 30); no strong skewness or outliers if 7 < 30 and 
Vn population distribution has unknown shape 
Two-sample t interval for pry — pio Random = from independent random samples or randomized 
Means (2-SampTint) experimen 
5 0 2 10%: n, = 0.10N, and mn = 0.10N, if sampling without 
Interval (X, —X>) + | al 4. 82 replacement 
moon 
: . Normal/Large Sample Population distributions Normal or large 
df om technology or samples (n, = 30 and n, = 30); no strong skewness or outliers if 
min(m — 1, m% — 1) sample size < 30 and population distribution has unknown shape 
2 
Two-sample t test for ju; — pu2 Random Data from independent random samples or randomized 
(2-SampTTest) experiment 
(X1 — Xo) — (ta — fe) © 10%: n, < 0.10N, and nm = 0.10%) if sampling without 
Test 3} % 33 replacement 
nm th Normal/Large Sample Population distributions Normal or large 
df from technology or samples (n, = 30 and n, = 30); no strong skewness or outliers if 
min(n, — 1, % — 1) sample size < 30 and population distribution has unknown shape 
Chi-square test for goodness of fit Random Data from a random sample or randomized experiment 
(x°GOF-Test) © 10%: n = 0.10N if sampling without replacement 
1 Test ve (observed — expected)? Large Counts All expected counts at least 5 
Distribution apenled 
of with df = number of categories — 1 
categorical Chi-square test for homogeneity Random Data from independent random samples or random- 
variables (x°-Test) ized experiment 
2ormore | Test P (observed — expected)? 2 10%: m = 0.10N, Mm = 0.10Nb, and so on if sampling 
a expected without replacement 
df = (# of rows — 1)(# of columns — 1) Large Counts All expected counts at least 5 
Chi-square test for independence Random Data from a random sample or randomized experiment 
Relationship (x?-Test) © 10%: n = 0.10N if sampling without replacement 
penwnelé 1 Test 2 Large Counts All expected counts at least 5 
categorical v _ S (observed — expected) arge Counts All expected counts at leas 
variables expected 
df = (# of rows — 1)(# of columns — 1) 
One-sample finterval for 3 Linear Relationship between the variables is linear 
(LinRegTInt) Independent observations; check the 10% condition if sam- 
Relationship Interval b+ t*(SE,) pling without replacement. 
between 2 with df = n— 2 Normal Responses vary Normally around regression line for all 
uantitative 1 -val 
ss riables One-sample ftest for 3 ae 
é Equal SD around regression line for all x-values 
(slope) (LinRegTTest) 
Test Random Data from a random sample or randomized experiment 
b- fo. 
t with df = n—2 
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for the AP® Exam 


FIFTH EDITION 


Written to help you realize success on the AP® Statistics Exam and in your statistics 
course, Ihe Practice of Statistics for the AP® Exam, Fifth Edition, provides the 
built-in support you want and need. 


Clear explanations that adhere to the language and nomenclature used on the 
AP® Statistics Exam. 


Paired Examples and Exercises now enhanced with new video tutorials, featuring 
experienced AP® Statistics teachers, give you help when you need it most. 


AP® Exam Tips offer “insider information” to help you develop the habits necessary 
for success. 


Chapter AP® and Cumulative AP®-style Tests offer unparalleled, integrated support 
to practice, practice, and practice for the AP® Statistics Exam. 


Graphing Calculator instruction with new video tutorials for the TI-83/84, 
TI-89, TI-Nspire, and HP-Prime offer step-by-step instruction. 


Benefit from additional help in the most convenient way: 


New e-Books for students and teachers offer the accessibility you want and the 
flexibility you expect with video tutorials and other support accessed through a 
simple click. 


Strive for a 5 Guide preparing for the AP® Statistics Examination by 
Jason Molesky and Michael Legacy combines a study guide with an AP® prep 
guide and is the perfect companion to the text. (ISBN: 1-4641-5400-7) 


Standalone, 1-use e-Book (ISBN: 1-4641-5382-5) 

Package of 1-use e-Book and printed text: (ISBN: 1-4641-8894-7) 

Package of 6-use e-Book and printed text: (ISBN: 1-4641-7078-9) 

Package of 6-use e-Book, printed text and STRIVE GUIDE (ISBN: 1-4641-7077-0) 


Learn more: www.whfreeman.com/tps5e 
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